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Editors'  note 

This  special  issue  of  Pramana  on  "Nonlinearity  and  chaos  in  the  physical 
sciences"  differs  in  some  ways  from  other  recent  special  issues  we  have 
published. 

First,  it  is  not  the  proceedings  of  a  conference  or  a  meeting  but  has  resulted 
from  a  "virtual  conference"  involving  the  members  of  the  Indian  community 
working  in  the  area  and  their  friends  and  collaborators.  The  response  has 
been  greater  than  we  anticipated  and  the  credit  is  due  to  the  guest  editors, 
Profs  R  Ramaswamy  and  K  R  Sreenivasan,  the  contributors,  and  of  course 
the  vigorous  state  of  the  subject  itself. 

A  second  difference  is  that  Pramana  is  not  bringing  out  a  separate  supple- 
ment this  time,  but  presenting  the  material  as  the  January  and  February 
issues.  While  this  split  has  to  be  made  for  practical  reasons,  the  two  parts 
should  be  regarded  as  one  unit  and  we  are  indeed  reproducing  this  note,  the 
guest  editors'  preface,  and  the  full  table  of  contents  in  both  the  parts.  Given 
the  amount  of  work  involved  in  proof-reading  and  copy  editing  (even  with 
many  of  the  manuscripts  in  electronic  form)  it  would  have  been  impractical  to 
bring  out  separate  January  and  February  issues  as  well.  The  extra  load  has  in 
fact  meant  a  delay  in  the  appearance  of  this  two-part  special  issue  which  we 
sincerely  regret.  We  hope  to  be  better  prepared  when  such  occasions  come 
up  in  the  future. 

This  special  issue  reflects  the  current  active  state  of  the  subject  of  nonlinear 
phenomena  in  physics  and  allied  disciplines  in  the  country.  We  hope  that  it 
will  be  read  with  great  interest  by  followers  and  practitioners  of  this  rapidly 
expanding  area  and  stimulate  further  work.  We  also  hope  that  many  of  the 
authors  who  contributed  to  this  special  issue  will  submit  some  of  their  work 
to  Pramana-  J.  Phys.  in  future.  And  finally,  there  is  no  reason  why  other 
areas  in  which  there  is  a  strong  community  cannot  follow  the  example 
set  here. 

It  is  a  pleasure  to  thank  Profs  K  R  Sreenivasan  and  R  Ramaswamy  for  their 
efforts  which  have  helped  this  journal  bring  out  an  extended  special  issue 
devoted  to  an  area  of  great  current  interest  to  physicists  all  over  the  world. 


R  Nityananda 
H  R  Krishnamurthy 


Preface 


One  of  the  most  important  conceptual  advances  of  the  modern  era  has  been  the 
realization   that   the   temporal   evolution   of  many    nonlinear    systems, 
independent  of  their  physical  origin,  is  profoundly  similar.  The  similarity  is 
dynamical  and  quantitative,  not  merely  statistical  and  descriptive.  The  subject 
that  cultivates  and  explores — to  some  degree  or  another — this  universality  of  nonlinear 
problems  is  called  nonlinear  science,  or  nonlinear  dynamics,  or  chaos,  depending  to  some 
extent  on  the  orientation  of  the  researcher.  Initial  progress  in  the  area  was  rapid  and 
simultaneously  occurred  in  several  disciplines  in  the  natural  and  mathematical  sciences. 
As  we  have  begun  to  appreciate  better  the  richness  of  nonlinear  systems,  the  challenges 
have  proliferated  as  well. 

From  the  time  of  inception  of  this  multidisciplinary  subject,  several  groups  of  scientists 
in  India  have  contributed  to  it.  These  groups  were  initially  scattered  with  no  serious 
contact  among  them.  In  1987,  a  summer  school  on  the  subject  was  organized  in 
Bangalore,  under  the  auspices  of  the  Department  of  Science  and  Technology,  with  the 
intention  of  providing  an  impetus  for  further  research  and  for  developing  a  sense  of 
community  within  the  country.  Many  bright  and  young  scientists  participated  with 
excitement  and  eagerness.  Some  of  the  seeds  sown  at  the  time  have  now  matured,  and  the 
nonlinear  science  community  seems  to  have  established  an  identity  of  its  own. 

It  was  felt,  ten  years  later,  that  a  collection — in  a  single  place — of  the  current  work  in 
the  field  would  spur  further  synergism  and  collaborations.  Thus  came  the  inspiration  for 
this  special  volume.  The  scope  was  restricted,  by  and  large,  to  the  physical  sciences. 
Various  researchers  active  in  the  field  within  the  country  (and  some  living  abroad  with 
strong  ties  to  local  activities)  were  invited  to  contribute  to  the  volume.  Given  the  special 
nature  of  the  volume,  they  were  encouraged  to  include  in  their  contributions  both  review 
elements  and  new  results.  The  enthusiasm  with  which  the  proposal  was  greeted  by  them 
is  a  testimony  to  both  the  interest  that  the  field  commands  and  the  way  in  which  the  idea 
resonated  with  their  own  thinking.  The  articles  were  all  subjected  to  peer  review.  While 
the  papers  are  representative  of  the  authors'  specific  interests  and  individually  provide  a 
glimpse  of  their  own  area  of  inquiry,  it  is  our  belief  that  the  collection  provides  a 
comprehensive  account  of  the  work  occurring  in  the  field  as  a  whole. 

Given  the  vast  scope  of  the  work  that  this  volume  represents,  it  is  difficult  to 
pigeonhole  it  into  a  few  neat  categories  but  an  attempt — even  if  imperfect — seems 
worthwhile.  The  forty  two  contributions  have  been  grouped  in  six  sections  as  follows, 

A  mathematical  tool  that  is  much  used  in  nonlinear  science,  especially  in  quantum 
chaos  and  increasingly  in  other  sub-fields  of  dynamical  systems  as  well,  is  the  theory  of 
random  matrices.  Mehta's  article,  which  begins  this  volume  as  well  as  Section  I  on 
mathematical  aspects,  is  a  comprehensive  survey  and  summary  of  the  most  important 
results  in  this  area.  Other  articles  in  the  section  by  Balakrishnan,  Nicolis  and  Nicolis,  by 
Nandakumaran,  and  by  Sinha  and  Das  deal  with  iterated  unimodal  maps  which  have  been 
important  to  the  mathematical  understanding  of  chaotic  dynamics.  Kolwankar  and 
Gangal  discuss  the  connection  between  fractional  differentiability  and  the  fractal 
dimension  of  curves;  this  topic  is  of  considerable  interest  because  of  the  ubiquitous 


generic  features  form  the  focus  of  the  article  by  Valsakumar,  Satyanarayana  and  Sridhar, 
who  are  concerned  with  signal  processing  and  data  analysis.  Finally,  Rangarajan  deals 
with  a  new  technique  for  integrating  the  equations  of  motion  of  a  Hamiltonian  system 
while  explicitly  respecting  its  symplectic  property. 

Dynamical  systems  that  are  exactly  integrable — and  therefore  nonchaotic — have  been 
studied  in  great  detail  well  before  the  spurt  of  activity  in  chaos  theory.  Integrable  systems 
are  constrained  by  independent  conserved  quantities,  and  support  coherent  solutions  such 
as  solitons.  The  properties  of  solitons  and  their  many  applications  form  the  focus  of 
Section  n  which  contains  articles  by  Porsezian  on  the  Painleve  property,  by  Lakshmanan 
and  Radha  on  solitons  in  higher  dimensions,  and  by  Balakrishnan  on  the  geometric 
aspects  of  nonlinear  evolution  equations.  An  area  where  the  concepts  of  solitons  has  seen 
much  application  is  that  of  particle  physics  and  field  theory,  and  the  article  by  Sriram  and 
Segar  discusses  some  of  these  issues  in  the  context  of  nonlinear  chiral  models. 

An  aspect  of  chaotic  systems  that  has  been  at  the  forefront  of  recent  research  is  their 
control.  Several  groups  in  the  country  are  involved  in  devising,  implementing  and 
studying  control  methods.  Articles  in  this  area  are  grouped  in  Section  3.  The  papers  by 
Gadre  and  Varma,  Sharma  and  Gupte,  and  Rajasekhar  are  representative  of  the  work  in 
this  area.  Related  concerns  deal  with  extended  dynamical  systems,  namely  coupled  lattice 
systems,  where  spatiotemporal  chaos  is  of  interest.  Articles  by  Sinha,  by  Parekh,  Ravi 
Kumar  and  Kulkarni,  and  by  Roy  and  Amritkar  are  on  the  latter  topic. 

While  temporal  chaos  has  unearthed  exceedingly  interesting  behaviours  of  nonlinear 
systems,  their  connections  to  fluid  turbulence  are  still  tenuous.  In  the  background 
however,  there  exists  the  hope  that  the  knowledge  one  acquires  from  simpler 
spatiotemporal  systems  and  the  methods  developed  to  probe  them  will  enhance  our 
ability  to  understand  the  richness  of  fully  developed  turbulence.  Section  W  contains  two 
articles  that  deal  with  aspects  of  turbulence  from  a  relatively  modern  perspective:  that  by 
Bhattacharjee  deals  with  statistical  mechanical  aspects  of  the  randomly  stirred  fluid, 
while  the  paper  by  Dhar,  Sain,  Pande  and  Pandit  deals  with  developments  in  the  scaling 
phenomenology,  and  emphasizes  the  lessons  learnt  from  the  GOY  model. 

The  field  of  quantum  chaos  attempts  to  understand  the  difference  in  behaviour  between 
systems  that  are  (in  some  appropriate  limit)  classically  chaotic  and  those  that  are 
classically  integrable.  There  are,  in  contrast  to  classical  chaos,  a  multitude  of  definitions 
of  what  constitutes  quantum  chaotic  behaviour.  One  very  successful  classification  of 
systems  is  based  on  the  different  universality  classes  of  random  matrices.  Papers  in 
Section  V  have  analysed  problems  in  quantum  chaos  from  a  variety  of  points  of  view. 
Ganesan  and  Gebarowski  look  at  chaos  in  the  hydrogen  atom  in  different  external  fields, 
while  Santhanam,  Sheorey  and  Lakshminarayan  study  the  morphology  and  characteristics 
of  eigenstates  of  quartic  oscillators;  the  paper  by  Mahapatra,  Sathyamurthy  and 
Ramaswamy  analyzes  chaos  and  resonances  in  a  molecular  collision  system,  while 
Kudrolli  and  Sridhar  report  on  results  of  an  experiment  to  study  quantum  chaos  in  a 
microwave  cavity.  Pal  examines  chaos  in  nuclear  motion  and  its  role  in  the  collective 
dynamics,  using  results  from  a  quantum  chaotic  analysis  in  a  classical  model  for 
dissipation  in  nuclear  fission.  Periodic  orbits  have  played  a  major  role  in  semiclassical 
theories,  and  the  papers  by  Biswas,  by  Lakshminarayan,  by  Gaspard  and  Jain  and  by 
Khare  deal  with  different  aspects  of  trace  formulae.  The  similarity  between  the 


ine  rinai  section,  broadly  ciassmea  as  Applications,  snows  wny  cnaos  nas  enjoyea 
universal  appeal:  rarely,  it  seemed,  had  the  methods  developed  in  modern  mathematics 
found  such  an  immediate  relevance  in  real-life,  although  one  often  had  to  look  for  the 
connections  rather  meticulously:  Chaos  is  in  the  details! 

Mehra,  Nayak  and  Ramaswamy  apply  the  methods  of  nonlinear  dynamics  to 
understand  the  mesoscopic  behavior  of  clusters  of  a  small  number  of  atoms. 
Lakshmibala,  Bambah,  Sriram  and  Mukku  review  studies  of  dynamical  chaos  in 
classical  gauge  theories  using  both  analytic  as  well  as  numerical  techniques.  Arnbika 
discusses  the  complex  behaviour  of  Josephson  junctions,  such  as  its  phase  locked  and 
chaotic  states,  scaling  near  basin  boundaries,  and  so  forth.  The  classical  chaos  elements 
of  collective  thermalization  of  the  quark-gluon  problem  are  discussed  by  Sengupta,  Bhatt, 
Kaw  and  Parikh.  The  paper  by  Kaw  and  Sen,  which  deals  with  the  nonlinear  propagation 
of  electromagnetic  waves  in  plasmas,  is  concerned  with  applications  to  a  variety  of 
current  experiments  in  laser  fusion  and  plasma  based  accelerators.  Olinger,  Chhabra  and 
Sreenivasan  describe  experimental  results  in  the  wake  of  a  transversely  oscillating 
cylinder  in  a  fluid  flow,  which,  they  demonstrate,  belongs  to  the  same  universality  class  as 
circle  maps  with  cubic  nonlinearity.  The  low-dimensional  chaotic  behavior  of  the  stick- 
slip  phenomenon  common  in  plastic  instability  of  flowing  materials  is  discussed  by 
Noronha,  Ananthakrishna,  Quaouire  and  Fressengeas.  Goswami  dwells  on  the  limits  to 
predictability  of  Indian  monsoons  due  to  the  phase  and  amplitude  irregularities  of 
interseasonal  oscillations  in  a  dynamic  model.  Puri  discusses  phase  ordering  of  random 
magnets  and  phase  separation  in  binary  fluids,  while  Das  treats  the  nonlinear  dynamics  of 
fluctuations  and  relaxation  times  in  supercooled  liquids.  Finally,  Bhattacharjee,  Mehta 
and  Luck  discuss  the  role  of  infrared  divergences  in  a  perturbation  theory  of  noisy 
coupled  maps. 

As  this  summary  suggests,  the  richness  and  variety  of  the  subject  and  its  practice  in  the 
country,  which  this  collection  reflects,  is  immense.  The  subject  brings  together,  in  a 
unique  way,  a  unification  of  mathematics,  fundamental  physics  in  several  of  its  branches, 
as  well  as  applications.  We  have  greatly  enjoyed  putting  the  volume  together,  and  hope 
that  you,  the  reader,  will  share  our  sense  of  discovery  and  delight. 

However,  none  of  this  could  have  happened  without  the  support  and  cooperation  of  the 
nonlinear  science  community.  As  authors  and  referees,  they  have  played  the  vital  and 
crucial  role  in  giving  this  volume  its  present  shape  and  they  deserve  our  warmest  thanks. 

Ramakrishna  Ramaswamy,  Jawaharlal  Nehru  University,  New  Delhi. 
Katepalli  R.  Sreenivasan,  Yale  University,  New  Haven. 
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Random  matrices  and  matrix  models:  The  JNU  lectures 
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Permanent  Address:  Centre  d'Etudes  Nucleaires  de  Saclay,  91191  Gif-sur-Yvette  Cedex,  France 

Abstract  A  course  of  lectures  was  given  at  the  Jawaharlal  Nehru  University  and  the  Jamia  Milia 
Islamia,  New  Delhi,  during  February-March  1996.  The  following  notes  were  distributed  to  the 
audience  before  each  lecture.  These  notes,  which  are  sketchy  and  do  not  go  in  details,  were  meant 
to  help  students  follow  the  standard  literature  on  the  subject.  They  are  collected  here  (with  the 
exercises!)  in  the  hope  that  they  might  prove  useful  to  a  larger  community  of  research  workers. 

t?P  Keywords.    Random  matrices;  matrix  ensembles;  statistical  properties  of  eigenvalue  spectra. 

PACSNos    02.10;  5.45;  5.90 

1.  Introduction 

Before  one  can  consider  the  problem  of  "random  matrices"  one  has  to  know  a  few 
preliminary  things. 

•  Numbers:  A  set  in  which  addition,  subtraction,  multiplication  and  division  can  be 
performed  with  the  usual  rules  of  arithmetic,  namely 

(a)  addition    is    commutative   and   associative:    x  +  y  =  y  +  x,    x  +  (y-\-z}  = 
\  (x  +  y)  +  z\ 

•*  (b)  multiplication  is  associative:  x(yz)  =  (xy)z\ 

(c)  existence  of  a  zero  and  of  a  unit:  x  +  0  =  0  +  x  =  x;  1.x  =  x.l  —  x\ 

(d)  multiplication  is  distributive  over  addition:   x(y  +  z)  =  xy  +  xz,   (y  +  z}x  = 
yx  +  zx\ 

(e)  existence  of  a  negative  and  an  inverse:  x+  (-x)  =  (-or)  +  x  =  0;  x-x~l  = 
x~l  •  x  =  1  if  x  ^  0; 

is  called  a  number  field,  its  elements  are  numbers. 

•  There  are  only  three  kinds  of  numbers:  real,  complex  and  quaternion  (theorem  of 
Frobenius). 

•  Real  and  complex  numbers — are  very  familiar:  the  complex  numbers  are  formed  with 
two  real  numbers  and  one  special  unit  i  with  i2  —  —  1. 

•  Quaternions  are  not  so  familiar:  they  are  formed  with  four  real  numbers  and  three  units 
ei,  62  and  63  with  e\  =  e^  =  63  =  616263  =  — 1. 

^-  •  Multiplication  is  commutative  for  real  and  complex  numbers,  but  not  for  quaternions 

xy  ^  yx. 

•  Matrices  are  rectangular  arrays  of  numbers.  With  a  square  matrix  one  can  usually 
associate  a  number  called  its  determinant.  Matrices  can  be  added  or  multiplied  if  their 
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number  of  rows  and  columns  are  appropriate.  The  values  of  A  for  which  det(A  —  A/)  is 
zero,  are  called  eigenvalues  of  A.  (Here  I  is  the  unit  matrix).  For  an  eigenvalue  A  of  A 
the  non-zero  column  matrix  x  satisfying  the  equation  Ax  =  Xx  is  called  the 
eigenvector  of  A  corresponding  to  the  eigenvalue  A. 

A  square  matrix  A  of  order  n  x  n  has  n  eigenvalues  and  at  most  n  eigenvectors. 
One  can  define  many  elementary  operations  on  matrices  such  as  transposition, 
hermitian  conjugation  and  taking  the  dual,  the  result  is  denoted  respectively  by  AT,  A^ 
and  AD.  We  call  A  symmetric  if  A  =  AT,  hermitian  ifA  =  A*,  self-dual  if  A  =  AD, 
orthogonal  if  AAT  —  I,  unitary  if  AA^  =  I  and  symplectic  if  AAD  =  I. 
The  eigenvalues  of  a  real  symmetric  matrix,  of  a  complex  hermitian  matrix  and  of  a 
quaternion  self-dual  matrix  are  all  real;  those  of  a  real  orthogonal  matrix,  of  a  complex 
unitary  matrix  and  of  a  quaternion  symplectic  matrix  are  all  of  the  form  e19,  9  real. 
>  The  eigenvalues  of  an  arbitrary  real  or  complex  matrix  are  in  general  complex 
numbers,  and  the  question  of  the  eigenvalues  of  an  arbitrary  quaternion  matrix  is 
meaningless,  since  the  determinant  of  such  a  matrix  can  not  be  defined  in  a  reasonable 
way. 

»  The  problem  of  random  matrices  is  very  simple  to  state.  Given  an  n  x  n  real 
symmetric  random  matrix  (random  means  that  the  probability  density  of  the  matrix 
elements  is  given),  what  can  one  say  about  its  eigenvalues?  or  about  a  few  of  its 
eigenvalues?  or  of  its  eigenvectors?  or  a  few  of  its  eigenvectors?  The  words  "real 
symmetric"  in  the  above  sentence  can  be  replaced  by  any  of  the  following,  "complex 
hermitian",  "quaternion  self-dual",  "symmetric  unitary",  "unitary",  "self-dual 
unitary",  "real"  or  "complex". 

•  Why  should  one  worry  about  such  questions?  Some  history:  In  the  1920's  and  1930's 
some  mathematicians  studied  such  questions  in  relation  to  their  applications  in 
preparing  birth  and  death  statistics  for  insurance  companies.  In  1942-1944,  Selberg 
evaluated  a  related  integral  and  used  it  to  find  the  density  and  the  distribution  of  prime 
numbers.  But  this  paper  remained  unnoticed  for  about  40  years.  More  about  it  later.  In 
the  1940's  and  1950's  nuclear  bombs  and  nuclear  power  stations  appeared,  and  for 
obvious  reasons  people  wanted  to  know  about  the  neutron  resonance  energies  of  many 
nuclei  such  as  uranium,  cadmium  and  aluminium;  their  positions,  heights  and  widths. 
According  to  quantum  mechanics  each  nucleus  is  described  by  a  hamiltonian  function, 
its  eigenvalues  giving  the  positions  and  widths  of  the  resonances.  So  choose  a  base, 
express  the  hamiltonian  as  a  large  matrix  in  this  base  and  compute  its  eigenvalues.  The 
hamiltonian  function,  i.e.  the  nuclear  interactions  are  not  known  so  well,  and  even  if 
known  are  quite  complicated.  Hence  Wigner,  who  knew  the  question  very  well, 
suggested  that  for  statistical  properties  it  will  be  sufficient  to  take  the  hamiltonian 
matrix  elements  as  random  numbers.  In  the  1960's  and  1970's  a  large  amount  of 
experimental  data  about  nuclear  resonances  in  various  nuclei  was  collected;  also  the 
mathematical  problem  of  random  matrices  was  more  or  less  solved  for  one  particular 
model,  namely  when  the  matrix  elements  have  a  normal  or  Gaussian  distribution.  A 
comparison  of  the  two  was  quite  satisfactory.  In  the  1970's  it  was  discovered  somewhat 
unexpectedly  that  the  zeros  of  the  Riemann  zeta  function  on  the  critical  line  behave  as 
if  they  were  the  eigenvalues  of  a  complex  hermitian  random  matrix.  (A  small 


concrete  beams,  the  electromagnetic  resonances  in  closed  cavities  of  random  shape,  the 
possible  quantum  energies  of  a  particle  confined  in  a  box  of  random  shape,  the 
positions  of  trees  in  a  forest,  etc.  all  seem  to  follow  the  same  statistical  laws  as  the 
eigenvalues  of  a  random  matrix.  The  mathematical  methods  developed  for  the  random 
matrix  problem  find  use  in  other  fields  such  as  quantum  gravity  and  string  theory. 

A  list  of  references  is  given  at  the  end  of  these  notes  in  a  general  bibliography.  [1]  and 
[2]  are  cited  in  the  text  as  RM  and  MT  respectively. 

2.  (Gaussian)  Ensembles  of  matrices 

2.  1  Invariance  under  a  change  of  basis 

In  these  lectures  I  will  consider  ensembles  of  matrices,  H,  whose  elements  are  random 
(explained  below),  and  which  are  invariant  under  a  transformation  U  of  the  basis,  namely 


}  =  P(H).  (1) 

The  three  cases  of  importance  are  when  H  is 

•  real  symmetric.  U  is  then  real  orthogonal.  This  case  has  the  parameter  (to  be  explained 
below)  /3=1. 

•  complex  hermitian.  17  is  then  complex  unitary  and  (3  =  2. 

•  quaternion  self-dual.  This  case  is  unfamiliar,  and  the  following  subsection  deals  with 
this  case  in  detail.  U  is  then  quaternion  real  symplectic  and  /3  =  4. 

Here  (3  is  the  number  of  real  components  of  the  matrix  elements,  1  for  real,  2  for  complex 
and  4  for  quaternion.  Invariance  implies  that  P(tf)  depends  only  on  the  traces  of  powers 
of#. 

P(H)=P(t[H,trH2,...,trHn).  (2) 

(Exercise:  Why  only  the  first  n  powers?  Hint:  Only  symmetric  functions  of  the 
eigenvalues  survive  such  transformations.) 

2.2  Digression  on  quaternions 

Quaternions  (or  quaternion  numbers)  are  not  so  familiar  as  the  real  or  complex  numbers. 
Just  as  complex  numbers  have  two  components,  quaternions  have  four  components, 
a  —  OQ  +  a\e\  +  a^e-i  +  0363  =  OQ  +  a.e.  One  calls  ao  the  scalar  part,  and  a  the  vectorial 
part  of  a.  The  dual  of  a  is  aD  =  a$  —  a.e.  The  units  e\  ,  e^  ej  have  the  multiplication  laws 


=  -e^ei  =  e\  , 

<?2-  (3) 
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The  three  2x2  Pauli  matrices  a\,  cr2  and  <73  familiar  to  physicists,  also  have  simila 
multiplication  laws  and  therefore  it  is  sometimes  convenient  to  represent  the  ej  and  th 
quaternions  as  2  x  2  matrices, 


(4 

=    o  c(aD)  =     -  -3  -«,  -  1*!  /    (3 


Thus  an  n  x  n  quaternion  matrix  will  sometimes  be  represented  by  a  2n  x  In  comple 
matrix.  Usually  the  a;  are  taken  as  real  numbers,  but  we  will  allow  them  to  be  complex 
Thus  some  non-zero  quaternions  will  not  have  an  inverse.  This  being  admitted,  an 
2n  x  In  complex  matrix  can  be  thought  of  as  cut  into  2x2  blocks  and  each  2x2  bloc 
expressed  as  a  quaternion;  if  these  quaternions  are  real  (i.e.  their  components  are  res 
numbers),  we  call  it  a  quaternion  real  matrix.  This  establishes  a  one  to  on 
correspondence  between  an  n  x  n  quaternion  matrix  A  and  a  2n  x  In  complex  matri 
C(A).  One  verifies  easily  (please  do  it!)  that 

C(AD]  =  ~C(eiI}C(A)TC(eiI}.  (( 

Hence  A  is  self-dual  if  and  only  if  C(e\I}C(A)  is  an  anti-symmetric  matrix  of  twice  th 
size. 
We  state  here  one  definition  and  two  theorems  without  proof. 

Theorem  1.  For  a  quaternion  matrix  A  it  is  impossible  to  define  a  determinant  havir, 
the  three  properties: 

(i)  det  A  =  0,  if  and  only  if  Ax  =  0  has  a  non-zero  solution  x; 
(ii)  det(AB)  =  det  A-det£; 
(iii)  det  A  is  multilinear  in  the  rows  (or  in  the  columns')  of  A. 

(For  a  proof  see  Chapter  8  ofMT.)  In  order  to  define  a  determinant  of  a  quaternion  matr 
one  has  to  give  up  one  or  more  of  the  above  properties. 

DEFINITION  1 

For  a  quaternion  matrix  A  we  define 

detA  =  ^,(-rf(ahhahh  •  •  •  ajrjl  )oKfeflfcfe  •  •  •  akskl  )0  •  •  •  ( 

p 

where  the  sum  is  over  all  permutations  P  of  the  indices  (1,2,  ...  ,n)  consisting  of  t 
exclusive  cycles  (/i,  J2,«  •  -,yV),  (*i,*2,  •••,**),....  The  subscript  0  means  that  o: 
should  take  the  scalar  part  of  each  cyclic  product.  If  the  quaternion  real  matrix  A  is  se 
dual,  then  the  subscript  0  in  eq.  (7)  above  is  not  necessary  under  certain  conditions,  (s 
Chapter  8  of  MT  for  details).  (Exercise:  Which  of  the  three  properties  of  determinants 
missing  in  this  definition?). 
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Theorem  2.  Let  C(A)  be  the  In  x  In  complex,  matrix,  corresponding  to  the  n  x  n  self- 
dual  quaternion  matrix  A.  Then  C(e.\I}C(A}  is  a  2n  x  2n  anti-symmetric  matrix  and  the 
determinant  of  A  is  equal  to  the  Pfaffian  of  C(e\I}C(A) 

detA  =  Pf[C(*i/)C(A)].  (8) 

(For  a  proof  see  Chapter  8  of  MT,  or  devise  one  by  yourself.) 

2.3  Statistical  independence  of  the  various  elements 

pw  =  n  //*w  n  n  /^(4a))-  (9) 

j  j<k  a=0 

These  requirements  then  force  P(H)  to  be  an  exponential  with  only  first  two  powers  of  H. 
P(H}=&\p(-atrH2+btiH  +  c).  (10) 

For  a  proof  consider  H  =  U~1H'U,  with  U  —  I  having  non-zero  small  elements  e  and  —E 
only  in  the  position  (1,2)  and  (2,1)  respectively.  Then  compare  P(H)  and  P(H')  to  the 
first  order  in  e.  For  the  anti-symmetric  real,  anti-herrnitian  complex  or  anti-self-dual 
quaternion  real  matrices,  the  same  two  constraints,  namely  (i)  invariance  under  an 
orthogonal  real,  unitary  complex  or  symplectic  quaternion  real  matrix,  and  (ii)  statistical 
independence  of  linearly  independent  various  real  parameters  entering  in  the  matrix 
elements,  lead  to  P(H)  =  exp[—  a  trH2  +  c]. 

2.4  Change  of  variables  to  eigenvalues 

We  take  H  to  be  either  real  symmetric,  or  complex  hermitian,  or  quaternion  real  self-dual. 
The  matrix  H  has  n  real  diagonal  elements  and  \n(n  -  I)  non-diagonal  independent 
elements  which  are  real,  complex  or  quaternion  real.  Thus  the  total  number  of  real 
variables  in  H  is  n  +  ^n(n  —  l)/3.  Also  H  has  n  real  eigenvalues.  So  if  we  want  the 
eigenvalues  of  H  as  new  variables,  we  have  to  supplement  them  with  ^n(n  —  l}(3  extra 
variables.  What  is  the  best  choice?  The  matrix  which  diagonalizes  H  also  has  \n(n  —  l)/3 
(angular)  real  parameters  to  characterize  it.  Choosing  these  angular  parameters  and  the  n 
(real)  eigenvalues  of  H  as  new  variables,  the  exponential  part  of  P(H]  is  easily  expressed 
since  tr//;  =  £^=i  ^1-  ^ne  ^as  now  to  evaluate  the  Jacobian.  The  result,  after  some 
calculation,  is 


where  6J,-  are  the  eigenvalues  of  H,  /3  is  1,  2  or  4  for  real,  complex  or  quaternion  real  H, 
and/  depends  only  on  the  angular  parameters  specifying  the  matrix  U  which  digonalizes 
H.  For  anti-symmetric,  anti-hermitian  or  anti-self-dual  matrices  one  should  note  that  the 
non-zero  eigenvalues  come  in  pairs.  Changing  the  origin  and  the  scale,  we  can  thus  write 
the  joint  probability  density  of  the  eigenvalues  as 
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3.  Some  general  questions  which  are  well  posed 

The  m-point  correlation  function  Rm(xi,  .  .  .  ,xm]  is  defined  as 

I  n  /» 

flm(*l..--i*m)  =  t     _'m\]          •'•         P(x\t...,Xn)fam+i'--6xn.  (13) 

In  fact  Rm(x\,...  ,xm}Ax\  •  •  •  dxm  is  the  probability  of  having  one  eigenvalue  in  each  of 
the  m  intervals  dxi,  .  .  .  ,  dxm  around  the  points  x\,  .  .  .,  x,n  respectively.  When  the  m  points 
separate  into  two  or  more  clusters  far  away  from  each  other,  then  Rm  is  usually  a  product 
of  functions  depending  separately  on  each  cluster.  To  measure  the  effect  of  neighborhood 
on  the  probabilities  one  defines  cluster  functions  Tm(x\,  .  .  .  ,jcm).  They  are  also  known 
under  the  name  of  cumulants  and  correspond  to  connected  graphs  in  the  diagrammatical 
representation  of  the  perturbation  series  due  to  Ursell,  Yvon-Mayer  in  statistical 
mechanics,  and  more  spectacularly  due  to  Feynman  in  field  theory.  They  are  zero 
whenever  the  m  points  divide  into  two  or  more  sets  far  away  from  each  other.  The 
functions  Tm  can  be  expressed  in  terms  of  the  Rm  and  conversely.  So  it  is  sufficient  to 
evaluate  one  of  these  two  sets. 

Level  spacing  junctions.  One  might  ask  for  the  probability  £(m,  t)  of  having  exactly  m 
eigenvalues  inside  an  interval  of  length  s  =  2t  chosen  at  random.  Usually  the  length  of 
the  interval  is  measured  in  terms  of  the  local  mean  spacing.  Or,  one  might  ask  for  the 
probability  F(m,  s]  of  having  m  eigenvalues  inside  an  interval  of  length  s  measured  from 
a  randomly  chosen  eigenvalue;  or  for  the  probability  /?(m,  s}ds  that  the  distance  between 
two  eigenvalues  lies  between  5  and  s  +  d-y,  while  this  interval  5  contains  exactly  m 
eigenvalues  inside  it.  These  three  sets  of  functions  satisfy  the  relations 


),     form>0.      (14) 

;=0  7=0 

It  will  therefore  be  sufficient  to  know  E(m,  t)  from  which  the  other  functions  can  be 
deduced.  One  may  of  course  ask  other  questions;  for  example,  what  is  the  average 
number  of  eigenvalues  in  an  interval  s  chosen  at  random?  and  what  is  its  dispersion 
around  this  average?  Or  how  well  a  linear  graph  represents  the  number  of  eigenvalues  in 
an  interval  as  a  function  of  its  length? 

Correlation  functions  for  the  Gaussian  ensembles.  The  case  ft  =  2  of  complex 
hermitian  random  matrices  is  mathematically  the  simplest,  since  we  need  only  real 
symmetric  matrices  and  their  determinants.  For  the  cases  ft  =  1  (most  important  for 
applications)  and  ft  -  4,  one  needs  self-dual  quaternion  matrices  and  their  determinants. 
To  evaluate  the  m-point  correlation  function  we  will  use  the  following  theorem. 

Theorem  3.  Let  the  complex  function  f(x,y)  be  such  that 

(0     f(*,y)  =  [fly,*)]*,  (is) 
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ig  integral  of  the  nxn  determinant  det[/(xj,Xy]  over  dju(jcn)  is,  apart  from  a 
constant,  equal  to  the  (n  —  1)  x  (n  —  1)  determinant  obtained  from  the  original  one  by 
removing  the  row  and  column  containing  xn;  i.e. 


].;.=lv..irt  =  (c  -  n  +  1)  det[/(*,,  *,-)],.  y==1,...in_1.  (18) 

The  same  theorem  is  valid  when  f(x,  y}  is  a  quaternion  real  function  satisfying  the. 
requirements, 


(i)        /(*,?)  =  [/(?,*)],     (ii)         /(*,*)d/i(*)=c,  (19) 

(iii)          //(*,y)  /(y,z)d^(y)  =/(*,z)  +  A/(*,z)  -/(x,z)A,  (20) 


where  A  is  a  constant  quaternion  and  the  determinant  of  quaternion  matrices  is  given  in 
Definition  1,  eq.  (7).  As  we  said  earlier  the  determinant  of  a  quaternion  matrix  is  a 
convenient  way  of  speaking  about  Pfaffians  of  an  anti-symmetric  matrix.  For  a  proof  see 
chapter  8  of  MT.  To  calculate  the  m-point  correlation  function  all  we  have  to  do  is  to 
express 


as  an  n  x  n  determinant  det[/(^,-,jc/  )]  with  the  function/(jt,  y)  satisfying  the  requirements 
of  the  above  Theorem  3.  When  (3  =  2,  our/(*,y)  will  be  a  real  function  and  when  (3  is  1 
or  4,  it  will  be  a  quaternion  real  function. 


3.1  Alternants 

In  the  last  century  Vandermonde  evaluated  the  following  determinant 


n 


Actually,  recalling  the  elementary  properties  of  a  determinant  one  can  replace  the  powers 
of  x  by  any  polynomial  of  degree  /  Thus 

det[Cy_i  (*/)],)/=1>...in  =  constant     JJ     (Xj  -  *«•),  (23) 


where  the  constant  on  the  right  hand  side  is  the  product  of  the  coefficients  of  #j  in  Cj(x) 
for  j  =  0,  1,  .  .  .,  n  —  1.  The  polynomials  Cj(x)  being  arbitrary,  we  can  choose  them  as  we 
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tet[Qj-i(xi)t  fij-i  (*/)]  f  =  i, . . .  jW  =  constant     [[    fo  -*,-),  (24) 

jf  =  1,.. .  ,2n 

where  the  polynomials  (?/(*)  a16  again  arbitrary,  Q/(x)  is  the  derivative  of  Qj(x)  and  the 
constant  is  the  product  of  the  coefficients  of  xj  in  Qj(x)  for  j  =  0, . . . ,  2n  —  1.  We  will 
choose  three  sets  of  polynomials  Rj(x),  Cj(x)  and  Qj(x)  corresponding  to  the  three  cases 
0=1,  0  =  2  and  0  =  4  to  express  P/j(jti, . .  .,*„),  eq.  (21),  as  an  n  x  n  determinant 
det[/(jc;,^]  satisfying  the  conditions  (15-17)  or  (20).  (The  letters  R,  C  and  Q  are  chosen 
here  to  recall  that  they  correspond  to  real,  complex  and  quaternion  matrices.) 

3.2  Orthogonal  polynomials 

Given  a  non-negative  weight  function  w(jt)  for  a  <  x  <  b,  such  that  Ja  xjw(x)dx  is  finite 
for  every  integer  j  >  0,  one  can  define  a  series  of  polynomials  PJ(X]  such  that 


L 


b 
Pj(x}pk(x)\v(x}dx  =  CjSjk,  (25) 


where  <5;*  is  1  if  j  =  k  and  0  if  j  ^  A:.  The  constants  c/  are  positive  and  can  be  chosen  to  be 
1.  The  polynomials  PJ(X}  are  called  orthogonal;  they  are  called  orthonormal  if  c/  =  1. 
Actually,  these  polynomials  result  from  the  orthogonalization  of  the  power  series 
(1,  x.x2,  .  .  .)  with  respect  to  the  symmetric  scalar  product  (**,*•'')  =  J*xi-xj-\v(x)dx.  But 
this  scalar  product  need  not  be  symmetric  and  can  be  chosen  otherwise.  For  the  case 
/3  =  2,  we  find  it  convenient  to  choose  the  polynomials  Cj(x)  as  the  familiar  orthogonal 
polynomials  with  weight  w(x)  =  exp(-;c2)  for  -oo  <  x  <  co.  For  the  other  two  cases 
/3  =  1  and  /?  =  4,  we  will  choose  the  polynomials  Rj(x)  and  Qj(x)  as  skew-orthogonal 
polynomials  resulting  from  anti-symmetric  scalar  products  defined  later.  We  choose  the 
polynomials  Cj(x)  to  be  proportional  to  the  familiar  Hermite  polynomials 
HJ(X)  =  t^(-d/dx)J&-^,  and  the  functions 


(26) 
so  that  (j>j(x)  are  orthonormal  on  (—00,  oo), 

/oo 
^j(x)tf>k(x}dx  =  6jk.  (27) 

•oo 

Then  from  eq.  (23) 

JJ    (x,-  -^-)  =  constant  det[0J._1(^.)].=1     n,  (28) 

'    ' 


or,  for  the  case  /?  =  2, 

1  ,  .  .  .  ,  xn)  =  constant  det[<^_i  (x,-)]  -det[^_j  (xk)}  (29) 
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n-1 


—  constant  det 


£< 


(30) 

j,k=l ,...,« 

==  constant  det[/(jty,*jk))],  (31) 

where 

n-l 

1=0 

Now  let  us  examine  whether  this/(;t,y)  satisfies  the  conditions  of  Theorem  3.  As  /(*,y) 
is  real  and  symmetric,  condition  (15)  holds  true.  Also  due  to  orthonormality  of  the 
functions  $/(*),  eq.  (27),  one  has  J^°  f(x,x)dx  =  n,  and 


/oo  /»oo 

/(*,  y)/(y,  *)dy  =  £  £  &  W  W*)  /    &  W-bOdy  (33) 

'°°  J  —oo 


t        m 

^z)=rt*'z)-  (34) 

Thus  we  have  expressed  /^(xi,  •  •  •  ,•*«)  as  an  n  x  n  determinant  [f(xj,Xk)],  where/(;c,y) 
satisfies  the  conditions  of  Theorem  3.  Using  Theorem  3  several  times  one  gets  the  m- 
point  correlation  function  in  case  (3  =  2.  For  the  undetermined  constant  in  eq.  (31)  it  is 
sufficient  to  integrate  over  all  the  n  variables  and  check  the  normalization  of 
i  •  •  •  ?*«)•  Thus  in  case  0  =  2, 


Rm(xi,.  ..,*„,)=  det[/(^,^)](.;.=1)  )m.  (35) 

The  m-point  cluster  function  Tm(x\,  .  .  .  ,jcw)  is  then 

Tm  (xi  ,  .  .  .  ,  xm)  =  E  f(*i  »  xh  )/(^2  >  *fe  )  '  '  '  /Um  »  ^1  )  .  (36) 

where  the  sum  is  taken  over  all  (m  —  1)!  permutations  of  (2,  3,  .  .  .  ,  m). 

3.3  Skew-orthogonal  polynomials 

\ 

Instead  of  the  symmetric  scalar  product  one  may  consider  an  anti-symmetric  scalar 
product 

«dx  (37) 


or 


with  w(jc,y)  =  — w(y,  jc).  Such  a  scalar  product  can  be  used  to  construct  skew-orthogonal 
polynomials.  We  will  use  them  to  write  Pp(x\,...)Xn)  for  (3  —  4  and  /3=lasannxn 
quaternion  determinant.  Let  us  first  take  the  simpler  case  (3  =  4,  and  choose  polynomials 
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2j(x)  satisfying  the  skew-orthogonality  relations 

o 

\(~)T( TC\fl  (Y\  fV    (r\/~)~,         f-v\\i*~X^  Air  f\  f1Cl\ 

i*£41  V    /*^2«+l  \    /         "^2j V.    / !o2K+l  v^vj'^        *A*  —  ~/fc)  v^"J 

« 

/oo 
•oo 
/oo 
[Q2/+1  (*)£&+!  W  ~  Qiy+i  Wi2ak+i (x)]eT*fa  =  0,  (41) 

•oo 

i.e. 

/oo 
0f  W^We^dx  =  5yt,  (42) 

•00 

where  •£,•(*)  is  the  quaternion  having  the  2  x  2  matrix  representation 

(43) 


and  0?(x)  is  its  dual.  Now  choose  the  quaternion  /4(*,y)  as 

n-l 

/4(*i3')  =  X)^Wf(y).  (44) 

j=0 

Theorem  4.  We  c/azm  f/iar 

det[/4(^-,^)]  =  constant    JJ    (jc,-  -^)4,  (45) 

/oo 
/^.^e-^dx^n 


/o 


From  the  definition  of  f4(x,  |/),  cq.  (44),  and  the  orthonormality  relation  eq  (42) 
one  has  n' 


(48) 

£        wi 


/oo 
/4(*,y)/4(y,z)e->1dy  =  2  ^  ^(^(Z)  r  tf 

£        wi  «/—  oo 

=  E^Wtf(z)=/4(*,z),  (49) 

€ 

which  is  (47).  Also  since  ^(x)  and  ^(jc)  commute  (!), 

f°°  __  ,          "^    yoo 

/      Mx,x)txidx  =  Y^         ^W^Wc-^dx  (50) 

00  7=0    •'-oo  v      y 
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n—i       />oo 


(51) 

j-Q 

To  prove  (45),  observe  that  det  [f^(xj,Xk}}  is  equal  to  the  Pfaffian  of  the  2n  x  2n  anti- 
symmetric matrix  C(e\I}C([fa(xj,Xk}}),  which  in  turn  is  the  square  root  of 
detC([/4(*,,*iO]),i.e. 

det[/4  (*;,*,)]  =  detC([^_i(*t)])-  (52) 


But  from  (43)  and  (24),  this  is  just  (45).  Thus  we  have  expressed  P*(x\,  .  .  .  ,jcn)  as  an 
n  x  n  quaternion  determinant 

P4(*i  ,  .  .  .  ,  xn)  =  constant  det[/4(jc;,  xk}]  -e~(^+'"+^  ,  (53) 

and  the  function  /4  (*,)>)  satisfies  the  requirements  of  Theorem  3.  Hence  the  m-point 
correlation  function  can  be  written  as  an  m  x  m  quaternion  determinant 


Rm(xi  ,...,*„)=  constant  Q~+-+     -det^fo,  xt)]M=iv..,«>  (54) 

or  as  a  Pfaffian  of  a  2m  x  2m  real  anti-symmetric  matrix.  The  constant  in  the  last 
equation  can  be  fixed  by  integrating  over  all  the  variables,  the  result  should  be  1.  Next  we 
take  the  case  ft  =  1  .  The  treatment  is  simpler  when  n  is  even,  n  =  2r.  Let 

e(x  -  y)Rj(y}^dy:  (55) 


where  Rj(x)  is  a  polynomial  of  degree  j,  and  e(x)  =  (l/2)sign(jc),  i.e.  it  is  1/2  for  x  >  0, 
-1/2  for  x  <  0  and  0  f  or  x  =  0.  Let  the  quaternion  ipj(x)  have  the  2  x  2  matrix 
representation 


(56} 

(56) 

and  we  choose  the  polynomials  Rj(x)  such  that 

/oo 
Vf  (*Wk(x)te  =  6Jk.  .  (57) 

•oo 

The  quaternion  X)J=o  <0XJC)Vf>C)')  having  the  2  x  2  matrix  representation 

G(xv}-\S^^     7^>3>) 
G^y)-[D(x,y)     S(y,x} 

with 

r-l 


7=0 


gy(y)gy+i  (x)]  ,  (59) 

7=0 
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/oo  />oo 

G(x,y)G(y,z)dy  =  G(x,z),       /      G(jc,jc)d*  =  n,  (60) 

-co  J —oo 

but  its  determinant  is  zero,  since  the  matrix  [G(xj,Xk)}  is  the  product  of  two  matrices 
C(il>e(xj}}  and  C(i$(xk)}  of  orders  2n  x  n  and  nx2n  respectively  (^  taking  values  0,  1, 
. . . ,  r  —  1,  while  j  and  k  take  values  1 , 2, . . . ,  n,  and  n  =  2r.)  This  also  shows  that  the  rank 
of  the  matrix  [G(jCj,jcjt)]  is  n  and  its  last  n  columns  [I(xjtXk],  S(xk,Xj)]T  are  linear 
combinations  of  its  first  n  columns  [S(xj,Xk),  D^A*)]7.  Therefore  if  we  write 


(61) 
then  the  quaternion  f\  (jc,  y]  is  the  dual  of  fi  (3;,  x}  and  with  some  algebra  one  verifies  that 

;, z)dy  =  /! (x, z)  +  A/! (*, z)  -fi (x,  z)A,  (62) 

A  —  \e\.  For  this  verification  one  can  conveniently  use  the  2x2  matrix 
representations  of  various  quaternions  and  the  fact  that  gj(x)  =  Rj(x)e~** .  To  see  that 
PI (*i,..., *„)  is  proportional  to  det[/i(jc/,jCfc)],  one  proceeds  as  follows.  The  first  n 
columns  of  C([fi(xj,Xk)])  are  the  same  as  those  of  [G(jt/,jfy)],  while  the  last  n  columns  of 
C([/(je/,jCjk)])  differ  from  those  of  [G(xj,Xk)]  by  [e(jc/  -xk),  0]T.  Hence 

=  det 


—  uet 

=  det 
Now 


i.,^)]^  .....  J2 

-  {det^.!  (^)]A;-=1)...,n}2e-2^+-^)  (63) 

and 

det[e(^-^)]  =  l-  (64) 

Hence 

detC([/i  (*,-,**)])  =  constant  (Pi  (^,  .  .  .  ,jcn))2,  (65) 

or 

PI  (*i  ,  .  .  .  ,  j:n)  =  constant  det  [/i  (*,-,**)]  .  (66) 
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quaternion  ueienrnnani  or  me  square  root  01  me  aeiermmam  01  an  ordinary  z.m  x  ^m 
matrix.  The  m-point  cluster  function  is  given  by  (36)  where  the  function  f(x,  y)  is  now 
replaced  by  either  of  the  quaternion  /4(x,j)  or/i(jc,  v),  the  result  being  a  scalar.  The  m~ 
point  cluster  function  Tm(x\,  .  .  .  ,xm]  is  then 

Tm  (xi  ,  .  .  .  ,  jcm)  =  ^2  /(*i  '  Xi2  )  /(*fe  .  xk  )  '  '  '  f(*im  .  *i  )  i  (67) 

where  the  sum  is  taken  over  all  (m  —  1)!  permutations  of  (2,  3,  .  .  .  ,m). 

3.4  1-  and  2-point  correlation  functions 

The  one  point  function  RI  (jc)  gives  the  density  of  eigenvalues  at  jr.  For  (3  =  2,  it  is 

*i  (*)  =  £#(*)  (68) 

7=0 

and  in  the  limit  of  large  n  it  is  the  "semi-circle", 

JL 


(69) 

For  jfl  =  1  or  /?  =  4,  /?i  (x)  differs  from  (68)  and  in  the  limit  of  large  n  goes  to  the  "semi- 
circle", eq.  (69).  Thus  the  average  distance  between  the  eigenvalues  near  the  origin  is 
7r/\/2n.  The  two-point  correlation  functions  are  different  for  the  three  cases,  even  in  the 
limit  of  large  n.  When  n  —  »  oo,  Xj  —  >  0  and  Xj  \/2n/ir  =  yj  are  constants,  one  has 

*2(*)  =  l-*2(r),     0  =  2,  (70) 


=  1  -  52(2r)  +  f^5(2r))  •  f  5(2z)dz,     /3  =  4,  (72) 

\dr         /  Jo 

where  r  =  |  y\  —  yi\  and  s(r)  =  sin(7rr)/(7rr). 

4.  Circular  ensembles 

Three  other  sets  of  ensembles  (i)  symmetric  unitary  random  matrices,  (ii)  unitary  random 
matrices,  and  (iii)  self-dual  unitary  random  matrices,  have  been  extensively  studied.  They 
are  defined  by  the  following  requirements:  (i)  The  probability  density  P(A)  of  a 
symmetric  unitary  matrix  A  should  remain  invariant  under  the  transformation 
A  ->  WTAW,  W  unitary,  (case  (3  =  1),  (ii)  The  probability  density  P(A}  of  a  self-dual 
unitary  matrix  A  should  remain  invariant  under  the  transformation  A  -*  WDAW,  W 
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unitary,  (case  /3  =  4),  (iii)  The  probability  density  P(A)  of  a  unitary  matrix  A  should 
remain  invariant  under  the  transformation  A  —  >  UAW,  U  and  W  unitary,  (case  /3  =  2). 
Actually  a  small  variation  in  A  is  characterized  in  the  three  cases  /?  =  1,  (3  —  4,  and  /3  =  2 
respectively  by  (i)  dA  =  Wr-idH-W,  d#  real  symmetric,  (ii)  dA  =  WD-/dH-W,  dH 
quaternion  real  self-dual,  (iii)  dA  =  U-idH-W,  dH  complex  hermitian. 

4.  1  Probability  density  for  the  eigenvalues 

The  eigenvalues  of  a  unitary  matrix  A  are  of  the  form  e'*,  9  real.  The  matrix  A  can  be 
diagonalized  by  a  (i)  real  orthogonal,  (ii)  quaternion  real  symplectic,  or  (iii)  complex 
unitary,  matrix  respectively  in  the  three  cases.  For  the  change  of  variables  from  A  to  its 
eigenvalues  eMj  one  again  has  to  calculate  a  Jacobian,  which  in  these  cases  give 

P/3(9},...,On)  =  constant      JJ     |e'^  -  e''Y,  (73) 

i<j<*<« 

where  j3  is  1,  4  or  2. 

4.2  Correlations  and  cluster  functions 
To  compute  the  correlation  function 

*  •  •  •  ,  0«)d0m+i  •  •  •  d6n  (74) 


»      ,  .  •  .  ,  » 

(n  -mjl 

one  can  again  use  Theorem  3,  if  one  can  express  Pp(9\  ,  .  .  .  ,  9n}  as  an  n  x  n  (ordinary  or 
quaternion)  determinant  [/(#/,  0*)]  with  /(£,  77)  satisfying  the  conditions  of  that  theorem, 
namely,  (i)/(^,  77)  is  the  complex  conjugate  or  dual  of  /(TJ,  0»  (ii)  f%*  f(&  rf)  f(n,  Qdrj  = 

f(&  0  +  A/(^,  C)  -/(£,  C)  A,  A  a  constant  quaternion,  and  (iii)  /^/(^  £)d£  =  c»  a  real  or 
a  scalar  number.  The  construction  of  such  a  matrix  [/(<?/,  9k}]  is  similar  to  and  somewhat 
simpler  than  in  the  case  of  Gaussian  ensembles.  The  results  are  as  follows.  The  case 
f3  =  2  is  again  the  simplest,  one  needs  only  ordinary  matrices  and  determinants,  no 
quaternions. 

M  =  s»(t  -  ti>  (75) 


the  sum  over  p  in  (75)  is  over  the  values 

p  =  4(n  -  1),  -\(n  ~  3), . . . ,  i(n  -  3),  J(w  -  1).  (77) 

For  cases  ft  —  4  and  /3  =  1  one  needs  quaternion  matrices  and  their  determinants.  The 
results  are 
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with  Sn(0)  given  by  (76)  and 

A,(0)=-(0)  (79) 


where  the  sum  over  q  is  over  ±(n  +  l),  ±(n  +  |), —  When  n  — >  oo,  0/  — *  0,  and 
nOj/fa)  =yj  finite,  then  Sn(0)  -*  s(r),  Dn(0)  -»•  d/drs(r),  7n(0)  -*  /Ors(£)d£,  and  the 
correlation  functions  Rm(y\, . . .  ,ym)  become  identical  to  those  for  the  corresponding 
Gaussian  ensembles.  In  other  words,  the  statistical  properties  of  any  finite  number  m  of 
the  eigenvalues  in  the  limit  of  large  n  are  identical  for  (i)  Gaussian  orthogonal  and 
circular  orthogonal  ensembles,  (ii)  Gaussian  unitary  and  circular  unitary  ensembles,  (iii) 
Gaussian  symplectic  and  circular  symplectic  ensembles. 

4.3  Relation  between  the  circular  orthogonal  and  circular  symplectic  ensembles 

Take  2n-dimensional  matrices  in  the  circular  orthogonal  ensemble.  The  joint  probability 
density  for  its  eigenvalues  is  Ili<j<jfc<2n  \e'Bj  ~  e/<H  Ordering  the  eigenvalues  around  the 
unit  circle  Q\  <  62  <  •  •  •  <  02n,  if  we  integrate  over  alternate  eigenvalues,  say  9\,  03, . . ., 
02n-i,  then  we  get  constant  rii<./<*<7i(el'fl:y  ~  e'6*2*)4,  which  is  proportional  to  the  joint 
probability  density  of  the  eigenvalues  of  an  n  x  n  self-dual  unitary  matrix.  In  other  words, 
the  statistical  properties  of  n  alternate  angles  0/,  where  Ql6j  are  the  eigenvalues  of  a 
symmetric  unitary  matrix  of  order  2n  x  In  taken  from  the  orthogonal  circular  ensemble, 
are  identical  to  those  of  the  n  angles  </>/,  where  e"^  are  the  eigenvalues  of  an  n  x  n 
quaternion  self-dual  unitary  matrix  taken  from  the  circular  symplectic  ensemble. 
(Theorem  10.6. 1  of  RM.)  A  similar  relation  holds  between  a  random  superposition  of  two 
orthogonal  ensembles  and  the  unitary  ensemble.  More  about  this  later. 

5.  Spacing  functions  Ep(r,t) 

Among  other  quantities  in  the  random  matrix  theory  the  spacing  functions  have  some 
importance.  The  spacing  function  Ep(r,t]  is  the  probability  that  a  randomly  chosen 
interval  of  length  2t  contains  exactly  r  eigenvalues  inside  it.  We  will  be  interested  in  the 
limit  when  n  is  large.  The  case  /?  =  2  is  again  mathematically  the  simplest.  To  find 
#2(0,  t),  say,  one  has  to  integrate  P2(*i)  •  •  •  ,*n)  over  all  the  variables  x\,  . . .,  xn  outside 
the  interval  of  length  20.  For  simplicity  we  will  chose  the  interval  as  (—0, 0): 

P2(xl,...,xn}dxl- ••<&„.  (81) 

out 


Madan  Lai  Mehta 
But 


1 

=  det  A/,  (82) 


with 


In  order  to  compute  £2(0,  f).  write  the  first  row  of  the  matrix  M  as  the  sum  of  n  terms  and 
the  determinant  as  a  sum  of  n  determinants.  Since  all  the  other  rows  of  M  are  symmetric 
in  Xj,  all  these  n  determinants  will  give  the  same  answer  on  integration.  So  we  can  replace 
the  first  row  of  M  by  00(*i)<M*i)  ^d  multiply  the  result  by  n.  Now  subtracting  suitable 
multiples  of  the  first  row  from  other  rows  we  can  eliminate  *i  from  all  of  them.  The  n  -  1 
rows  of  M  now  contain  X2,  ...,xn  symmetrically,  therefore  we  can  replace  the  second  row 
of  M  by  <j>i(x2)(f>k(x2\  multiply  the  result  by  n  -  1  and  eliminate  x2  from  the  remaining 
rows.  And  so  on.  Thus  we  get 

E2(0,  0)  =   I   det  [tfjL.,  fo)&-i  (*,)]M=1,...,,Id*1  •  -  -  6xn.  (84) 

./out 

Since  each  variable  occurs  in  only  one  row,  we  can  integrate  over  them  separately, 
£2(0,  0}  =   I   det  [c^_i  (xj)<t>k-i  (*/)]d*i  •  •  •  dxn 

./out 

r         re  1 

=  det  \8jk-  /    <l>j(x)<f>k(x)fa\ 

I  J-B  J;,Jt=0,...,n-l 

=  11(1-^),  (85) 

J=o 

where  A/  are  the  eigenvalues  of  the  symmetric  matrix 

W0*Wdx  '  (86) 


i.e.  they  are  solutions  of  the  algebraic  equation, 

det[%-  0*1=0,  (87) 

or  of  the  integral  equation 


(88) 

-B 

with  the  kernel 

^(*,y)  =  £^W^(y)-  (89) 

;=o 
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(Verify  this!  Hint:  solutions  of  (88)  are  of  the  form  Y%=o  ci<l>i(x)>  w^*1  me  ct  not  a^  zero.) 
When  n  — »  oo,  9  — >  0,  while  ^\/2n/7r  =  t  is  finite, 

sin  TT(X  —  y) 


The  average  distance  between  the  eigenvalues  being  (n/\/2n),  we  measure  distances  in 
this  unit.  The  integral  (88)  in  the  limit  n  —  »  oo  is 


(91) 
-t 

with 

*^W-  (92) 

Much  is  known  about  this  kernel  in  the  literature.  It  is  the  square  of  another  kernel 
(2f)~1/2e'7Dy/'  over  (-t,t);  it  commutes  with  the  differential  operator  (x2  -  l)d2/ 
dx2  +  2x  d/dx  +  TT2*2  and  hence  has  common  eigenfunctions  with  it.  The  solutions  of 
(91)  are  known  as  spheroidal  functions,  have  been  extensively  studied  and  tabulated.  The 
eigenvalues  A  of  (91)  all  lie  between  0  and  1,  can  be  ordered  as  1  >  AQ  >  AI  >  A2  >  .  .  .  , 
and  when  so  ordered  the  eigenfunctions  are  alternately  even  and  odd.  The  limiting  form 
of  (85), 


(93) 
/=o 

is  a  fast  converging  infinite  product.  In  (93)  above,  we  have  written  t  for  convenience, 
though  the  length  of  the  interval  is  2t.  The  A/  =  A/(f)  depend  of  course  on  t.  To  find 
E<i(r,  6),  one  has  to  evaluate  the  integral 

xl)    JJ  (1  -  a(Xj})dxl  •  •  -dxn, 

j=r+l 

(94) 

where  a(x)  =  1  if  |jc|  <  9  and  a(x)  =  0  if  \x\  >  0.  Again  PI(X\>.  .  .  ,xn),  (82),  can  be 
written  as  a  sum  of  determinants.  Whenever  any  variable  Xj  occurs  in  two  or  more  rows, 
these  rows  are  proportional  and  the  determinant  is  zero.  Only  those  determinants  survive 
in  which  each  variable  occurs  in  one  row  only,  so  we  can  replace  PI(X\  ,  .  .  .  ,  xn)  in  (94)  by 

(95) 

the  sum  being  over  all  permutations  (i\  ,  .  .  .  ,  -£n)  of  (1  ,  2,  .  .  .  ,  n).  As  each  variable  occurs 
in  one  and  only  one  row  in  any  of  these  determinants,  we  can  integrate  over  x\,  .  .  .,  xr 
from  —  0  to  6  and  over  (*r+i,  •  •  •  ,  xn)  outside  the  interval  (—(9,  6}.  Then  we  expand  each 
determinant  in  the  Laplace  manner  according  to  the  r  rows  containing  initially  the 
variables  x\,  .  .  .  ,  xr.  This  gives 

').  (96) 
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The  indices  f  =  (i\,  .  .  .  ,ir)  are  chosen  from  (0,  ...,«  —  1)  as  also  the  indices 
j  =  (ji,  .  .  .  jV).  The  indices  (ii,  .  .  .  ,  ir)  are  not  ordered,  while  the  indices  (jr'i,  .  .  .  Jr) 
are  ordered,  ji  <  ji  <  •  •  •  <  jr.  The  sum  in  the  above  equation  is  taken  over  all  choices  of 
indices  satisfying  the  above  conditions.  The  cofactor  [/  —  G]  (/;/')  of  order 
(n  —  r]  x  (n  —  r)  is  obtained  from  [/  —  G]  by  omiting  the  rows  (i)  and  columns  (j),  its 
determinant  is  apart  from  a  sign 


det  [/  -  G](/,  0  =  ±det  [/  -  G]-det  [(/  -  G),~j].  (97) 

Ordering  the  indices  i\  ,  .  .  .,  ir,  0  <  ii  <  •  •  •  <  ir  <  n  —  1,  and  taking  into  account  (97)  we 
can  write  (96)  as 


E2(r,  6}  =  det  [7  -  G]        dc*G(i;  7)  det  [/  -  Gp1  (i;  j).  (98) 

ft;) 

Diagonalizing  G  as  in  §4.1  above,  in  the  limit  n  —  »•  oo,  0V2n/7r  =  t  constant,  one  has 

\  \ 

-"-'  (99) 


the  sum  being  taken  over  all  integers  71,...,  jr,  with  0  <j\  <J2  <  •  •  •  <jr-  If  we 
introduce  an  extra  variable  z  and  write 


zXi(t}),  (100) 

i=0 


then  (93)  and  (99)  can  be  written  as 


5.1  Spacing  function  for  the  circular  unitary  ensemble 

To  compute  the  spacing  functions  for  circular  ensembles  is  as  easy  or  as  difficult  as  for 
the  Gaussian  ensembles.  From 


-  e),  (102) 

one  has 

Y[     \&ieJ  -  e**  |  =  constant  e-i( 

l<7<*<n 

=  constant  det[ej>^],  (103) 

with 

P=4(n-l),  4(»-3),...,J(n-3),  l(n-l),  (104) 
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.,...,  0B)  =  constant  detfe'^]  det[e-^>] 
=  constant  det 


(105) 


where  p  and  #  take  values  as  in  (104).  The  probability  that  the  interval  (—a,  a)  does  not 
contain  any  eigenvalue  of  a  random  unitary  matrix  taken  from  the  circular  unitary 
ensemble  is  therefore 


*  ° 


det 


[**• 
£"2(0,  a)  =  constant    / 

Ja 

11         rltr-a  I 

—    /         e/(p-«)(9dd 
2?T    /ft 
«/  Q  J 

r         i     ra 

=  det  OM  -  T-    /     cos(p  - 

L       2?r  y_Q 


A* 

=  det 


pq 


(106) 


the  constant  is  fixed  from  the  normalization  £"2(0, 0)  =  1.  Also 

-  cos(pQ)  cos(q0)][$qr  -  sin(q9)  sin(r^)]  =  [6pr  -  cos(/7  - 


so  that 


with 


=  F+F_, 


(107) 
(108) 


r       i    ra 

=  det  \Spq-—      .  cos(p6] 
L          27r  ^-a 


Note  that 


and 


_  =  det  \6pa-— 
L          2?r 


cos 


(109) 
(110) 


are  the  even  and  odd  parts  of  sin(p  —  q)oc/(ir(p  —  q)}.  When  n  —  »•  oo,  a  —  *  0  and  net/ 
(2?r)  =  ?  is  finite,  the  limits  of  the  determinants  D,  D+  and  D_  are  the  values  of  D(z,  t), 
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i=0 


where  A,  are  the  eigenvalues  of  the  integral  (91).  A2l-  and  A2l-+i  correspond  respectively  to 
its  even  and  odd  solutions,  i.e.  they  are  the  eigenvalues  of  the  integral  equation 


=  f  K±( 
J-t 

where  K+(jc,y)  and  K-(x,y]  are  the  even  and  odd  parts  of  K(x,y], 


sin  K(X  -  y)     sin  TT(X 


5.2  Orthogonal  ensemble,  {3  =  1 

For  j(3  =  1  ,  integration  over  alternate  variables  is  possible  as  follows.  We  order  the 
variables  as  jci  <  X2  <  •  •  •  <  xn,  so  that 

n  fe  -  *i  =  life  -  **)  =  det  t^"1^*-!,.^  '  (i  15) 


PI(JCI,  .  .  .  ,*„)  =  constant  det  [<^-i(^k)]j,fe=i,...,«  >  (116) 


or 


where  A:I  <  ^2  <  •  •  •  <  ^n  and  </>/(x)  are  orthonormal  functions,  (26).  Note  that  (i)  In 
P\(XI,  .  .  .  ,jcn)  we  have  changed  the  weights  from  exp(-  Yitf}  to  exP(~  S^/2)'  ^s 
does  not  matter  much  since  one  can  change  the  scale  of  the  Xj  ;  (ii)  In  the  determinant  in 
(116)  we  can  replace  (fa+i(x)  by  ^.(jc)  if  n  is  even,  n  =  2m.  This  is  because  fy(x), 
<pj-i(x]  and  <f)j+i(x)  have  a  linear  relation  (verify!) 

V20JW  =  V%-i  W  -  N/TTT^+i  (x)  .  (117) 

To  see  this,  we  replace  02m-  1  in  the  last  row  by  a  linear  combination  of  0'2m_2  an(^  02m-3, 
and  eliminate  ^»2m-3  in  this  row  by  adding  a  suitable  multiple  of  a  previous  row.  Then 
replace  the  row  <^2m-3  by  a  linear  combination  of  02m-4  an^  02m-5>  eliminate  <^2m-5  in 
this  row  and  so  on,  till  we  reach  the  row  <f>\  which  we  replace  by  <fi'Q. 

(iii)  When  n  is  odd,  n  =  2m  +  1,  by  a  similar  procedure  we  can  replace  the  rows  <£2y  by 
02/-P  ex°ept  for  the  first  row  which  remains  <£o-  Let  us  come  back  to  our  calculation  of 
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El  (0,  0)  =     [  Pl  (Xl ,  .  .  .  ,  X 
J 

f  det  [**->]  nM(*>-^X  (118) 

•/•«<^2<"-<X 


=  constant 

where  u(x)  =  1,  if  \x\  >  9  and  u(x]  =  0,  if  jc|  <  0.  With  the  ordering  of  the  variables  one 
has  to  multiply  the  result  in  the  last  line  by  n\  which  is  absorbed  in  the  constant.  As  x\ 
occurs  only  in  one  column,  we  can  integrate  over  it.  This  replaces  the  first  column  by 

/Xl 
u(x\  }xj '  e~*?/2dr.i        r  =  0  1  n— 1  (1191 

"•VAUA  1C  UA1  »          J    VJ,l,...,n  1.  {LL?) 

•OO 

Now  X2  appears  in  two  columns  and  so  we  integrate  over  *3  replacing  the  third  column  by 
Fj(x4)  —  Fj(x2).  But  we  already  have  a  column  Fj(x2),  add  it  to  the  third  column,  which 
becomes  Fjfa}.  In  the  same  way,  integration  over  xs,  x-j,  ...  replaces  the  corresponding 

columns  by  Fjfa),  Fj(x%), If  n  is  even,  n  =  2m,  each  of  the  remaining  variables  X2, 

*4i  •  •  • )*2m  occur  in  two  columns,  [Fj(x2k),xJ2k].  If  n  is  odd,  n  =  2m  -+- 1,  there  is  an  extra 
column  FJ(OO)  on  the  extreme  right.  We  have  still  to  integrate  over  X2,  *4,  . . . ,  x^m  with 
the  restrictions  X2  <  x$  <  •  •  •  <  X2m,  and  each  variable  occurs  in  two  columns.  But 
interchanging  any  two  variables  we  interchange  two  columns  with  two  other  columns  and 
the  determinant  does  not  change.  So  the  integrand  is  symmetric  in  the  remaining  variables 
and  we  can  remove  their  ordering  provided  we  divide  the  result  by  m!,  which  will  be 
absorbed  in  the  constant  outside. 

/OO 
det[Fj(x2k),  u(x2k)xj2ke~^]dx2  dx4  •  •  •  dx2m.      (120) 
•OO 

In  case  n  is  odd,  n  =  2m  +  1,  there  is  an  extra  column  F7-(oo).  For  simplicity  we  will  take 
n  to  be  even,  «  =  2m.  If  we  expand  the  determinant  in  (120)  by  the  first  two  columns 
containing  X2,  then  the  cofactor  by  the  two  columns  containing  X4,  and  so  on,  and 
integrate  over  X2,  JC4, . . . ,  we  observe  that  (1)  the  integral  is  a  sum  of  terms,  each  term 
being  the  product  of  m  factors  of  the  form 

/OO 
r  E1.  f  Y\  Y^   7T    ( Y\  Y^l  u( TC^f* —      '      H  Y  /7i  •  (  1  ^  1  ^ 

I     J  \     /  *\     /J\/  */'  \  / 

•OO 

(2)  the  indices  of  the  various  factors  a^  in  any  term  are  all  distinct,  and  they  are  in  all 
(0, 1, . . . ,  2m  —  1).  (3)  We  can  restrict 7'  <  k  in  each  ajk,  since  if  7  >  k,  then  we  replace  a/* 
by  -akj .  (4)  The  coefficient  of  the  term  ajlj2aj3j4  •  •  •  aj^j^  is  +1  or  —1  according  as  the 
permutation  from  (0, 1, . . . ,  2m  —  1)  to  (71, 72,  •  •  •  ,72m)  is  even  or  odd.  Therefore  the 
result  is  a  Pfaffian 


Ei(0,0)  =  constant 

(j) 

=  constant  Pf  [ajk}j,k=o,i,...,2m-i 
=  constant  (tet[ajk]jt fc=o,i,...l2m-i)1/2,  (122) 
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with  ajk  given  by  (121).  One  has  from  the  symmetry 

02;,2fc  =  «2j+l,2Jt+l  =  0, 

so  half  the  matrix  elements  a^  are  zero,  and  the  matrix  in  (122)  has  the  form 

0        flQl        0        <3Q3       ... 

«io      0      «12      0      ... 

0        di\         0        #22 
A  rv 

#30        U        #32        « 

Collecting  the  non-zero  elements  in  one  corner  one  has 


Hence 


(123) 


(124) 


(125) 


(126) 


Ei(0, 0)  =  constant  det[fl2/,  2*4-1  ];,jfc=o,...,m-i  • 

Without  changing  anything  in  the  above  argument  we  could  have  replaced  Jt^e"^/2  in  the 
Vandermonde  determinant  (118)  by  02*(*/)  and  j^fc+1e~^/'2  by  02Jk(jc,-).  With  this  choice 
(123)  is  still  valid  and 


, 

=  fyk  ~    I 
J- 

(Verify  it!)  Therefore 

^(0,0)  -  det  L  - 


(127) 


j,fc=0,l,...,m-l 


where  A2i  are  the  eigenvalues  of  the  integral  equation 


(128) 


(129) 


with  Kn+(x>y]  the  even  part  of  KR(x,y), 

m-l 


j=0 


-x,y)},  (130) 

with  Arn(jr,3;)  given  by  (89).  As  n  ->  oo,  while  x\/2n/7r  and  y\/2n/ir  are  kept  finite,  the 
integral  (129)  goes  over  to 


(131) 


Random  matrices  and  matrix  models 
with 

*+(*.?)  =2  [    "foly/    ^"(^-.^l'  (132) 

and  we  have  the  result 


)  =  *+(!,*)  (133) 

with  F+(z,t)  given  by  (112). 

The  calculations  of  E\  (2r,  0)  and  of  £1  (2r  +1,0}  are  long  and  I  will  not  write  them 
here  again,  referring  you  to  either  the  relevant  pages  of  RM  or  to  the  article  by  Mehta  and 
des  Cloizeaux  (see  the  list  of  references).  The  result  is 


(134) 

^(ir-  1,0  =£1(0,0  E  ^^-'-j^E^w/'^w^, 

(135) 
where  ^(jt)  are  the  solutions  of  the  integral  equation,  (91)  normalized  as 

4x  =  8j,k.  (136) 


-t 


From  (134)  and  (135)  one  sees  that 


(Z,OU  (137) 

and 


-  E 


which  will  be  shown  later  to  be  equal  to 


5.3  Circular  orthogonal  ensemble 

For  the  circular  ensemble,  case  (3  =  1,  one  can  use  exactly  the  same  method  of  integrating 
over  alternate  variables.  Let  us  compute  the  average 

P»v»«.xt>is.  _    r     Plruc       Vnl     Aft     IMn     1       Toniiaw   1  0OT   fDoi-t  n 
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v(0fe),  (140) 

alt  alt 


where  u(9]  and  v(0)  are  arbitrary  functions  defined  over  (-TT,  ?r)  and  for  simplicity 
we  have  taken  n  even,  n  =  2m.  The  product  fj^  is  taken  over  a  set  of  alternate  points  0/ 
as  they  lie  on  the  unit  circle,  and  Yl'^  is  the  product  over  the  remaining  alternate  points. 
From  (103),  taking  the  Oj  to  be  ordered  — TT  <  6\  <  #2  <  •  •  •  <  02m  <  TI%  one  has 

I   det[e'^]  f[K(02;-i)v(02/)d0i  •  •  -d02m,  (141) 


//  =  constant 

As  in  §  5.2  ,  we  integrate  over  one  set  of  alternate  variables,  say  0i,  03,  . . . ,  02m_ i,  remove 
the  ordering  of  the  remaining  variables  and  then  integrate  over  them.  For  this  purpose  we 
define 

r  r 

i(Q}v((j)}(e,i(pe+q®  -e'(p*+^))d0d(/),  (142) 

£<7T 

then  as  in  §5.2,  the  result  is 

H2  =  constant  det  [aM],   p,q  =  -m  +  i,  -m  +  f, . . .  ,m  -|,m  -\.      (143) 

Integration  in  (142)  can  be  done  explicitly  for  simple  functions  u,  v,  and  factors  like  l/p 
can  be  removed  outside.  Also  we  can  invert  the  order  of  the  columns.  So  setting 

fep,,  =  |^flp-g)  (144) 

one  has 

H2  =  constant  det  [bpiq[.  (145) 

If  «(0)v(<£)  =  w(-0)v(-0),  then  b-p (_fl  =  ^Pi9,  and  there  are  further  reductions,  now  /f 
itself  can  be  written  as  a  determinant, 

H  =  constant  det  [Fp> g]  ( 1 46) 

with 

A    n   «    —    On  a      \      O  —  jjfj 


=  2^ 

•/       ^—  7T 


(147) 


If  M(^)  =  v(ff)  =  1,   then  Fpifl  =  6ptq,   and  the  constant  in  (146)  is   1   for  correct 
normalization.  If  u(0)  =  v(9)  =  1  for  -TT  +  a  <  B  <  TT  -  a,  and  M(0)  =  v(0)  =  0  for 


"i     r^ 
y  q  =  Sp  q /    cos(p@)  cos(qQ)dO 

T  J-a 

_  sin(p  —  q)a      sin(p  +  q)a 


(148) 


and  det[Fp)0]  in  the  limit  of  large  m  with  ma/ir  =  t  becomes  F+(l,f).  Next  choose 
u(9}  =  v(8)  =  1  for  -TT  +  a  <  Q  <  TT  -  a,  u(9]  =  0,  v(0)  =  2,  for  TT  -  a  <  0  <  TT  +  a. 
This  gives 


and  H  —  det  [Fp>q]  is  the  probability  for  the  interval  (TT  —  a,  TT  +  a)  to  contain  either  0  or 
1  eigenvalue,  i.e.  H  =  Ei(0,  a)  +  E\  (1,  a).  (Why  v(0)  =  2  and  not  1  in  (TT  -  a,  TT  +  a)?) 
In  the  limit  of  large  m  and  ma/Tr  =  t,  this  determinant  goes  over  to  F_(l,  ?). 

6.  Random  matrices  in  physics 

In  physics  three  discrete  symmetry  operations  parity,  charge  conjugation  and  time 
reversal  have  gained  popularity.  Here  we  will  be  concerned  with  only  the  last,  the  time 
reversal  operation  T.  From  physical  considerations  time  reversal  has  to  be  an  anti-unitary 
operator.  In  the  Schfbdinger  equation  —idifc/dt  =  Hif>,  H  is  often  real,  so  the  validity  of 
the  same  equation  with  t  — >  —  t  means  changing  the  sign  of  i  or  taking  the  complex 
conjugate.  As  any  antiunitary  operator,  T  can  be  written  as  T  —  KC,  K  a  unitary  operator 
and  C  the  complex  conjugation  operator.  When  the  representation  of  the  states  is  changed 
by  a  unitary  operator,  ifj  — >•  Ui/j,  T  transforms  to 


r->  UTU~l  =  LTt/t,  (150) 

or  K  transforms  to 

K->UKUT.  (151) 

Operating  twice  with  T  should  leave  the  physical  system  unchanged,  i.e. 

r2  =  a-l,  |a|  =  l,  (152) 

where  1  is  the  unit  operator.  Or 

T2  =  KCKC  =  KK*CC  =  KK*  =  al .  (153) 

But  K  is  unitary, 

Ktf  =  l.  (154) 

So 

K  =  aKT  =  a(aKTf  =  o?K.  (155) 
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Therefore 

o?  =  l    or    a  =  ±l,  (156) 

so  the  unitary  K  is  either  symmetric  or  anti-symmetric,  i.e. 

KK*  =  1,  (157) 

or 

KK*  =  -1.  (158) 

These  alternatives  correspond  respectively  to  integer  or  half  odd  integer  spin  (see 
Wigner's  text  on  Group  Theory).  If  K  is  symmetric,  then  it  can  always  be  written  as 

K  =  UUT  (159) 

and  a  transformation  T/>  — >  t/"1^  performed  on  the  states  brings  K  to  unity  (eq.  (151)). 

K=l.  (160) 

After  one  such  transformation  is  found,  further  transformations  ijj  —»  J?V  are  allowed  with 
RRT  —  RR\  =  1,  to  keep  (160)  valid.  If  the  hamiltonian  is  invariant  under  time  reversal 
and  has  integer  spin,  a  representation  of  the  states  can  be  chosen  in  which  it  is  a  real 
symmetric  matrix,  and  further  transformations  of  the  states  by  real  orthogonal  matrices  is 
allowed.  This  corresponds  to  the  Gaussian  orthogonal  ensemble  (GOE).  Also  if  a  system 
is  invariant  under  space  rotations  (total  spin  integer  or  half  odd  integer),  the  hamiltonian 
H  commutes  with  each  of  the  three  components  of  the  total  angular  momentum,  in  the 
standard  representation  of  the  angular  momentum  matrices  K  can  be  chosen  such  that  it 
commutes  with  H,  and  an  H  invariant  under  T  is  real  symmetric.  This  also  corresponds  to 
the  Gaussian  orthogonal  ensemble  (GOE).  If  K  is  anti-symmetric,  then  it  can  be  written 

as  K  =  UZUT,  where  the  only  non-zero  elements  of  Z  are  the  2  x  2  blocks 

L"1    UJ 

along  the  diagonal.  A  representation  of  the  states  can  be  chosen  such  that  K  reduces  to  Z. 
Being  unitary  K  is  non-singular  and  the  number  of  rows  and  columns  in  K  is  even. 
Representing  every  thing  in  terms  of  quaternions  a  detailed  consideration  shows  that  H 
must  be  a  quaternion  real  self-dual  matrix  and  further  transformations  of  the  states  by 
only  symplectic  matrices  is  allowed.  This  then  corresponds  to  the  Gaussian  symplectic 
ensemble  (GSE).  A  system  whose  hamiltonian  is  not  invariant  under  the  time  reversal  T 
corresponds  to  the  Gaussian  unitary  ensemble  (GUE).  Till  now  we  talked  about  the 
mathematical  methods  used  to  calculate  quantities  like  the  correlation  functions 
Rm(x\, . . . ,  jcm)  or  the  spacing  functions  E(r,  f).  But  when  experimental  data  is  compared 
with  the  theory,  one  uses  only  the  2-point  function  R2(xi ,  JC2),  rarely  3  or  4-point  functions 
and  instead  of  E(r,  t)  one  uses  d2E(0,  t}/&?  =  p(s],  s  =  2t,  giving  the  probability  density 
of  the  spacings  between  nearest  neighbors.  To  make  a  quantitative  judgement  of  how  well 
two  curves  fit  together  is  difficult  since  one  does  not  know  the  "thickness"  of  the 
theoretical  curve.  So  one  resorts  to  various  quantities  called  "statistics"  which  can  be 
computed  from  the  experimental  data  alone  and  whose  average  and  dispersion  can  be 
estimated  from  the  theory.  For  example,  the  number  variance  gives  the  average  value  and 
dispersion  of  the  number  of  eigenvalues  in  an  interval  of  given  length,  its  skewness  and 
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excess,  the  so  called  A  statistic  gives  the  least  square  best  fit  of  a  straight  line  to  the 
staircase  representation  of  the  increase  in  the  number  of  eigenvalues  as  a  function  of 
energy,  covariance  of  two  consecutive  spacings,  the  /-statistic  to  estimate  how  well  a 
given  eigenvalue  fits  in  the  sequence  of  eigenvalues  belonging  to  a  given  ensemble,  etc. 
The  experimental  data  examined  so  far  consisted  of  (1)  slow  neutron  resonance  energies 
in  various  nuclei  such  as  rare  earth  and  fissionable  nuclei,  (2)  electromagnetic  resonance 
energies  in  nearly  two  dimensional  microwave  cavities,  (3)  possible  quantum  energies  of 
a  free  particle  confined  to  various  shaped  two  dimensional  surfaces  (quantum  billiards). 
This  extensive  study  is  mostly  numerical,  (4)  ultrasonic  resonance  frequencies  of  metallic 
beams,  (5)  conductance  of  heterogeneous  quenched  metallic  wire  (mesoscopic  systems), 
(6)  zeros  of  the  zeta  function  on  the  critical  line,  (7)  distribution  of  trees  in  a  forest,  of 
district  head  quarters  in  various  states  and  countries,  of  centers  of  bubbles  in  a  froth,  — 
To  appreciate  these  one  should  have  a  look  at  the  various  data  fitting  curves  in  the 
literature. 

7.  Other  ensembles 

7.1  Ensemble  of  matrices  without  any  symmetry 

People  have  considered  ensembles  of  real,  complex  or  quaternion  matrices  which  are  not 
restricted  to  be  hermitian  or  unitary.  The  probability  density  of  each  matrix  element  is  still 
taken  to  be  Gaussian  for  simplicity.  The  eigenvalues  of  such  matrices  are  complex  in 
general  and  the  question  is  again  what  can  you  say  about  a  few  of  its  eigenvalues.  As 
expected,  the  case  of  complex  matrices  is  mathematically  the  simplest.  One  takes 

P(H)  =exp  -^\HJk\2    =exp[-tr(#t#)])  (161) 

iyr(0)  jri-C1)  (^f^D\ 

Hjk  Atijk  )  '  •  VlbA> 

where  H^  =  Hj^  +  iH^  ,  and  HJ^  and  H^'  are  real.  To  say  anything  about  the 
eigenvalues  one  has  first  to  change  variables  from  the  matrix  elements  to  the  eigenvalues 
Zj  and  some  extra  variables  PJ.  Now  to(H^H]  contains  not  only  the  eigenvalues  Zj  but  the 
extra  variables  as  well,  and  one  has  to  choose  them  carefully  so  as  to  facilitate  later 
integrations.  I  will  not  go  in  the  details  of  these  calculations.  The  final  result  is 

n 

P(ZI,  . . . ,  zn)  =  exp  —  ^  \Zj\2       JJ      \Zj  —  Zk\2-  (163) 

This  joint  probability  density  is  much  easier  to  work  with  than  the  Gaussian  unitary 
ensemble  (GUE)  case.  The  reason  is  the  orthogonality  of  the  powers  of  z  =  x  -f  zy, 

_U2     ;  Jtt  ,      ,  .  i  c  /-i  £ A\ 

exp  ''  zz   dxdy  =  irjlQjk,  U°4j 


— oo 


and  one  does  not  have  to  construct  orthogonal  polynomials.  The  computation  of 
correlation  functions  Rm(zi, .  •  •  ,Zm)  follows  the  same  lines  as  for  the  GUE.  The  next 
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difficult  case  is  that  of  quaternion  matrices.  There  also  the  joint  probability  density  for  the 
complex  eigenvalues,  which  now  come  in  complex  conjugate  pairs,  turns  out  to  be 


P(zi,...,zn)=exp 


n  fe-*2  n 

'  :« 

(165) 


and  to  compute  the  correlation  function  Rm(z\t.  •  •  ,Zm)  one  can  use  me  same  metnod 
of  expressing  the  right  hand  side  of  (165)  as  a  quaternion  determinant  det[/(z/,z*)], 
the  function /(z,  £)  satisfying  certain  conditions.  For  real  matrices,  when  the  eigenvalues 
are  ail  real,  the  resulting  joint  probability  density  for  the  eigenvalues  is  the  same  as  for  the 
(GOE).  But  the  eigenvalues  of  a  real  matrix  are  not  necessarily  all  real,  and  the  joint 
probability  density  of  the  eigenvalues  has  not  yet  been  worked  out. 

7.2  An  integral  over  a  unitary  group 

The  following  deceptively  simple-looking  integral  has  proved  to  be  very  useful  in  dealing 
with  two  matrix  models  and  the  intermediate  matrix  ensembles.  The  integral  can  be 
written  as 


7=1 


/  exp[-c  tr  (A  -  B}2}  f(b)dB  =  *i  /  exp 

J  .  J 

(166) 


det[exp(-c(o/  - 


r  .  -  det  expf-da/  -  bk)  ) I,  fc_, 

/  exp[-ctr(A-i7BC/-1)2]d[/=fc2      l      V    AfaW^  '     ^ 


where  c,  fci  and  fc2  are  constants,  A  and  B  are  complex  hermitian  n  x  n  matrices  with 
eigenvalues  a-}  and  bj  respectively,  f(b}  is  a  function  depending  only  on  the  eigenvalues  of 
B,  A(fl)  =  Y[j<k(aj  ~  aO>  similarly  for  A(b),  the  integral  in  (167)  is  over  the  group  of 
n  x  n  unitary  matrices,  dB  =  ]T.  dfi//  ]TkdBL'dBJ^  and  dfc  =  d^i  •  •  •  dbn.  No  such 
integral  over  the  orthogonal  or  symplectic  group  is  known. 

7.3  Intermediate  ensembles 

We  saw  that  for  the  time  reversal  invariant  systems  with  integral  spin  or  rotational 
symmetry,  GOE  is  a  good  model,  whereas  when  time  reversal  invariance  is  not  valid  GUE 
is  a  good  model.  What  can  one  say  if  the  time  reversal  invariance  is  slightly  broken?  One 
should  consider  then  an  ensemble  of  real  symmetric  matrices  plus  a  small  non-real 
hennitian  part,  an  ensemble  intermediate  between  GOE  and  GUE  slightly  away  from 
GOE.  Similarly,  one  should  consider  an  ensemble  of  matrices  intermediate  between  GSE 
and  GUE,  slightly  away  from  GSE.  Thanks  to  the  integral  (167)  one  can  study  these  cases 
of  intermediate  ensembles  not  only  "near"  GOE  or  GSE,  but  anywhere  in  between.  I  will 
not  go  into  the  details  of  this  calculation.  The  conclusion  is  that  the  transition  from  GOE 
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8.  Matrix  models 

In  these  models  one  needs  to  evaluate  certain  integrals  of  the  form 

-tr 


Z  =   /  exp 


(168) 


where  V/(jt)  are  even  polynomials  in  x,  the  n  x  n  matrices  M^  vary  over  one  of  the  three 
domains;  (i)  all  real  symmetric  matrices,  (ii)  all  complex  hermitian  matrices,  and  (iii)  all 
quaternion  self-dual  matrices.  These  domains  are  characterized  by  a  parameter  (3  taking 
values  1,  2  and  4  respectively.  The  integral  Z  depends  on  the  size  n  of  the  matrices,  on  the 
parameters  entering  the  polynomials  Vt(x)  and  the  coupling  coefficients  cy-.  Or  one  may 
interest  oneself  in  the  statistical  properties  of  a  few  of  the  eigenvalues  of  these  matrices, 
the  so  called  correlation  functions  and  spacing  functions.  Of  particular  interest  is  the  limit 
when  n  is  large.  Analytical  solutions  are  known  for  one  matrix  in  the  three  domains,  and 
for  two  matrices  when  the  matrices  are  complex  hermitian. 

8.1  One-matrix  case 
Consider  the  partition  function 

,  n}  =      exp[-  tr  V(M)]dM,  (169) 


with  V(x)  an  even  polynomial  of  degree  2v.  The  case  V(x]  =  x2  has  been  extensively 
studied  in  the  random  matrix  theory.  We  take  V(x)  =  x2  +  gx4/n.  The  case  of  a  general 
polynomial  V(x)  is  similar  though  more  cumbersome.  For  (3  —  1,  we  will  take  M  to  be  a 
2n  x  In  real  symmetric  matrix  and  write 

(170) 

For  (3  =  2  and  ft  =  4,  we  will  keep  the  definition  implicit  in  (169).  In  the  limit  of  large  n 
we  write 

£(«te,n)  =  ^'fe)  +  i£<«(g)  +  l£f  («)+....     (171) 

The  question  is  to  compute  Ej  (g)  successively.  Another  set  of  questions  is  related  to  the 
evaluation  of  the  m-point  correlation  function 


=    /    Y[  fJ,(Xi]  "[[  \XJ  ~  Xkfdxm+ldxm+2  -  -  -  ,  (172) 

J 


j<k 

or  to  the  corresponding  spacing  function  when  the  variables  Xj  are  outside  a  given  interval, 
for  example.  As  expected,  the  case  /3  =  2  is  .the  simplest  and  was  worked  out  first.  The 
cases  ft  =  1  and  j3  =  4  were  taken  up  later  as  they  involve  .skew-orthogonal  polynomials 
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y.  urmogonai  ana  SKCW  ormogonai  polynomials 

The  first  step  is  to  take  the  eigenvalues  of  M  (and  some  other  auxiliary  parameters)  as 
new  variables.  This  gives 


exp 


(173) 


where  the  constant  K  is  independent  of  g,  though  it  depends  on  n.  Now  introduce 
orthogonal  polynomials  Cj(x),  if  0  =  2,  and  skew-orthogonal  polynomials  Qj(x)  or  Rj(x) 
of  appropriate  kind  if  (3  -  4  or  /3  =  1.  It  is  convenient  to  take  all  these  polynomials  to  be 
monic  (monic  implies  that  the  coefficient  of  the  highest  power  is  one). 


(Qy,  0.2k)  Q  =  (<22;+i  ,  Q2k+i)Q=  0,     (£> 
(R2j,R2k)R  =  (Ry+itR2k+i)R=  0,       (R2j,R2k+i)R=  rj8jk,  (174) 

with  the  definitions  of  (symmetric  or  anti-symmetric)  scalar  products 


—  oo 

1 


/oo      /-oo 
/      /iW/2(y)6(x-y)e-^W+^))/2ckdy,  (175) 

-oo  J—  oo 

where  e(jc)  =  ±1/2  according  as  j:  is  positive  or  negative.  In  terms  of  the  constants  hj,  qj 
and  TJ  one  has 


n-l 

ZiU,n)=Jf.(2n)!  JJry, 

7=0 

n-l 


j-.  (176) 

One  can  express  the  polynomials  /?/(x),  Cj(je)  or  Qj(x)  as  linear  combinations  of  those 
from  any  other  set.  If  one  expresses  Rj(x]  in  terms  of  Cj(x)  or  Cj(x)  in  terms  of  Qj(x), 
then  such  linear  combinations  contain  v  or  v  +  1  terms  depending  on  the  parity  of  j, 
where  2v  is  the  order  of  the  polynomial  V(jc).  This  is  so  because  Cj(x)  is  orthogonal  to 
any  polynomial  of  degree  less  than  j,  Qy(x}  or  R2j(x)  is  skew-orthogonal  to  any 
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polynomial  of  degree  less  than  2j  +  I  and  Qy+i  (x}  or  R2j+i  (x)  is  skew-orthogonal  to  any 
polynomial  of  degree  less  than  2j. 

9.1  Recurrence  relations  for  hj,  qj  and  TJ 

The  polynomials  Cj(x)  satisfy  a  three  term  recurrence  relation 

xC'(x]  —  C-     (x}  +  o-C-     (x] 

pj  =  hj/hj-l.  (177) 

From  now  on  we  will  take  V(x]  =  x2  +  gx4/n.  To  get  the  recurrence  relations  for  p/, 
consider  the  integral 

/oo 
V'(x)C-(x}C--i(x)e~v^dx.  (178) 

•oo 

This  can  be  evaluated  in  two  ways.  Express  V/(x}Cj,i(x}  as  a  linear  combination  of 
Cj+2(x],  Cj+i(x],  . ..,  C/_4(x)  and  use  orthogonality  to  get 

1  =  2(1  +  (2g/n)(pj+1  +  Pj  +  p7_i)]/V  -  (179) 

And  by  partial  integration  we  get 


/O 
- 


Comparing  the  last  two  equations 

Pj(l  +  (2g/n}(Pj+l  +Pj  +  p;--0]  =  j'/2.  (181) 


Equations  (177)  and  (181)  give  A/,  pj  in  terms  of  the  initial  values  ho  =  f^e^^dx, 
^  =  Jf°  x2e~vWdx,  po  =  03  and  pi  =  h\  //IQ.  In  the  limit  of  large  n  one  can  approximate 
PJ  by  a  continuous  function,  PJ  «  np(j/n),  so  that 


Similar  recurrence  relations  can  be  found  for  qj  and  r,-.  By  partial  integration  over  x  in 
(175),  one  has 

±(V'Cj,Rk}R  =  (Cj^c+^R^.  (183) 

Similarly, 

\(V'Qj,  Ck)Q  =  (Qj-,  Ck)Q  -  (Q'j,  Ck)c  .  (184) 

Writing 


R2j+i  (x}  =  C2j+i  (x)  +  fc/i  Cy-i  [x]  +  bj2Cy-3  (x}  , 
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Madan  Lai  Menta 


Cy(x)  =  (32;  W  +  PjiQy-2(*}  +  PflQy-4  W,  (185) 


one  can  derive  equations  relating  a/,  6/1,  bj2,  hj  and  rjt  from  which  eliminating  a/,  fyi  and 
fc/2  one  obtains  a  recurrence  relation  for  the  r;-  which  contain  fy, 

1=a  (186) 


Similarly  from  (184)  and  (185),  one  can  obtain 

/-  +  i^+i=0.  (187) 


qjqj-iqj-2 

Knowing  hjt  TQ  and  r\  one  finds  r,-,  or  knowing  hj,  qo  and  q\  one  finds  qj.  For  the 
correlation  functions  one  expresses  Y[j<k  \xi  ~  xk\  as  an  n  x  n  determinant  [/(*/,**)! 
(ordinary  in  case  /3  =  2,  quaternion  in  case  /?  =  1  or  /3  =  4)  ,  where  the  function  f(x,  y) 
satisfies  (i)  f(x,y)  =/(y,x),  where  /  is  the  dual  of/,  (ii)  ff(x,y)f(y,z)dy  = 
f(x,z)  +  A/(JC,Z)  -/(^,z)A,  A  a  constant  quaternion,  (iii)  J/(^,A:)ck  =  c;  then  one  uses 
the  theorem  that  for  such  an /(*,)>), 

\i  ,  /  ,     -i  \  j    A.\  ft  M  /1QQN 

^fc)]j  fc=l,...,nCbCn  =  \C  —  n+  l)det[/(^/,^Jt)]j1fe=l,...)n-l-  I188/ 


10.  Two  matrix  models 

The  following  two  questions  can  be  answered  analytically. 
1.  Calculation  of  the  integral 

Z=  I  F(A,B)dAdB,  (189) 

where 

F(A,  B)  =  constant  exp[-tr  (Vi  (A)  +  V2(B]  +  cAB)].  (190) 

Vi  (x)  and  V2(j:)  are  even  polynomials,  A  and  #  are  n  x  n  complex  hermitian  matrices  and 
the  integration  is  from  —  oo  to  oo  over  all  the  real  parameters  entering  A  and  B.  In 
particular,  how  does  the  integral  Z  behave  as  a  function  of  n,  c  and  the  coefficients 
entering  Vi(x)  and  V-zfc),  in  the  limit  of  large  n. 

2.  Calculation  of  the  correlation  functions  and  spacing  functions  of  the  eigenvalues  a-3 
of  A  and  fy  of  B.  To  answer  these  questions  one  needs  the  integral  over  the  group  of  n  x  n 
unitary  matrices  U, 


exp  [tr  (A  -  UBU'l}2]dU  =  constant  ,  (191) 


where  A  (a)  =  Y[j<k(&j  —  a*)»  and  similarly  for  A(fc).  The  integral  in  (191)  again  seems 
deceptively  simple.  A  similar  integral  over  the  real  orthogonal  group  (or  over  the 
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(190)  the  joint  probability  density  for  the  eigenvalues  a/  of  A  and  bj  of  5.  In  view  of 

(191)  this  is 


A(a)A(^)det[e-Cfl^]y^=lj..Mn. 

(192) 

To  evaluate  (189)  one  introduces  bi-orthogonal  monic  polynomials  Pj(x)  and  Qj(x)  for 
non-local  weight  defined  by 

exp  (-Vi(x)  -  V2(y)  -  cxy)Pj(x)Qk(y)dx  dy  =  hjSjk.  (193) 


One  writes  A(a)  ==  del  [P/_i  (**)],  A(fc)  =  det[j5|/-i  (&*)],  then  from  the  orthogonality 
(193)  we  get 


Z  =  K-n\[[hj.  .  (194) 

Recurrence  relation  for  the  hj  can  be  deduced  from  the  recurrence  relations  of  the 
polynomials  Pj(x)  and  Qj(x).  In  the  limit  of  large  n,  the  ratio  PJ  =  A///Z./-I  can  be 
approximated  by  a  function  of  a  continuous  variable  j/n.  And  finally  one  finds  how  Z 
behaves  in  that  limit.  To  get  the  correlation  functions  of  the  eigenvalues  a,j  only  (not 
observing  any  of  the  bj)  is  also  simple  following  the  usual  method  of  writing 
fF(a.j]  bj)dbi  •  •  •  dbn  as  an  n  x  n  determinant  [/(a/,  a*)],  and  the  presence  of  the  matrix  B 
modifies  only  the  weight  in  the  definition  of  f(x,y).  Similarly,  the  correlation  functions 
of  the  eigenvalues  bj  only  are  simple  to  calculate.  To  get  the  mixed  correlation  functions 
of  dj  and  bj  is  more  complicated,  since  it  seems  difficult  to  express  F(a,j]  bj)  as  the 
determinant  of  a  2n  x  2n  matrix  with  required  properties  of  its  matrix  elements.  One 
calculates  instead  the  expected  value  of  Y(j(l  +  w (#/))(!  +v(bj)},  with  u(x)  and  v(x) 
small,  and  differentiate  functionally  with  respect  to  u(x)  and  v(x)  the  required  number  of 
times.  The  final  result  becomes  more  and  more  cumbersome  as  m  increases.  The  mixed 
spacing  functions  are  also  complicated,  though  can  be  evaluated  in  principle. 

11.  Selberg's  integral 

Ruler's  beta  integral 

r1  tt.         ^       r< 
Jo  *  dx  =  Yi 

is,  of  course,  well-known.  There  are  at  least  three  generalizations  of  this,  one  known  since 
the  last  century,  and  two  others  due  to  Selberg.  From  our  point  of  view,  the  following  one 
is  important. 
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Theorem  5.  Let 


7=1 


A(jc)=     JJ    (a.--*,),     n>2, 


0  JO  «/0 


then 


Proof  (Aomoto).  From  the  identity 


we  get  on  integration 

0  =  a(xi  •  •  • 

Similarly,  the  indentity 
dx\ 

gives  on  integration 


-  (ft  - 


•  Xm 


(196) 


(197) 
.     (198) 


(199) 


(200) 


(201) 


(202) 
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j=2 


(203) 


Now  interchanging  x\  and  jcy- 

_        /XjX2  •  •  •  Xn 
\     Xl-  Xj 


0,  if     2  <  7  <  m, 

•••*»!)     if     m<j<n. 


Similarly, 


\     Xi  -Xj 


-  X 


.  r   \      if     7  <  /  <  m 

•A./7J  /  J-J-  ^*    ^^      I     ^*    f It,* 

xm}     if     m  <  j  <  n. 


Substituting  (204)  and  (205)  in  (203)  one  gets,  upon  simplification 

(xi--xm)(a  +  P  +  'Y(2n-m-  1})  =  (x2-  •  -xm)(a  +  ^(n 
or 

,  a  +  7(w  -  m) 


-  m)), 


(205) 
(206) 


=   r 

ll 
which  is  (197).  Setting  m  =  n  in  (197),  one  has 


(207) 

(     ° 


n-l 


(208) 


Thus  one  can  decrease  the  value  of  o;  by  one.  But  changing  jc/  —  •  »•  1  —  Xj,  one  sees  that 
/(a,  0,  7,  n)  =  /(/?,  a,  7,  n),  (209) 

so  one  can  also  decrease  /3  by  one.  If  a  and  (3  are  integers,  we  can  express  /(a,  /?,  7,  n)  as 
7(1,  1,7,  n)  multiplied  by  a  known  constant.  One  finally  has  to  evaluate  7(1,  1,7,  n).  For 


this  observe  that 


7(1,1,7,")=  f 

^0 


(210) 
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Now  change  variables  as 

xn  =  t,     Xj  =  tyj,     for    j  =  1,  .  .  .  ,  n  -  1,  (211) 

so  that 


(212) 
and 


7(1,  l,7,n)  =  n!  /  d^"-1)*-1  f 

JO  J0<yi<y2<-<y«-i< 


1-1 


Thus  we  have  related  7(l,l,7,n)  to  7(1,27+  l,7,n-  1)  or  to  7(l,l,7,n-  1)  due  to 
(208)  and  (209).  For  a,  jS  and  7  positive  integers,  one  can  therefore  express  7(a,  (3,  7,  n) 
as  7(1,1,  7,  1)  multiplied  by  a  known  constant.  Finally  7(1  ,  1  ,  7,  1)  =  1.  Collecting  all  the 
constants  carefully  one  gets  Selberg's  integral  (198)  for  a,  /3  and  7  positive  integers.  For 
other  values  of  a,  (3  and  7  one  uses  analytical  continuation. 

(Exercise:  Using  Aomoto's  argument,  calculate  {*i},  (x^),  {*}}  where 


and 


11.1  5<9me  consequences  of  Selberg's  integral 

One  can  of  course  change  the  limits  of  integration  (0, 1)  to  any  finite  values  (a,  b)  by  a 
linear  change  of  variables,  so  the  weight  x0"1  (1  —  xf~  can  be  changed  to  the  weights 
for  classical  orthogonal  polynomials  on  finite  intervals  such  as  Jacobi,  Gegenbauer, 
Chebychev  or  Legendre.  Also  by  a  linear  change  of  variables  and  taking  limits  one  has 
integrals  of  the  form 


or 


r 

JQ 

f 

J-c 


(214) 


where/  is  polynomial  in  dy/d/,  rational  in  y,  analytic  in  t  and  solutions  of  (216)  do  not 
have  any  movable  singularities  except  possibly  for  poles.  (A  singularity  is  called  movable 
if  its  position  can  be  changed  by  changing  the  initial  conditions.)  He  found  that  by  a 
change  of  the  dependent  or/  and  independent  variable(s),  such  an  equation  can  always  be 
reduced  to  one  of  the  50  equations  listed  by  him.  Out  of  these  50  equations  44  were 
known  before  (functions  known  as  trigonometric,  hyperbolic,  elliptic,  Mathieu, 
Whittaker,  .  .  .),  and  6  were  new.  Actually  Painleve  found  only  3  of  them,  the  other  3 
were  found  by  his  student  Gambier.  These  6  equations  are  now  known  as  Painleve 
equations  and  their  solutions  as  Painleve  transcendents.  They  are  numbered  from  1  to  6. 
In  the  standard  form  they  read  as: 


d2y          , 
P-2  :     -^  =  2y3  +  ty  +  a 

d2y      1  /dy\2     1  dy      1     •  2      „.          ,      8 

P-3:     -ry  =  -    -r     —  -f  +  -(ayi  +  j3  +73,3+_ 

dt2      y  \dtj       t  dt      t^  y 


P_5.  _  Idv 

i    J  .        ,  r,  —  I  ~     r         ^   I  I    ,    I  . 


,  r,  —  ^          ,  . 

At2       \2y     y-lj  \dtj       t  dt           t 


t          y-1 
1/11  1 


'      dt2       2\y      y-l      y 


where  a,  ^,  7,  ^  are  com-plex  constants.  Later  people  studied  more  complicated  second 
order  and  simple  third  order  differential  equations  to  see  whether  they  can  find  some  new 
transcendents,  with  no  success  yet.  Why  are  these  equations  important  for  us? 
Consider  the  integral  equation 

(*,  y)/(y)dy,  (218) 
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with  the  kernel 

*(*,y)=^f  (219) 

TT(X  -  y) 

The  solutions  of  (218)  have  a  definite  parity,  they  are  either  even  or  odd,  /(-*)  =  ±/(X) 
(Prove!).  Even  (odd)  solutions  are  also  solutions  of  an  integral  equation  with  the  kernel 

,y}}.  (220) 

The  even  (odd)  solutions  and  the  corresponding  eigenvalues  A  will  be  denoted  by  even 
(odd)  indices.  The  corresponding  Fredholm  determinants  with  an  extra  factor  z  will  be 
denoted  by 

oo  oo 

*V(z,0  =  IR1  -***)'     F-(z,t}  =  H(l-z\v+l},  (221) 

i=0  i=0 

and 

F(Z>  f)  =  F+(z,  r)F_(z,  0  = 


i=0 

The  functions  F±(z,f)  and  F(z,  f),  or  rather  some  combinations  of  them  satisfy  certain 
differential  equations,  non-linear  in  general.  They  are  in  fact  Painleve  equations,  P5  or 
P3.  Such  equations  are  important  in  computing  the  power  series  expansion  or  the 
asymptotic  expansion  of  functions  like  Ep(r,t).  Prior  to  the  knowledge  of  these 
differential  equations  such  power  series  or  asymptotic  series  were  computed  in  certain 
cases  with  great  efforts  using  all  sorts  of  properties  of  the  spheroidal  functions,  Toeplitz 
determinants  and  inverse  scattering  theory  known  in  the  literature.  Now  with  the 
differential  equations  the  task  is  much  easier.  The  Kyoto  school  of  mathematics  (limbo, 
Miwa,  Sato  and  Ueno)  has  studied  such  equations  in  a  different  context  and  found  that 
A(0  =  -5(d/dr)  log  F(l,  t)  satisfies  the  equation 


/dA\2      dA 

y  -2  a? 


t&A     dA\  ,/dA\        dA  /  dA 

2  ^  + 


which  can  be  reduced  to  a  standard  PS.  McCoy  et  al  found  that  the  correlation 
function  in  the  2-dimensional  Ising  model  satisfies  a  PS.  Now  it  is  suspected  that 
every  critical  phenomena  in  physics  will  involve  a  Painleve  transcendent  in  its 
description.  As  Conte  once  said,  the  critical  phenomena  are  independent  of  how 
you  prepare  the  physical  system,  and  so  if  it  is  described  by  a  second  order  differential 
equation  then  the  singularities  of  the  equation,  if  any,  must  be  fixed,  i.e.  its  solution 
must  be  either  a  previously  known  function  or  a  Painleve  transcendent.  Though  I  knew 
the  existence  of  (223)  as  McCoy  insisted  on  its  importance  in  random  matrix  theory  over 
years  every  time  I  saw  him,  I  somehow  missed  the  point  for  at  least  10  years.  I  now 
realize  that  almost  all  correlation  or  spacing  functions  satisfy  a  Painleve  equation,  and  I 
hope  you  will  look  for  one  in  dealing  with  any  critical  phenomena.  Let  us  come  back  to 
the  Fredholm  determinants  F±(z,  t}.  They  describe  completely  the  spacing  functions  for 
the  three  classes  of  matrix  ensembles  Gaussian  orthogonal  (GOE),  Gaussian  unitary 
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(GUE)  and  Gaussian  symplectic  (GSE).  In  fact,  if  £/j(r,  t)  is  the  probability  that  a 
randomly  chosen  interval  of  length  2t  contains  exactly  r  eigenvalues  of  a  matrix 
chosen  from  the  GOE  (J3  =  1),  GUE  (/?  =  2),  or  GSE  (J3  =  4),  then 


---     F+(Z,OU=£i(2r,0+£i(2r-l)0,      r  >  0 


(224) 


Suppressing  the  dependence  on  z  for  simplicity,  let  us  write 

A(0  =  4pogF+(z,0  +  logF_(z,0], 

B(t)  =  4  [logF+(z,  0  -  log  F_(z,  0],  (225) 

which  then  satisfy  the  relations 

^  =  MV  (226) 

df 

=z|5(0|2,  '  (227) 

=zRe(S(0)2,  (228) 

=  z  Im  (5(0)2,  (229) 

where  the  auxiliary  complex  function  S(t)  =  S(z,  0  satisfies  the  differential  equation 

AC 

S*2}  (230) 


(5*  is  the  complex  conjugate  of  S),  along  with  the  initial  condition  5(0)  =  1.  Relation 
(226)  was  conjectured  in  1965  and  was  proved  by  Gaudin  in  1966,  others  were  discovered 
in  trying  to  write  (223)  as 

We  have  four  real  functions  A,  B,  Re  S  and  ImS  and  five  differential  relations  among 
them.  As  a  consequence  of  (226)  and  (227)  we  can  write 

A  =  z\S\2  -  2tB2.  (232) 

Eliminating  S  and  B,  we  get  (223)  for  A,  eliminating  S  and  A,  we  get  for  B  the  equation 


2 


dt 


=4B2 


J 


dB 

^ 


(233) 
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moreover  it  is  the  square  of  another  kernel, 

(234) 


Equations  (223),  (230)  and  (233)  can  be  transformed  to  Painleve  equations,  either  to  a  P5 
or  to  a  P3.  Let  us  start  with  (230).  Set  zS2  =  (R  +  i'7)2,  where  R  and  7  are  real.  Separating 
real  and  imaginary  parts  one  has 


This  pair  of  equations  expresses  R  in  terms  of  /  and  /'  and  /  in  terms  of  R  and  R'.  Such 
pairs  are  very  common  in  the  study  of  Painleve  equations  as  we  will  see.  Elimination  of  / 
gives 


Elimination  of  R  will  give  the  same  equation  for  7.  This  equation  has  singular  points  at 
±TTTt/t/2  and  oo.  To  displace  them  at  0,  1  and  oo,  we  put  y  =  (R  +  n  •\/tj2}/ 
(R-7r^/t/2},  then  y  satisfies  a  Painleve  5  (verify  this!).  Similarly  (I  +  ir^/t/2)/ 
(I  -  TCT/f/2)  satisfies  a  standard  P5.  To  get  rid  of  factors  re,  we  will  set  T  —  ret,  (  =  2z/7r, 
and  remove  factors  2/7T  in  A  and  B.  Thus  (236)  becomes,  for  example, 

*"  =  2-  +       2-)-  (237) 


To  relate  this  P5  with  a  P3,  consider  the  couple  of  equations 

R-        ^ 

R~~ 


Note  that/  is  expressed  in  terms  of  R  and  R',  and  R  is  expressed  in  terms  of  /  and/'. 
Eliminating  /  one  gets  (237),  while  eliminating  R  one  gets 


(239) 


Set/=  V«/(«-l),  or  H  =/2/(/2  -  1),  so  that 


This  is  almost  a  P5  with  6  =  0,  and  becomes  a  standard  P5  on  taking  r2  as  the  new 
independent  variable.  Consider  now  the  couple  of  equations 

2rw 


2        , 

=  W     —  OJ 


UJ  = 
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Eliminating  uj,  one  gets  (240),  while  eliminating  u  one  gets  a  P3  for  u.  As  another 
example  we  will  take  (233),  which  removing  the  unnecessary  factors  TT  can  be  written  as 


b"  +  4b  =  -  \M  +  4fc2,     b  =  4tB.  (242) 

T 

To  see  that  this  is  a  P5,  consider  the  pair  of  equations 


b=  ,     2if>  =  b'  +     b12  +  4b2.  (243) 

ip  —  2r 

Eliminating  if>,  we  get  (10.30),  and  eliminating  b  we  get 

«-*, 


To  put  this  last  equation  in  the  standard  form,  let 

*=~T'  *-£i- 

so  that 


This  is  almost  a  P5,  and  will  be  a  P5  by  taking  r2  as  the  new  independent  variable. 
Consider  now  the  pair  of  equations 


,      rf  +  7?2  -  1 

^  =  7+^+1'   ^r^-^  +  r  '      (247) 

Eliminating  77  one  gets  (246)  for  (j>  and  eliminating  0  one  gets  a  P3  for  77  in  the  standard 
form.  Similarly,  the  equation  for  A  can  be  related  either  to  a  P5,  as  was  done  by  Jimbo, 
Miwa,  Mori  and  Sato,  or  to  a  P3. 
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Abstract.  It  has  been  recognized  recently  that  fractional  calculus  is  useful  for  handling  scaling 
structures  and  processes.  We  begin  this  survey  by  pointing  out  the  relevance  of  the  subject  to 
physical  situations.  Then  the  essential  definitions  and  formulae  from  fractional  calculus  are 
summarized  and  their  immediate  use  in  the  study  of  scaling  in  physical  systems  is  given.  This  is 
followed  by  a  brief  summary  of  classical  results.  The  main  theme  of  the  review  rests  on  the  notion 
of  local  fractional  derivatives.  There  is  a  direct  connection  between  local  fractional  differentiability 
properties  and  the  dimensions/local  Holder  exponents  of  nowhere  differentiable  functions.  It  is 
argued  that  local  fractional  derivatives  provide  a  powerful  tool  to  analyze  the  pointwize  behaviour 
of  irregular  signals  and  functions. 

Keywords.     Fractal;  multifractal;  Holder  exponent;  fractional  calculus. 
PACS  Nos     47.53;  47.52;  02.30;  05.40 


1.  Introduction 

In  the  1880's,  contrary  to  the  notion  existing  till  then  that  a  continuous  function  must  be 
differentiable  at  least  at  some  point,  examples  were  constructed  [1, 2]  to  demonstrate  that 
it  is  possible  for  continuous  functions  to  be  no  where  differentiable.  Weierstrass's 
construction  provided  one  such  early  example.  For  about  three  decades  after  the 
construction  of  such  functions,  they  were  still  considered  to  be  rather  pathological  cases 
without  any  practical  relevance  to  the  physical  world.  Perrin  was  the  first  to  point  out 
their  application  in  a  real  physical  situation,  viz.,  the  problem  of  Brownian  motion.  He 
also  prophesied  [3]  the  possibility  of  a  simpler  way  of  handling  these  functions  than 
differentiable  ones. 

Recent  developments  in  nonlinear  and  nonequilibrium  phenomena  suggest  that  such 
irregular  functions  occur  much  more  naturally  and  abundantly  in  formulations  of  physical 
theories.  The  work  on  Brownian  motion  (see  [4])  showed  that  the  graphs  of  projections  of 
Brownian  paths  are  nowhere  differentiable  and  have  dimension  3/2.  A  generalization  of 
Brownian  motion,  called  fractional  Brownian  motion,  [3,4],  is  known  to  give  rise  to 
graphs  having  dimension  between  1  and  2.  It  was  also  observed  that  typical  Feynmann 
paths  [5, 6],  like  the  Brownian  paths,  are  continuous  but  nowhere  differentiable.  In  fluid 
systems,  passive  scalars  advected  by  a  turbulent  fluid  have  been  shown  [7, 8]  to  have 
isoscalar  surfaces  which  are  highly  irregular,  in  the  limit  of  diffusion  constant  going  to 
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zero.  Also  there  exist  situations  in  which  one  has  to  solve  a  partial  differential  equation 
subject  to  fractal  boundary  conditions,  such  as  Laplace  equation  near  a  fractal  conducting 
surface.  For  instance,  as  noted  in  reference  [9]  irregular  boundaries  may  appear  to  be  non- 
differentiable  everywhere  upto  a  certain  spatial  resolution,  and/or  may  exhibit 
convolutions  over  many  length  scales.  Nowhere  differentiable  functions  can  be  used  to 
model  these  boundaries. 

In  the  study  of  dynamical  systems  theory,  attractors  of  some  systems  are  found  to 
be  continuous  but  nowhere  differentiable  [10].  We  will  consider  one  specific 
example  [10,  11]  of  the  dynamical  system  which  gives  rise  to  such  an  attractor.  Consider 
the  following  map. 

xn+i  =  2xn  +  yn    modi, 

yn+i  -xn+yn    modi,  (1) 


where  x  and  y  are  taken  mod  1  and  z  can  be  any  real  number.  In  order  to  keep  z  bounded, 
A  is  chosen  between  0  and  1.  The  equations  for  x  and  3?  are  independent  of  z.  The  x  -  y 
dynamics  are  chaotic  and  are  unaffected  by  the  value  of  z.  It  was  shown  in  [10]  that  if 
A  >  2/(3  4-  \/5),  the  attractor  of  this  map  is  nowhere  differentiable  torus.  One  more 
example  in  2-dim  phase  space  is  given  in  §  5. 

All  these  irregular  functions  are  best  characterized  locally  by  a  Holder  exponent.  We 
will  use  the  following  general  definition  of  the  Holder  exponent  h(y)  which  has  been  used 
by  various  authors  [12,  13]  recently.  The  exponent  h(y]  of  a  function/  at  y  is  given  by  h 
such  that  there  exists  a  polynomial  Pn(x)  of  order  n,  where  n  is  the  largest  integer  smaller 
than  h,  that  satisfies 

\f(x}-Pn(x-y}\  =  0(\X-y\h},  (2) 

for  x  in  the  neighbourhood  of  y.  This  definition  serves  to  classify  the  behavior  of  the 
function  at  y. 

Multifractal  measures  have  been  object  of  many  investigations  [14-21].  This 
formalism  has  met  with  many  applications.  Its  importance  also  stems  from  the  fact 
that  such  measures  are  natural  measures  to  be  used  in  the  analysis  of  many  phenomena 
[22,23].  The  first  mathematical  rigorous  results  concerning  multifractals  were  given 
in  [24].  It  may  however  happen  that  the  object  one  wants  to  understand  is  a  function  (e.g. 
a  fractal  or  multifractal  signal)  rather  than  a  set  or  a  measure.  For  instance  one  would  like 
to  characterize  the  velocity  field  of  fully  developed  turbulence.  Studies  on  fluid 
turbulence  have  shown  existence  of  multifractality  in  velocity  fields  of  a  turbulent  fluid  at 
low  viscosity  [14]. 

In  the  case  of  functions  having  same  Holder  exponent  h,  with  0  <  h  <  1,  at  every 
point  (Weierstrass  function  (§  5)  is  such  an  example)  it  is  well  known  [4]  that  the 
box  dimension  of  its  graph  is  2  -  h.  On  the  other  hand,  there  are  functions  /(jc)  which 
do  not  have  constant  exponent  h  at  every  point  but  have  a  range  of  such  exponents. 
The  set  of  points  having  same  exponents  {x  \  h(x)  =  h}  for  such  a  function  may 
even  constitute  a  fractal  set.  In  such  situations  corresponding  functions  f(x)  are 
multifractal. 
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In  the  light  of  these  findings  there  has  been  a  renewed  interest,  chiefly  among 
mathematicians,  in  studying  pointwise  behaviour  of  multifractal  functions.  New  tools 
have  been  developed  to  study  local  behaviour  of  functions,  which  will  be  of  practical  use. 
Notable  among  them  is  the  use  of  wavelet  transforms  [25-28]  which  have  been  used  with 
some  success  to  this  effect.  This  transform  allows  one  to  study  the  behaviour  of  functions 
at  different  scales  and  hence  it  is  popularly  known  as  a  mathematical  microscope.  Some 
well-known  functions  have  been  reanalyzed  using  these  techniques  [29-32]  and  shown  to 
possess  a  spectrum  of  Holder  exponents,  thereby  establishing  their  multifractality. 

In  this  article  we  review  a  more  direct  method,  which  uses  fractional  calculus 
formalism,  to  characterize  local  behaviour  of  functions.  The  fractional  calculus 
formalism  [33-35]  allows  taking  derivatives  of  fractional  order.  However,  as  pointed 
3ut  in  §  2  below,  there  is  a  multiplicity  of  definitions  of  fractional  derivatives.  Hence  it  is 
important  to  recognize  the  appropriate  one  which  is  suitable  for  extracting  the  scaling 
behaviour.  Recently  it  was  realized  [36]  that  the  concept  of  local  fractional  derivative, 
introduced  in  §  4  below  in  a  more  general  setting,  is  ideally  suited  for  this  purpose  and 
that  there  is  a  direct  quantitative  connection  between  the  lack  of  differentiability  of  a 
fractal  function  and  the  dimension  of  its  graph.  Further  it  provides  an  alternative  way  of 
characterizing  the  local  Holder  exponent.  In  this  formalism,  unlike  in  wavelet  transform, 
Holder  exponent  shows  up  naturally  since  the  magnitude  of  local  fractional  derivative 
(LFD)  jumps  when  its  order  crosses  a  critical  order.  The  main  theme  of  this  review  is 
centered  around  this  development. 

The  organization  of  the  paper  is  as  follows.  In  §2  we  present  a  brief  review  of 
fractional  calculus  formalism  with  its  recent  applications.  The  next  section  discusses 
fractional  differentiability  using  Weyl's  definition  and  certain  classical  results.  In  the 
fourth  section  we  present  our  definition  of  LFD  in  more  general  setting  and  its  connection 
to  generalized  Taylor  series.  In  §5  we  apply  this  definition  to  a  specific  example,  viz, 
Weierstrass'  nowhere  differentiable  function.  It  is  shown  that  this  function,  at  every  point, 
is  locally  fractionally  differentiable  for  all  orders  below  2  —  5,  but  not  so  for  orders 
between  2  —  s  and  1,  where  s,  I  <  s  <  2,  is  the  box  dimension  of  the  graph  of  the 
function.  In  the  next  section  a  general  result  has  been  proved,  showing  relation  between 
local  fractional  differentiability  and  local  Holder  exponent.  In  §  5  we  demonstrate  use  of 
LFD  in  unmasking  isolated  singularities  and  in  the  study  of  pointwise  behavior  of 
multifractal  functions.  Section  6  concludes  the  article,  pointing  out  a  few  possible 
consequences  of  our  results. 


2.  Fractional  calculus  and  scaling  phenomena 

Fractional  calculus  [33-35]  is  a  study  which  deals  with  generalization  of  differentiation 
and  integration  to  fractional  orders.  There  are  number  of  ways  (not  neccesarily 
equivalent)  of  defining  fractional  derivatives  and  integrations.  We  list  here  some  of  them, 
which  will  be  used  in  this  work.  We  begin  by  recalling  the  Riemann-Liouville  definition 
of  fractional  integral  of  a  real  function,  which  is  given  by  [33, 35] 

Tdv    for    a  <  0,  (3) 


FJ/  M<?        TV       \ 

[d(x  -  a)]9      T(-q) 
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where  the  lower  limit  a  is  some  real  number  and  of  the  fractional  derivative 

1        d" 


f       n-l<q<n.       (4) 
^  ' 


Fractional  derivative  of  a  simple  function  f(x)  =  xp,  p  >  —  I  is  given  by  [33, 35] 

r(P  +  i)   _„-,  for          L  (5) 


.  —     _.    -  x    -<V  i*_Fi  IX      X^  JL  . 

<fc<     r(p-$  +  i) 

For  future  use  we  note  the  following  properties  of  fractional  derivatives.  For  0  <  q  <  1, 
d9  sin(;c)  _  d4"1  cos(jc)  (f. 

— =; ^Oj 

and 

d9  cos(x)  _     x~v      _  tf-1  sin(x)  (  . 

djc9          F(l  —  ^)         djc^""1 

Further  the  fractional  derivative  has  the  property  (see  ref.  [33]),  viz, 


which  makes  it  suitable  for  the  study  of  scaling.  Note  the  nonlocal  character  of  the 
fractional  derivative  and  integral  in  the  defining  equations  (3)  and  (4).  Also  it  is  clear 
from  equation  (5)  that  unlike  in  the  case  of  integer  derivatives  the  fractional  derivative  of 
a  constant  is  not  zero  in  general. 

Since  the  definition  of  fractional  derivative  contains  a  lower  limit  a,  it  is  natural  that 
different  definitions  of  a  fractional  differentiability  would  arise  depending  on  the  lower 
limit  one  chooses.  Weyl  defined  fractional  derivatives,  with  the  arbitrary  limit  a  in  (4) 
going  to  -co,  as  follows. 


for  "- 


The  merit  of  this  definition  is  that  the  fractional  derivative  of  a  periodic  function 
according  to  this  definition  is  a  periodic  function,  which  is  why  it  is  suitable  in  harmonic 
analysis. 

2.1  Applications  to  the  study  of  scaling  in  physical  systems 

Recent  work  shows  that  the  fractional  calculus  formalism  is  useful  in  dealing  with 
scaling  phenomena.  The  purpose  of  this  subsection  is  to  point  out  a  few  relevant 
parallel  developments  even  though  they  are  not  part  of  the  central  theme  of  this 
paper. 

Hilfer  [37-39]  has  generalized  the  Eherenfest's  classification  of  phase  transition  using 
the  fractional  derivative.  Some  recent  papers  [40-43]  indicate  a  connection  between 
fractional  calculus  and  fractal  structures  [3,  4]  or  fractal  processes  [44-46].  Glockle  and 
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integrals  to  formulate  fractal  processes   such  as  fractional  Brownian  motion.   The 
fractional  Brownian  motion  is  described  by  the  probability  distribution  Bn(t]  defined  by 

K(t-t'}dB(t'),     0</f<l,  (10) 


where  B(t)  is  an  ordinary  Gaussian  random  process  with  average  zero  and  unit  variance 
and 

.       (    t-t'-^2  0<t'<t 


The  use  of  the  fractional  integral  kernel  is  apparent.  Recently  Sebastian  [47]  has  given  a 
path  integral  representation  for  fractional  Brownian  motion  whose  measure  has  fractional 
derivatives  of  paths  in  it. 

Fractional  equations  for  a  class  of  Levy  type  probability  densities:  Levy  flights  [48]  and 
Levy  walks  [49,  50]  have  found  applications  in  various  branches  of  physics  [51],  for 
example  in  fluid  dynamics  [52]  and  polymers  [53].  In  [40],  Nonnenmacher  considered  a 
class  of  normalized  one  sided  Levy  type  probability  densities,  which  provides  the  length 
of  a  jump  of  a  random  walker,  given  by 

~l  exp(-a/*)     a  >  0,  x  >  0.  (12) 


It  is  clear  that  for  large  x 

f(x)~x-*-\     ji>0,     *>0,  (13) 

where  //  is  the  Levy  index.  In  [40]  it  was  shown  that  these  Levy-type  probability  densities 
satisfy  a  fractional  integral  equation 

=  -£*  (14) 


or  equivalently  a  fractional  differential  equation  given  by 


Here,  n=lifO</^<!  and  n  =  2ifl  <yu<2,  etc.  An  interesting  observation  of  this 
work  is  that  the  Levy  index  is  related  to  the  order  of  the  fractional  integral  or  differential 
operator. 

Fractional  equations  for  physical  processes'.  Modifications  of  equations  governing 
physical  processes  such  as  diffusion  equation,  wave  equation  and  Fokker-Plank  equation 
have  been  suggested  [41,  42,  54-56]  which  incorporate  fractional  derivatives  with  respect 
to  time.  In  refs  [41,  42]  a  fractional  diffusion  equation  has  been  proposed  for  the  diffusion 
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on  fractals.  Asymptotic  solution  of  this  equation  coincides  with  the  result  obtained 
numerically.  Fogedby  [57]  has  considered  a  generalization  of  the  Fokker-Plank  equation 
involving  addition  of  a  fractional  gradient  operator  (defined  as  the  Fourier  transform  of 
-#*)  to  the  usual  Fokker-Plank  equation  and  performed  a  renormalization  group 
analysis. 

Transport  in  chaotic  Hamiltonian  systems:  Recently  Zaslavasky  [58]  showed  that  the 
Hamiltonian  chaotic  dynamics  of  particles  can  be  described  by  a  fractional  generalization 
of  the  Fokker-Plank-Kolmogorov  equation  which  is  defined  by  two  fractional  critical 
exponents  (a,  /3)  responsible  for  the  space  and  time  derivatives  of  the  distribution 
function  correspondingly.  With  certain  assumptions,  the  following  equation  was  derived 
for  the  transition  probabilities  P(x,t}: 


'W-'))  +2-~-£(B(x}P(x,>)),  (16) 

where 

AW  =  lim  ^1  (17) 

^   '        A<-0     (Atf 

and 

(18) 

V 


with  A'  and  B'  being  ath  and  2ath  moments  respectively.  The  exponents  a  and  /3  have 
been  related  to  anomalous  transport  exponent. 

We  remark  that  most  of  the  applications  reviewed  in  this  section  deal  with  asymptotic 
scaling  only. 

3.  Weyl  fractional  differentiability  and  Holder  classes 

The  purpose  of  this  section  is  to  review  the  classical  work  of  Stein,  Zygmund  and  others 
[59-62]  which  uses  Weyl  fractional  calculus  in  the  analysis  of  irregular  functions.  Let 
f(x)  be  defined  in  a  closed  interval  I,  and  let 

w(^=w(fi:/)=sup|/(*)-/ty)|     for    x.ye/     \x-y\<6.  (19) 

The  function  w(6)  is  called  the  modulus  of  continuity  of  /.  If  for  some  a  >  0  we 
have  (jj(S)  <  C8a  with  C  independent  of  8,  then  /  is  said  to  satisfy  Holder  (in  old 
literature  it  is  called  Lipschitz)  condition  of  order  a.  These  functions  define  a  class  of 
function  AC,.  Notice  that  with  this  definition  of  the  modulus  of  continuity  only  the  case 
0  <  a  <  I  is  interesting,  since  if  a  >  1,  f(x)  is  zero  everywhere  and  the  function  is 
constant.  Hence  this  definition  has  been  generalized  (see  [28])  to  one  where  the  case 
a  >  I  is  also  nontrivial.  The  idea  is  to  subtract  an  appropriate  polynomial  from  a 
function  /  at  v. 

w(5)=w(6:/)=sup|/(jc)-P(jc--3;)|     for    x,y£l     \x-y\<8,      (20) 
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wnere  f  is  me  only  polynomial  or  smallest  degree  wnicn  gives  trie  smallest  order  or 
magnitude  for  u.  Recently,  this  definition  has  been  widely  used  [12, 13]  to  characterize 
the  velocity  field  of  a  turbulent  fluid. 
Following  Welland  [62]  we  introduce 

DEFINITION  1 

f  is  said  to  have  an  ath  derivative,  where  Q<k<a<k+l  with  integer  k,  if  D~@f, 
6  =  k  +  1  —  a,  has  a  k  +  1  Peano  derivatives  at  XQ  i.e.  there  exists  a  polynomial  PXQ  (t}  of 
degree  <  k  +  1  such  that 

(D-*/)(xo  +  0  -  Pxo(t]  =  o(\t\k+l]     t  ->  0.  (21) 

Further  if 

{-  fP  \(D-ftf)(xQ  +  0  -  PXQ(t}\pdt\     =  o(pk+l)     P-+Q    (1  <  p  <  oo), 
(PJ-p  ) 

(22) 

f  is  said  to  have  an  am  derivative  in  the  IP  sense.  The  D~@  is  in  the  Weyl  sense. 

It  is  clear  that  this  definition  of  fractional  differentiability  is  not  local.  Particularly  the 
behavior  of  function  at  —  oo  also  plays  a  crucial  role.  The  main  classical  results  can  be 
stated  using  this  notion  of  differentiability  and  involve  the  classes  h.pa  and  Npa  which  are 
given  by  the  following  definitions. 

DEFINITION  2 

If  there  exists  a  polynomial  Qxo(t)  of  degree  <  k  s.  t.  f(xQ  +  t )  —  QXo(t)  =  O(\t\a)  as 
t  — »  0  then  /  is  said  to  satisfy  the  condition  Aa  and  if 

\-       |/(*o  +  0  ~  Qx0(t}\pdt  I     =  O(pa),     p  ->  0     (1  < p  <  oo),     (23) 
(PJ-P  J 

f  is  said  to  satisfy  the  condition  A£. 

DEFINITION  3 

f  is  said  to  satisfy  the  condition  N^  if  for  some  p  >  0 

-dt  <  co.  (24) 


PJ-P 

We  are  now  in  a  position  to  state  the  classical  results  in  the  form  of  theorems  1  to  4. 
Next  two  theorems  [61, 62]  give  the  condition  under  which  the  fractional  derivative  of  a 
function  exists. 

Theorem  1.  Suppose  that  f  satisfies  the  condition  Aa  at  every  point  of  a  set  E  of  positive 
measure.  Then  Daf(x)  exists  almost  everywhere  in  E  if  and  only  if  f  satisfies  condition 
N%  almost  everywhere  in  E. 
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operation  of  fractional  differentiation  is  performed. 

Theorem  3.  Let  0  <  a  <  1,  /?  >  0  and  suppose  that  f  e  Aa.  TTien  D~^/  e  Aa+Jg  if 
a  +  0  <  1. 

Theorem  4.  Let  0  <  7  <  a  <  1.  Then  mf  s  AQ_7  if  f  e  Aa  . 

Though  they  have  their  own  value,  these  results  are  not  really  adequate  to  obtain 
information  regarding  irregular  behaviour  of  functions  and  Holder  exponents.  We  observe 
that  the  Weyl  definition  involves  highly  nonlocal  information  and  hence  is  somewhat 
unsuitable  for  treatment  of  local  scaling  behaviour.  In  the  next  section  we  introduce  a 
more  appropriate  definition. 

4.  Local  fractional  differentiability 

In  our  previous  work  [36]  we  introduced  the  notion  of  local  fractional  derivative  and 
demonstrated  its  use  in  the  study  of  local  scaling  behaviour.  We  now  briefly  explain  this 
notion  and  use  it  in  subsequent  sections. 

4.1  Local  fractional  derivative  and  critical  order 

Recall  the  observations  made  in  §  2,  viz,  (1)  nonlocal  information  contained  in  fractional 
derivative  and  (2)  the  fractional  derivative  of  a  constant  is  not  zero.  The  appropriate  new 
notion  of  differentiability  must  bypass  the  hindrance  due  to  these  two  properties.  These 
difficulties  can  be  remedied  by  introducing  the  following. 

DEFINITION  4 

If,  for  a  function/  :  [0,  1]  -*  R,  the  limit 

(25) 


exists  and  is  finite,  then  we  say  that  the  local  fractional  derivative  (LED)  of  order  q 
(0  <  q  <  1),  at  x  =  v,  exists. 

In  the  above  definition  the  lower  limit  y  is  treated  as  a  constant.  The  subtraction  of 
/(y)  corrects  for  the  fact  that  the  fractional  derivative  of  a  constant  is  not  zero. 
Whereas  the  limit  as  x  —  >  y  is  taken  to  remove  the  nonlocal  content.  Advantage  of 
defining  local  fractional  derivative  in  this  manner  lies  in  its  local  nature  and  hence 
allowing  the  study  of  pointwise  behaviour  of  functions.  This  will  be  seen  more  clearly  in 
§  4.2  after  the  development  of  Taylor  series. 

DEFINITION  5 

We  define  critical  order  a,  at  y,  as 

a(y)  =  Sup{q  \  all  LFDs  of  order  less  than  q  exist  at  y}. 
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If,  for  a  function/  :  [0,  1]  -»  R,  the  limit 

->)•)  (26) 


exists  and  is  finite,  where  N  is  the  largest  integer  for  which  Mh  derivative  of  f(x)  at  y 
exists  and  is  finite,  then  we  say  that  the  local  fractional  derivative  (LFD)  of  order  q 
(N  <  q  <  N  +  1),  at  x  =  y,  exists. 

We  consider  this  as  the  generalization  of  the  local  derivative  for  order  greater  than  one. 
The  definition  of  the  critical  order  remains  the  same  since,  for  q  <  1,  (25)  and  (26)  agree. 
This  definition  extends  the  applicability  of  LFD  to  functions  where  the  first  derivative 
exists  but  are  still  irregular  due  to  the  nonexistence  of  some  higher  order  derivative.  As  an 
example  we  note  that  the  critical  order  of  f(x)  =  a  +  bx  +  c|;c|7  at  origin,  according  to 
definitions  5  and  6  is  7. 

Remark.  1)  It  is  interesting  to  note  that  the  same  definition  of  LFD  can  be  used  for 
negative  values  of  the  critical  order  between  —  1  and  0.  For  this  range  of  critical  orders 
N  =  —I  and  the  sum  in  (26)  is  empty.  As  a  result  the  expression  for  LFD  becomes 

•  <27) 


An  equivalence  between  the  critical  order  and  the  Holder  exponent,  for  positive  values  of 
critical  order,  will  be  proved  in  §  6.  Here  we  would  like  to  point  out  that  the  negative 
Holder  exponents  do  arise  in  real  physical  situation  of  turbulent  velocity  field  (see  [63,  64] 

and  references  therein). 

2)  Another  way  of  generalizing  the  LFD  to  the  values  of  critical  order  beyond  1  would 

have  been  to  write  it  as 


But  the  existence  of  Mh  derivative  of  /  at  jc  may  not  be  guaranteed  in  general.  Such  a 
situation  may  arise  in  the  case  of  multifractal  functions  to  be  treated  in  §  8. 

4.2  LFD  and  generalized  Taylor  series 

In  order  to  see  the  information  contained  in  the  LFD  we  consider  fractional  Taylor's 
series  with  a  remainder  term  for  a  real  function  /.  Let 


- 
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and 

N        Wnl/.A 

(x-y}n.  (30) 


It  is  clear  that 

®qf(y}=F(y:Q;q,N}.  (31) 

Now,  for  N  <  q  <  N  +  1,  it  can  be  shown  that 


i.e. 


(33) 


where  #i(*,y)  is  a  remainder  given  by 


We  note  that  the  local  fractional  derivative  as  defined  above  (not  just  fractional 
derivative),  along  with  the  first  N  derivatives,  provides  an  approximation  of  f(x) 
in  the  vicinity  of  y.  We  further  note  that  the  terms  on  the  RHS  of  eq.  (32)  are  nontrivial 
and  finite  only  in  the  case  q  —  a,  the  critical  order.  In  ref.  [65]  a  fractional 
Taylor  series  was  constructed  by  Osier  using  usual  (not  local  in  the  present 
sense)  fractional  derivatives.  His  results  are,  however,  applicable  to  analytic  functions 
and  cannot  be  used  for  non-smooth  functions  directly.  Further  Osier's  formulation 
involves  terms  with  negative  orders  also  and  hence  is  not  suitable  for  approximating 
schemes. 
When  0  <  q  <  1  we  get  as  a  special  case 

;C~y)*  +Remainder  (35) 

provided  the  RHS  exists.  One  may  note  in  equation  (35)  that  when  q  is  set  equal  to 
one  in  the  above  approximation  one  gets  the  equation  of  the  tangent.  It  may  be 
recalled  that  all  the  curves  passing  through  a  point  y  and  having  same  tangent  form 
an  equivalence  class  (which  is  modelled  by  a  linear  behavior).  Analogously  all  the 
functions  (curves)  with  the  same  critical  order  a  and  the  same  Da  will  form  an 
equivalence  class  modelled  by  power  law  xf*.  This  is  how  one  may  generalize  the 
geometric  interpretation  of  derivatives  in  terms  of  tangents.  This  observation  is  useful 
when  one  wants  to  approximate  an  irregular  function  by  a  piecewise  smooth  (scaling) 
function. 
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5.  Fractional  differentiability  of  Weierstrass  function 

Consider  a  form  of  Weierstrass  function,  viz, 

oo 

Wx(t]  =  T  A(j-2)fcsinA^,     A  >  1,      1<  5  <  2,     t  real.  (36) 


Note  that  WA(O)  =  0.  The  box  dimension  of  the  graph  of  this  function  is  s.  The  Hausdorff 
dimension  of  its  graph  is  still  unknown.  The  best  known  bounds  are  given  by  Mauldin  and 
Williams  [66]  where  they  have  shown  that  there  is  a  constant  c  such  that 

c 

s  -  -  —  -  <  dimtf  graph  /  <  s. 
log  A 

Recently  it  was  shown  [67]  that  if  a  random  phase  is  added  to  each  sine  term  in  (36)  then 
the  Hausdorff  dimension  of  the  graph  of  the  resulting  function  is  s.  These  kind  of 
functions  have  been  studied  in  detail  in  [4,  68-70], 

We  note  that  there  are  dynamical  systems  with  graphs  of  such  functions  as 
invariant  sets.  For  example,  let  g  :  IR  —  >  1R  be  differentiable,  and  let  h  :  U2  —  >  R2  be 
given  by 

h(X,t)  =  (Xt,X2-s(x-g)t).  (37) 

Then  the  graph  of  /  can  easily  be  seen  to  be  a  repeller  of  for  h,  where  /  is  the  function 
given  by 


In  the  following  two  subsections  we  prove  lower  and  upper  bound  on  critical  order  of  the 
Weierstrass  function. 


5.1  Lower  bound  on  critical  order 

To  check  the  fractional  differentiability  at  any  point,  say  r,  we  use  f7  =  t  —  r  and 

,  r)  =  Wxtf  +  r}-  WA(r)  so  that  WA(0, r)  =  0.  We  have 


oo 


Wx(tl,  T}=^  A(s-2)fe  sin  Xk(t'  +  T)  -  £  A^-2>*  sin  \kr 

k=l  k=l 


=  ]T  A(j-2^(cos  A*r  sin  AV  +  sin  A*T(COS  AV  -  1)).  (39) 


k=\ 


Now  we  take  fractional  derivative  of  this  with  respect  to  t'. 


(40) 
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that  the  series  on  the  right  hand  side  will  converge  uniformly  for  q  <  2  -  5,  justifying  our 
action  of  taking  the  fractional  derivative  operator  inside  the  sum. 

Also  as  t1  -»  0  for  any  k  the  fractional  derivatives  in  the  summation  of  equation  (40) 
goes  to  zero.  Therefore  it  is  easy  to  see  from  this  that 

'T)=0    for    q  <  2  -  s.  (41) 

This  shows  that  gth  local  derivative  of  the  Weierstrass  function  exists  and  is  continuous, 
at  any  point,  for  q  <  1  -  s. 


5.2  Upper  bound  on  critical  order 

For  q  >  2  -  s,  right  hand  side  of  the  equation  (40)  seems  to  diverge.  We  now  prove  that 
the  LFD  of  order  q  >  1  -  s  in  fact  does  not  exist.  To  do  this  we  write  the  Weierstrass 
function  as  follows. 

N 

Wx(t]  =  ^  A(j~2)fe  sin  A*r  +  X(s~2)N  Wx(XNt] .  (42) 

Jk=l 

We  now  write 

:d*(sin(A*f)-sin(AV)) 


We  choose  N  such  that  X~(N+V  <\t-t'\<  X~N.  Now  since  |  sin(A*f)  -  sin(AV)|  < 


This  implies  that  the  absolute  value  of  the  first  term  in  equation  (43)  is  bounded 
from  above  by  CX^S~2+^N.  Now  since  the  first  derivative  of  the  Weierstrass  function 
does  not  exist  at  any  point  there  exists  a  sequence  of  points  tn  approaching  tf  such 
that  \Wx(XNtn)  -  Wx(XNtf}\  >  c\N\tn  -  tf\  .  Therefore  there  exists  a  sequence  tn  such  that 

d*|W(A\)-W(A*V)|  (1_  q}     _ 

d[A*(f»  -  <0]»  (    ^ 

and  this  is  valid  for  every  c  for  large  enough  n.  This  implies  that 

,_2 

-CA 
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d«|sin(A*fn)-sin(AV)| 


Therefore  we  get 


(,. 
^^ 


< 


Fhis  implies  that 


-  Wx(tf}} 


> 


(47) 


(48) 


(49) 


for  C'  >  0.  From  this  it  is  clear  that  the  LFD  of  order  greater  than  2  —  s  does  not  exist. 
This  concludes  the  proof. 

Summarizing,  therefore,  the  critical  order  of  the  Weierstrass  function  is  2  —  s  at  all 
points.  It  may  be  noticed  that  this  proof  is  valid  for  any  A  >  1.  This  generalizes  a  similar 
result  of  [4,  36]  which  is  valid  only  for  sufficiently  large  A.  Thus  there  is  a  direct 
connection  between  dimension  and  the  differentiability  properties  for  W\(?)-  As  seen 
below,  this  observation  is  not  restricted  to  W\(t]  but  is  quite  general. 


5.3.  Levy  index  of  a  Levy  flights  and  critical  order 

Schlesinger  et  al  [71]  have  considered  a  Levy  flight  on  a  one  dimensional  periodic 
Lattice  where  a  particle  jumps  from  one  lattice  site  to  other  with  the  probability 
given  by 


w-1 


(50) 


where  x  is  magnitude  of  the  jump,  b  is  a  lattice  spacing  and  b  >  a;  >  1.  8(x>y)  is  a 
Kjronecker  delta.  The  characteristic  function  for  P(x)  is  given  by 


_    -i 


IX' 

j=o 


Kj 


(51) 


which  is  nothing  but  the  Weierstrass  function.  For  this  distribution  the  Levy  index  is 
log  ujf  log  b,  which  can  be  identified  as  critical  order  of  P(k] . 

More  generally  for  the  Levy  distribution  with  index  fj,  the  characteristic  function  is 
given  by 


P(k)  = 


(52) 


Critical  order  of  this  function  at  k  =  0  also  turns  out  to  be  same  as  p.  Thus  the  Levy  index 
can  be  identified  as  the  critical  order  of  the  characteristic  function  at  k  =  0. 


We  recall  the  definition  of  Holder  exponent  h(y)  of  a  function/  at  y  [12,  13]  as  h  such 
that  there  exists  a  polynomial  Pn(x)  of  order  n,  where  n  is  the  largest  integer  smaller  than 
h,  that  satisfies 

\f(x)-Pn(x-y}\  =  0(\X-yf)>  (53) 

for  x  in  the  neighbourhood  of  y.  When  h  is  restricted  between  0  and  1  the  equation  (53) 
takes  the  form 

I/to -/Ml  =  o(|* -yf).  (54) 

The  Holder  exponent  characterizes  the  behaviour  of  the  function  around  a  given  point. 
One  can  study  the  function  at  every  point  and  find  its  pointwise  Holder  exponents. 
According  to  equation  (53)  an  analytic  function  has  h  =  oo  at  every  point. 

As  can  be  seen  clearly  that  the  definition  of  the  Holder  exponent  is  not  algorithmic  and 
hence  methods  need  to  be  developed  for  its  determination.  In  [36]  the  following  result 
was  proved  in  a  slightly  different  form,  which  establishes  an  equivalence  between  the 
Holder  exponent  and  the  critical  order. 

Theorem  5.  The  continuous  function  f(x)  has  Holder  exponent  a,  0  <  o.  <  1,  at  y  iff 
®qf(y]  =  Qfor  all  q  <  a  and  ®qf(y)  does  not  exist  for  1  >  q  >  a. 

This  result  can  be  understood  from  the  simple  example  of  fractional  derivative  of  a 
power  x?,  given  in  (5).  In  this  equation  if  0  <  p  <  1  and  0  <  q  <  1  then  the  RHS  exists. 
But  if  we  take  a  limit  x  -»  0  then  we  get 


lim— —  =  <  constant,      if  p  =  q  (55) 

x->0  &fl  1  , 

^      oo,         otherwise, 

The  LHS  in  the  above  equation  is  nothing  but  the  LFD  of  xp.  This  shows  that  the  critical 
order  gives  the  exponent  p.  This  is  expected  to  happen  for  any  function  and  at  any  point 
with  a  local  power  law  behaviour. 

The  following  corollary  follows  from  the  theorem  5  and  a  well  known  result  giving 
relation  between  Holder  exponent  and  box  dimension  of  a  graph  of  a  fractal  function  [4]. 

COROLLARY  1 

If  the  critical  order  of  a  function  f(x]  at  every  point  x  is  a  then  dimB  f  =  2-a  where 
dimfi  /  is  a  box  dimension  of  the  graph  of  the  junction  f. 

With  a  slight  modification  in  the  proof  of  theorem  5  a  general  result  giving  equivalence 
between  the  Holder  exponent  and  the  critical  order  using  the  general  definition  of  LFD 
follows.  The  functions  F  and  F  are  defined  in  equations  (29)  and  (30)  respectively. 

Theorem  6.    Let  f  :  [0,  l]->Rbea  continuous  function. 

(a)  If  D?/(y)  =  0  where  N  <  q  <  N  +  I,  for  some  y,  then  h(y]  >  q. 
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(b)  If  there  exists  a  sequence  xn  —  >•  y  as  n  —  >  oo  such  that 

lim  F(y,xn-y\q,N]  =  ±00, 

n—  >oo 

for  some  y,  then  n(y)  <  q. 

Similarly  a  following  converse  of  the  above  theorem  can  also  be  proved. 

Theorem  7.  Let  f  :  [0,  1]  —  »  R  be  a  continuous  function. 
(a)  Suppose 


where  c>Q,  N<a<N+l  and  \x-y\  <  6  for  some  8  >  0.  Then  O*/(y)  =  0/or  any 
<?<  a  for  ye  (0,1). 

(b)  Suppose  that  for  y  €  (0,  1)  and  for  each  8  >  0  tfiere  ejeists  x  swcn  fnaf  |jt  —  y|  <  <5 


where  c  >  0,  5  <  <5o/0r  some  <$o  >  0  and  0  <  a  <  1.  77ze«  ?nere  exists  a  sequence  xn 
as  n  —  >  oo  .VMC/I 


lim  F(y,xn  —  y;  a,  A^)  =  ±00     for    q  >  a. 

n—  »-oo 

These  two  theorems  give  an  equivalence  between  Holder  exponent  and  the  critical 
order  of  fractional  differentiability.  Their  proofs  are  similar  to  that  of  theorem  5. 


7.  Isolated  masked  singularities 

The  purpose  of  this  section  is  to  demonstrate  the  use  of  LFD  to  detect  masked 
singularities.  We  will  consider  only  isolated  singularities.  We  choose  the  simplest 
example  f(x)  =  £JU  anx"  +axa,  N  <a<N+l,  x  >  0.  Critical  order  at  x  —  0 
gives  the  order  of  singularity  at  that  point  whereas  the  value  of  the  LFD  &q=af(Q),  viz 
aT(a  +  l),  gives  strength  of  the  singularity. 

Using  LFD  we  can  detect  a  weaker  singularity  masked  by  a  stronger  singularity.  As 
demonstrated  below,  we  can  estimate  and  subtract  the  contribution  due  to  stronger 
singularity  from  the  function  and  find  out  the  critical  order  of  the  remaining  function. 
Consider,  for  example,  the  function 

/(*)  =  £>„*«  +  axa  +   £  anxn  +  bJ,  (56) 

w=0  n=N+l 

where  Ar<a<Ar+l<M</3<M+land;t>0.  LFD  of  this  function  at  x  =  0  of 
the  order  a  is  BQ/(0)  =  aF(o:+  1).  Using  this  estimate  of  stronger  singularity  we 
now  write 

Gfx-  a)  =/W 

in*,«)    ;w 


me  critical  oraer  or  mis  junction,  at  x  =  u,  is  p  wnicn  is  a  masjcea  singuianty.  JNotice  mat 
the  estimation  of  the  weaker  singularity  was  possible  in  the  above  calculation  just 
because  the  LED  gave  the  coefficient  of  xa/T(a  +  1).  This  suggests  that  using  LFD,  one 
should  be  able  to  extract  secondary  singularity  spectrum  masked  by  the  primary 
singularity  spectrum  of  strong  singularities.  Hence  one  can  gain  more  insight  into  the 
processes  giving  rise  to  irregular  behavior. 

Comparison  of  two  methods  of  studying  pointwise  behavior  of  functions,  one  using 
wavelets  and  the  other  using  LFD,  shows  that  characterisation  of  Holder  classes  of 
functions  using  LFD  is  direct  and  involves  fewer  assumptions.  Characterisation  of  Holder 
class  of  functions  with  oscillating  singularity,  e.g.  f(x)  =  xa  sin(l/x^)  (x  >  0,  0  <  a  <  I 
and  (3  >  0),  using  wavelets  needs  two  exponents  [27].  Using  LFD,  owing  to  theorems  1 
and  2  critical  order  directly  gives  the  Holder  exponent  for  such  a  function. 

It  has  been  shown  in  the  context  of  wavelet  transforms  that  one  can  detect 
singularities  masked  by  regular  polynomial  behavior  [12]  by  choosing  an  appropriate 
analysing  wavelet.  (Wavelets  with  first  n,  for  some  suitable  n,  moments  vanishing 
are  considered  appropriate).  If  one  has  to  extend  the  wavelet  method  to  unmask 
weaker  singularities,  one  would  then  require  analysing  wavelets  with  fractional 
moments  vanishing.  Notice  that  one  may  require  this  condition  along  with  the 
condition  on  first  n  moments.  Further  the  class  of  functions  to  be  analysed  is  in  general 
restricted  in  these  analyses.  These  restrictions  essentially  arise  from  the  asymptotic 
properties  of  the  wavelets  used.  On  the  other  hand,  these  restrictions  are  not  relevant 
while  using  LFD. 


8.  Multifractal  function 

We  saw  in  §  5  that  the  Weierstrass  function  is  a  fractal  function,  i.e.,  it  has  the  same 
Holder  exponent  at  every  point.  But  there  are  multifractal  functions  which  have  different 
Holder  exponents  at  different  points.  These  functions  can  be  used  to  model  various  N 

intermitent  signals  arising  in  physical  applications.  Since  the  critical  order  gives  the  local  >J 

and  pointwise  behavior  of  the  function,  conclusions  of  the  theorems  5,  6  and  7  will  carry 
over  even  to  the  case  of  multifractal  functions  where  we  have  different  Holder  exponents  ! 

at  different  points.   Selfsimilar  multifractal  functions  have   been  constructed  by  i 

Jaffard  [13].  We  give  one  specific  example  of  such  a  function.  This  function  is  a 
solution  F  of  the  functional  equation 

^^EVCSr1  (*))  +  *(*),     *ieal,  (57) 

'=1 

where  St'&  are  the  affine  transformations  of  the  kind  S/(x)  =  ^x  +  bt  (with  |/^|  <  1  and 
h's  real)  and  Vs  are  some  real  numbers  and  g  is  any  sufficiently  smooth  function  (it  is 
assumed  that  g  and  its  derivatives  have  fast  decay).  For  the  sake  of  illustration  we  choose 

/*,  =  A*2  =  V3,  bi  =  0,  b2  =  2/3,  A!  =  3~Q,  A2  =  3^  (0  <  a  <  (3  <  1)  and  ] 

g(x)  =  sin(27rc)     if    x  e  [0, 1]  ; 

=  0    otherwise.  ; 

Pramana  -  /.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
64  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


Such  functions  are  studied  in  detail  in  [13]  using  wavelet  transforms  where  it  was 
shown  that  the  above  functional  equation  (with  the  parameters  we  have  chosen)  has 
a  unique  solution  F.  Further  at  any  point  F  either  has  Holder  exponents  ranging  from 
a  to  (3  or  is  smooth.  A  sequence  of  points  ^(O),  S^S^O), ...  ,&„ ..  .5^(0), ..., 
where  ik  takes  values  1  or  2,  tends  to  a  point  in  [0, 1]  (in  fact  to  a  point  of  a  triadic 
Cantor  set)  and  for  the  values  of  /^s  we  have  chosen  this  correspondence  between 
sequences  and  limits  is  one  to  one.  The  solution  of  the  above  functional  equation  is 
given  by  [13] 

oo  2 

j-i  /    \  ^T  ^k ^        \  \          /  /-i —  \  fi —  1  /    ^  ^  /  ^C  0  \ 

\    /  /   j          /  ^j  '  1  'n  o  \    jn  1 1     V     /  / "  \         / 

n-Q   ii, ...,!„=! 

Note  that  with  the  above  choice  of  parameters  the  inner  sum  in  (58)  reduces  to  a  single  _ 
term.  Jaffard  [13]  has  shown  that  the  local  Holder  exponent  at  y  is 

^"'A'"(y))  (59) 

where  {z'i  (y), . . . ,  in(y}}  is  a  sequence  of  integers  appearing  in  the  sum  in  equation  (58)  at 
a  point  y.  It  is  clear  that  /imin  =  a  and  /imax  —  0.  The  function  F  at  the  points  of  a  triadic 
cantor  set  have  h(x]  &  [a,  0]  and  at  other  points  it  is  smooth  (  where  F  is  as  smooth  as  #). 
Benzi  et  al  [72]  have  constructed  multifractal  functions  which  are  random  in  nature 
unlike  the  above  nonrandom  functions.  For  still  another  approach  also  see  [73]. 

Several  well-known  'pathological'  functions  have  been  reanalyzed  in  [29-32]  and 
found  to  have  multifractal  nature.  Here  we  consider  one  example  of  classical  multifractal 
function 

-isin(7m2;t).  (60) 

-1 

This  function  was  proposed  by  Riemann.  It  turns  out  that  the  regularity  of  this  function 
varies  strongly  from  point  to  point.  Hardy  and  Littlewood  [74]  proved  that  R(x)  is  not 
differentiate  at  XQ  if  XQ  is  irrational  or  if  JCQ  can  not  be  written  as  2p  +  l/2q  +  1 
(p^q^N).  In  fact  they  showed  that  the  Holder  exponent  at  these  points  <  3/4. 
Gerver  [75]  proved  the  differentiability  of  R(x)  at  points  of  the  form  2p  +  l/2q  +  1 
(p,q  e  AO-  At  these  points  the  Holder  exponent  is  3/2.  This  function  has  also  been 
studied  in  [76,  77].  Jaffard  [29]  has  recently  shown  that  the  dimension  spectrum  of  the 
Riemann  function  is  given  as  below. 

4a-2    if    ae[|,|] 

0  if    a=|  (61) 

— oo        otherwise , 

where  d(a)  gives  the  Hausdorff  dimension  of  the  set  when  the  Holder  exponent  is  a. 

LFD  forms  one  method  of  studying  pointwise  behaviour  of  such  multifractal  functions 
alongwith  other  methods  and  may  considerably  reduce  the  analysis  involved.  This  fact 
was  demonstrated  in  [36]  on  a  specific  example  of  self-similar  multifractal  function  given 
by  equation  (58). 
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9.  Concluding  remarks 

First  we  reviewed  the  applications  of  various  fractional  differential  equations  in  different 
physical  situations.  It  was  noted  that  most  of  these  applications  dealt  with  the  asymptotic 
scaling.  Further  we  reviewed  the  classical  results  within  the  framework  of  Weyl  fractional 
calculus.  However  these  results  were  found  to  be  inadequate  for  the  study  of  pointwise 
behavior  of  fractal  and  multifractal  functions.  The  notion  of  LFD  as  developed  in  [36] 
was  found  suitable  for  this  purpose.  In  terms  of  the  LFD  it  was  also  possible  to  write  a 
fractional  Taylor  series  of  a  function  (useful  in  an  analytic  treatment  of  approximations). 
It  was  pointed  out  that  generalization  of  the  notion  of  tangents  to  the  graph  of  a  function 
(useful  for  geometric  purposes)  is  also  possible  using  LFD. 

It  was  established  that  the  critical  order  of  the  Weierstrass  function  is  related  to  the  box 
dimension  of  its  graph.  If  the  dimension  of  the  graph  of  such  a  function  is  1+7,  the 
critical  order  is  1—7.  When  7  approaches  unity  the  function  becomes  increasingly 
irregular  and  local  fractional  differentiability  is  lost  accordingly.  Thus  there  is  a  direct 
quantitative  connection  between  the  dimension  of  the  graph  and  the  fractional 
differentiability  property  of  the  function.  This  is  one  of  the  remarkable  conclusions  of 
the  new  approach.  An  important  consequence  of  this  approach  is  that  a  classification  of 
continuous  paths  (e.g.  fractional  Brownian  paths)  or  functions  according  to  local 
fractional  differentiability  properties  is  also  a  classification  according  to  dimensions  (or 
Holder  exponents). 

Also  the  Levy"  index  of  a  Levy  flight  on  one  dimensional  lattice  is  identified  as  the 
critical  order  of  the  characteristic  function  of  the  walk.  More  generally,  the  Levy  index  of 
a  Levy  distribution  is  identified  as  the  critical  order  of  its  characteristic  function  at  the 
origin. 

We  have  argued  and  demonstrated  that  LFDs  are  useful  for  studying  isolated 
singularities  and  singularities  masked  by  the  stronger  singularity  (not  just  by  regular 
behavior).  It  was  also  shown  that  the  pointwise  behavior  of  irregular  (fractal  or 
multifractal)  functions  can  be  studied  using  the  methods  of  this  paper. 

We  note,  however,  that  the  treatment  of  random  irregular  functions  as  well  as 
multivariable  irregular  functions  is  badly  needed.  We  hope  that  these  problems  can  be 
tackled  in  the  near  future. 
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Signature  of  chaos  in  power  spectrum 
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Abstract.  We  investigate  the  nature  of  the  numerically  computed  power  spectral  density 
P(f,  N,  r)  of  a  discrete  (sampling  time  interval,  r)  and  finite  (length,  N)  scalar  time  series  extracted 
rrom  a  continuous  time  chaotic  dynamical  system.  We  highlight  how  P(f,  N,  r)  differs  from  the 
rue  power  spectrum  and  from  the  power  spectrum  of  a  general  stochastic  process.  Non-zero  r  leads 
:o  aliasing;  P(f,N,r)  decays  at  high  frequencies  as  [Trr/sinTrr/]2,  which  is  an  aliased  form  of  the 
I//2  decay.  This  power  law  tail  seems  to  be  a  characteristic  feature  of  all  continuous  time 
iynamical  systems,  chaotic  or  otherwise.  Also  the  tail  vanishes  in  the  limit  of  N  — >  oo,  implying 
hat  the  true  power  spectral  density  must  be  band  width  limited.  In  striking  contrast  the  power 
spectrum  of  a  stochastic  process  is  dominated  by  a  term  independent  of  the  length  of  the  time  series 
it  all  frequencies. 

Keywords.    Chaos;  time  series  analysis;  Power  spectrum;  Aliasing. 
PACSNos    05.45;  47.52 

1.  Introduction 

Ihe  problem  of  distinguishing  chaos  from  noise  as  the  mechanism  responsible  for  the 
irregular  oscillations  observed  in  a  scalar  time  series  is  of  great  importance  owing  to  its 
relevance  to  a  variety  of  experimental  contexts  [1],  If  the  erratic  oscillations  are  chaotic, 
then  there  exists  a  low  dimensional  deterministic  dynamics  responsible  for  it.  Then,  in 
principle,  it  is  possibile  to  reconstruct  the  relevant  dynamical  equations,  paving  way  for  a 
complete  understanding  of  the  phenomenon.  There  has  been  an  intense  activity  in  this 
direction  resulting  in  the  formulation  of  several  numerical  approaches  based  on  metric 
and  topological  considerations,  see  for  example  [2].  In  this  paper  we  shall  focus  attention 
on  the  analysis  of  the  spectral  measure  -  one  of  the  traditional  tools  employed  in  the 
study  of  stationary  stochastic  processes  as  well  as  deterministic  dynamical  systems.  For 
example,  a  periodic  system  with  frequency  /  has  a  pure  point  spectral  measure  with  a 
spectral  density  (henceforth  called  power  spectrum)  constituted  by  sharp(delta)  peaks  at 
f ,  and  all  its  harmonics,  in  general.  On  the  other  hand,  the  spectral  measure  of  a  generic 
stationary  stochastic  process  has  an  absolutely  continuous  part  and  a  corresponding 
broad  power  spectrum.  Chaotic  time  series  also  leads  to  a  broadened  power  spectrum  and 
hence  it  becomes  difficult  to  distinguish  chaos  from  noise.  Intuitively,  it  is  clear  that  finite 
changes  can  not  take  place  in  a  deterministic  system  linear  or  nonlinear  (chaotic  or 
otherwise),  in  an  inifinitesimal  interval  of  time,  whereas  finite  changes  are  possible  in  any 
interval  of  time  in  a  generic  stochastic  process.  This  argument  holds  good  even  for  cases 
of  deterministic  dynamical  systems  with  non  differentiable  solutions,  like  the  kicked 
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C(/)  defined  in  (2)  and  (3),  henceforth,  will  be  referred  to  as  the  true  power  spectrum  in 
this  paper. 

In  general,  the  computer  or  experimental  realization  of  state  variable  x(t)  is  in  the  form 
of  a  scalar  discrete  time  series.  This  is  equivalent  to  obtaining  a  time  series  by  sampling 
x(t]  with  a  non  zero  sampling  time  T,  up  to  a  finite  length  N,  giving  {xj  =  x(t  =  jr}\ 
j  =  Q,N-  1}.  We  now  proceed  to  define  the  power  spectrum  P(/,N,r)  of  such  a 
discrete  scalar  time  series  such  that  it  agrees  with  C(f)  in  the  limit  r  -»  0  and  N  —  >  oo. 
The  discrete  version  of  the  autocorrelation  is  defined  as 


where  {•)  represents  the  average  over  several  initial  conditions  and  trajectories.  This 
averaging  is  performed  so  as  to  ensure  that  the  autocorrelation  function  of  the  discrete 
time  series  Cj(N)  is  identical  to  that  of  the  continuous  time  process  c(t)  evaluated  at  t  =jr 
in  the  limit  N  —  >  oo.  ?(/,  N,  r)  is  then  defined  as 

(AT-l) 

,T)  =  r  Cj(N}&^'.  (5) 


,jV,  r)  can  also  be  represented  in  the  form  analogous  to  equation  (3).  Let  X(/,Ar,  r) 
be  the  discrete  Fourier  transform  of  {xj}, 

X(f,N,r}  =  Y/xjei2^  (6) 

j=o 

and  the  corresponding  power  spectrum  is 

,T)f).  (7) 


In  this  paper  we  refer  to  P(/,  N,  r)  as  the  computed  power  spectrum  or  simply  the  power 
spectrum.  The  computed  power  spectrum  P(f,N,  r}  is  equal  to  the  true  power  spectrum 
C(f)  in  the  following  limit. 

C(/)  =  lim  limP(/,AT,r).  (8) 

T-*0  N-*QO 

In  our  numerical  investigations,  we  used  the  definitions  given  in  equations  (6)  and  (7) 
to  compute  the  power  spectrum  of  chaotic  time  series.  Power  spectra  of  different 
segments  of  length  N,  extracted  from  a  time  series  of  the  dynamical  system 
corresponding  to  a  given  initial  condition,  are  computed.  The  computation  is  repeated 
for  different  initial  conditions.  An  average  over  this  ensemble  of  power  spectra  is  carried 
out  to  obtain  P(/,JV,r). 

The  power  spectra  of  chaotic  dynamical  systems  are  found  to  exhibit  an  exponential 
decay  followed  by  a  much  slower  decay  (like  an  algebraic  decay).  The  power  spectra  of 
two  representative  models  exhibiting  chaos,  namely,  the  Lorenz  and  Rossler  models,  are 
shown  in  figure  l(a)  and  figure  l(b),  respectively.  The  nonexponential  tail,  though 
appears  like  an  algebraic  decay,  does  not  fit  to  any  power  law.  But,  surprisingly,  we  find 
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Figure  1.    Computed  power  spectrum,  P(f,N,r)  for  (a)  Lorenz  model  (N  =  217, 
r  =  0.008)  and  (b)  Rossler  model  (N  =  2l\  r  =  0.016)  F  =  2r/. 


that  the  tail  exhibits  a  universal  [Trr/sin-Trr/]2  decay.  Figures  2(a)  -  2(f)  show  the  power 
spectral  density  in  the  nonexponential  region  to  be  proportional  to  [irT/simrrf]2.  It  is 
obvious  from  the  data  that,  without  exception,  the  fit  is  excellent.  The  origin  of  such  a 
functional  form  for  the  computed  power  spectrum  at  high  frequencies  will  form  the 
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Figure  3.     The  scaling  of  the  computed  power  spectrum  with  the  size  of  the  discrete 
time  series,  for  Rossler  oscillator. 

subject  matter  of  the  next  two  sections.  We  observe  that  the  spectral  density  in  the 
nonexponential  tail  region  is  smooth  and  that  it  scales  as  l/N.  The  N-dependence  of  the 
power  spectral  density  at  a  given  frequency  in  the  tail  region  is  shown  in  figure  3. 
Detailed  description  of  W-dependence  is  presented  in  §  5. 

We  have  carried  out  the  calculations  in  double  and  quadruple  precision  to  see  the 
influence  of  machine  accuracy  on  our  results.  We  find  that  our  conclusions  remain 
the  same. 


3.  Relation  between  the  true  and  computed  power  spectra 

In  order  to  understand  the  findings  reported  in  the  last  section  in  a  proper  perspective,  we 
derive  here  a  formal  relation  between  the  computed  and  true  power  spectra.  In  doing  this, 
we  first  define  the  following  two  functions,  namely 


I     for     -(N-l)r<t<(N-  l}r 
0,     otherwise 


and 


J=-00 


The  correlation  function,  in  general,  depends  on  N  and  it  can  be  rewritten  as 
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implies  that  P(f,N,  r)  can  be  expressed  as  a  sum  of  two  terms 

P(f,N,r)=Pi(ftN,T)+P2(ftN,T)t  (12) 

where 

/oo 
toc(t)wN(t)ex.p(£2irfl)t  (13) 

-CXD 

and  P2(f,N,r},  related  to  {d/(AO},  is  the  remainder  that  vanishes  in  the  limit  AT  -»  oo. 
Pi(f,N,r)  given  in  (13)  can  be  rewritten  in  terms  of  the  functions  defined  in  equations 
(9)  and  (10)  as 

Pi(/,N,r)  =   r  dtc(t}WN(t}^ft.  (14) 

J—  c» 

Using  the  convolution  theorem,  we  can  write 

/oo 
d/C(/-/')W),  (15) 

-oo 

where  C(/)  and  Wjv(/)  are  the  Fourier  transforms  of  c(f)  and  wN(t),  respectively.  The 
Fourier  transform  of  w#(?)  is  given  by 


This  can  be  recast  as 

^(/)=^(/)  +  M/),  (17) 

where  limw_+ooyw(/)  =  0.  Substituting  (17)  in  (15)  gives 

Qw(/,^T)  (18) 


such  that  lim^^ooQ^^JVjT)  =0.  Using  (12)  and  (18),  we  can  write  the  computed 
power  spectrum  as 

P(/,tf,r)  =  C(/)+a(/,tf,T),  (19) 

where  fl(/,JV,r)  vanishes  in  the  limit  N  -^  oo.  Using  once  again  convolution  theorem, 
the  Fourier  transform  of  c(f)  can  be  written  as 

/CXD  OO 

d/'CC/-/')  £  *(/'+-).  (20) 

•o°  m=-oo     X  Ty 

This  can  be  simplified  and  we  get 


Finally  using  (19)  and  (21),  we  get  the  desired  result. 
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Signature  of  chaos  in  power  spectrum 

,r),  (22) 


where  P(/,  N,  r)  is  the  computed  power  spectrum  and  C(/)  is  the  true  power  spectrum. 
This  equation  formally  relates  the  computed  power  spectrum  to  the  true  one.  It  brings  out 
clearly  the  role  played  by  r  and  N.  The  first  term  represents  limN^00P(f,  N,  T)  and  the 
second  term  reflects  the  finite  size  effects.  We  shall  be  analysing  in  the  following 
sections,  how  a  relation  of  this  kind  explains  the  results. 


4.  "Aliasing 

Power  spectrum  of  any  sampled  data,  with  an  interval  of  sampling  r  being  nonzero,  is 
defined  only  in  the  domain  -fm  <  f  <  fm,  where  fm  =  l/2r,  is  the  Nyquist  frequency.  In 
sampling  a  given  function  x(t),  r  can  be  tuned,  r  —  >  0  corresponds  to  sampling  the  entire 
function.  But,  for  a  fixed  r,  the  signal  might  contain  frequencies  higher  than  the  Nyquist 
frequency.  In  such  a  case,  the  spectral  densities  corresponding  to  the  frequencies  larger 
than/m  will  be  folded  into  the  region  \f  \  <  fm.  This  is  the  phenomenon  of  aliasing,  see 
for  example  [13]. 

The  first  term  of  (22)  which  represents  lim  ;v-+oo  P(/,  N,  r)  is  an  aliased  version  of  true 
power  spectrum  C(/).  This  clearly  establishes  the  role  played  by  nonzero  r  resulting  in 
the  change  in  the  functional  form  of  the  decay  in  the  power  spectrum  due  to  aliasing.  The 
remainder  R(f,N,r]  in  (22)  depends  both  on  r  and  N.  This  term,  owing  to  its 
dependence  on  r,  could  itself,  in  principle,  be  an  aliased  version  of  some  other  function. 
We  derive,  in  this  section,  the  aliased  versions  of  power  spectra  with  (i)  an  exponential 
and  (ii)  a  power  law  dependence  on/,  and  discuss  the  implications  with  reference  to  the 
observed  numerical  results. 


4.1  Aliased  exponential  decay 

We  consider  a  power  spectrum  with  an  exponential  dependence  on  /. 

C(/)=«T*'.  (23) 

The  aliased  version  Pa(f]  of  C(/)  can  be  written  as 


pa(f}=  2^  C(/  +  2//M),  (24) 

;=-oo 

where  fm  =  l/2r,  the  Nyquist  frequency.  The  sum  can  be  easily  evaluated  and  we  get 

Pi  r<\           COSn  I  DJffi  ( i       r  1 1  fi**c\ 

a(F)  =  a .Lt-r^T  »  (25) 


where  F  =f/fm,  is  the  scaled  frequency.  It  is  worth  mentioning  that,  as  a  function  of  the 
scaled  frequency  F,  the  discrepancy  between  the  aliased  and  the  true  power  spectra  gets 
confined  to  a  region  closer  to  the  end  point  as  r  decreases. 


4.2  Aliasing  of  power  law  decay 

Here  we  first  derive  an  expression  for  the  aliased  version  of  I//2  decay  and  then  general 
expression  for  any  even  power  of  1  //.  The  true  power  spectrum  is  given  by 

C(/)=~-  (26) 

The  aliased  form  can  be  written  as 
A     .°°  1 


Using  the  integral  representation 

i 


p2  ~  I      "-  *  ~  (28) 

we  can  rewrite  Pa(f}  as 

A    /^coshq-F); 
sinhf 

A     d    f°°  J    sinh(l  -  F)t 
=  ~^2^^/      dt H3^ (29) 


Using  contour  integration  J  can  be  evaluated  as 


J  =  Trcot  —  -  (30) 


and 

„  ,_,      Avr2  0  ?rF 

^(^)-T7Tcosec  -T-  (31) 

J  m  ^ 

The  above  can  be  recast  in  the  following  convenient  form 


From  (32),  it  is  obvious  that  UmT^0^a(/)  =  C(f).  However,  for  any  finite  r,  Pa(F) 
is  shape  invariant  and  the  discrepancy  between  the  computed  and  the  true  power 
spectra  does  not  vanish  at  any  frequency.  This  functional  form  fits  extremely  well 
the  power  spectra  of  chaotic  systems  in  the  high  frequency  (nonexponential)  region 
as  can  be  seen  from  the  figures  2(a)-2(f).  This  implies  that  the  power  spectra  of 
chaotic  time  series  have  a  universal  I//2  tail,  which  is  seen  as  a  [Trr/sinTrr/]2  tail  due  to 
aliasing. 
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The  aliased  version  of  power  law  decay  with  an  even  exponent  (say,  21}  can  also  be 
exactly  calculated. 

A 

(33) 


^     ,  } 

Jm  j^-oo  (4)  +f) 

This  can  be  further  rewritten  as 
A          l 


Since  the  integral  in  (34)  is  already  evaluated  (see  (30)),  Pa(F)  can  easily  be  obtained  by 
computing  the  appropriate  derivative,  in  (34). 

5.  Finite  size  effects 

As  has  been  emphasized  at  the  beginning,  the  finite  size  of  the  time  series  also  leads  to  a 
discrepancy  between  the  computed  and  the  true  power  spectra.  The  formal  relation 
between  the  computed  and  the  true  power  spectra  described  in  (22)  of  §  3,  has  a  part  that 
depends  on  N,  denoted  by  R(f,  N,  r).  This  remainder  is  such  that  it  vanishes  in  the  limit 
N  —»  oo. 

Numerical  investigation  of  the  power  spectral  density  of  several  chaotic  dynamical 
systems  listed  in  table  1,  show  that  there  is  an  exponential  decay  up  to  some  frequency 
f*  followed  by  a  JTrr/sin  nrf]2  tail.  Our  studies  on  the  scaling  of  the  power 
spectral  density  of  the  chaotic  dynamical  systems  with  respect  to  the  system  size 
TV  reveals  that  the  spectral  density  corresponding  to  the  frequencies  in  the 
exponential  decay  region  is  independent  of  N,  whereas  the  ones  corresponding 
to  the  frequencies  in  the  tail  region  scale  as  l/N.  The  scaling  of  the  power 
spectrum  with  TV  for  Rossler  is  presented  in  figure  3.  Since,  by  definition,  see  (22), 
only  R(f,N,r]  can  depend  on  TV,  it  is  clear  from  (32),  together  with  the  numerical 
results,  that 

(35) 

^    ' 


, 

'          TV[sm7TT/ 

to  the  leading  order  in  I/TV.  Since  #(/,TV,  r)  accounts  for  the  whole  of  the  tail  (see  the  fit 
in  figures  2(a)-2(f  )),  within  the  limits  of  the  present  numerical  analysis,  it  is  evident  that 
C(f)  does  not  have  any  spectral  component  in  the  tail  region.  This  implies  that  the  power 
spectrum  is  band  width  limited. 

In  the  experimental  context,  the  interest  is  mainly  to  distinguish  chaotic  oscillations 
from  random  oscillations  (or  stochastic  signals).  The  band  width  limited  character  of 
the  power  spectrum  signals  the  presence  of  chaotic  oscillations  in  the  time  series,  hi 
a  generic  noise,  we  do  not  expect  the  power  spectrum  of  such  a  process  to  be  band 
width  limited.  For  example  the  power  spectrum  of  white  noise  is  broad  and  is 
independent  of  TV.  One  can  understand  that  in  a  determinisitic  dynamical  system, 
chaotic  or  otherwise,  no  drastic  changes  can  take  place  in  an  extremely  short  interval 
of  time  and  hence  the  power  spectral  density  in  the  high  frequency  region  will 
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accordingly  be  zero.  This  results  in  a  band  width  limited  power  spectrum.  On  the 
other  hand,  by  definition,  a  generic  stochastic  process  is  one  in  which  changes  occur 
on  any  time  scale.  This  leads  to  the  conclusion  that  the  power  spectral  density  of 
such  a  process  must  be  nonzero  for  all  the  frequencies.  In  Appendix  n,  we  present  a 
proof  that  for  a  stationary  stochastic  process  with  absolutely  continuous  spectral 
measure,  the  finite  time  power  spectrum  is  dominated  by  an  N  independent  term  at  all 
frequencies. 

The  I/AT  scaling,  together  with  an  aliased  I//2  decay  at  high  frequency  preceded  by  an 
exponential  decay  in  the  power  spectrum  single  out  chaotic  time  series  from  periodic, 
quasiperiodic  and  noisy  signals. 

6.  Critical  remarks  on  existing  theoretical  arguments 

In  [3-5]  the  authors  present  a  case  for  the  exponential  decay  of  the  power  spectra  of 
chaotic  systems.  The  argument  rests  on  the  following  definition  of  the  power  spectrum 


•          (36) 
with 

*(/)=  r*«e''2^dt  (37) 

J  —  CO 

By  extending  x(t)  to  the  complex  time  domain,  the  authors  argue  that  X(f]  can  be 
represented  as  a  sum  of  exponentials,  and  hence  C(/)  must  be  a  sum  of  exponentials. 
They  have  tacitly  assumed  the  existence  of  X(f}. 

In  this  section  we  present  the  problems  associated  with  the  above  argument.  The  proper 
definition  of  C(f]  is,  see  [6] 

C(/)=  K^PW)!2  (38) 

with 

Tt}e2«ftdt.  (39) 


From  the  numerical  simulations  we  know  that  the  spectral  measure  of  chaotic  systems  is 
predominantly  absolute-continuous  at  high  frequencies.  We  find  that  the  associated  power 
spectral  density  C(f)  is  nonzero  and  bounded  in  the  high  frequency  domain.  But,  the 
presence  of  the  factor  1/T  in  (38)  necessitates  vanishing  of  C(f]  whenever  X(f)  takes  on 
a  nonzero,  but  finite  value.  Since  C(f]  of  the  system  under  investigation  is  nonzero,  it  is 
clear  that  X(f]  =  limr_00Xr(/)  can  not  exist.  The  finite  time  Fourier  transform  XT(f] 
should  diverge  as  Vf  as  T  -»•  co  (our  X(f,  N,  r)  diverges  as  VN).  Clearly  these  authors 
have  neglected  this  part  of  the  integral  and  concentrated  only  on  the  principal  value, 
which  in  fact  does  not  contribute  to  C(f)  at  all. 

In  Appendix  I,  we  show  that  the  power  spectra  of  the  deterministic  dynamical  systems 
whose  solution  is  smooth  and  bounded  (which  serves  as  sufficiency  condition  but  not 
necessary)  will  have  a  l/f2N  tail. 
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sampling  time  r  and  finite  length  of  the  time  series  N.  The  nonzero  sampling  time  r  gives 
rise  to  aliasing,  which  changes  the  functional  form  of  the  decay.  The  computed  power 
spectrum  decays  as  (cosh  bfm(l  -  F))/sinh  bfm  up  to  a  frequency  fc  and  subsequently 
decays  as  l/N[Trr/simrrf]2  for/  >  /c.  These  expressions  are  calculated  to  be  the  aliased 
versions  of  e~bf  and  l/Nf2  respectively,  which  means  the  true  power  spectrum  is  band 
width  limited.  The  feature  of  l/Nf2  decay  is  observed  in  all  the  chaotic  dynamical  system 
studied.  The  finite  size  N  of  the  time  series  results  in  contributing  a  spurious  tail  to  an 
otherwise  band  width  limted  power  spectrum.  This  feature  turns  out  to  be  useful  for 
distinguishing  chaos  from  a  generic  stationary  stochastic  process  as  the  former  scales  as 
l/N  while  the  latter  is  independent  of  N. 

Appendix  I 

We  now  proceed  to  prove  the  statement  that  the  power  spectrum,  defined  for  the  signal  in 
the  interval  [0,  T],  of  a  bounded  function  belonging  to  C°°  class  shows  a  (f2T)~l  tail  for 

|/|>/c- 

Consider  a  dynamical  system 

^L/f(x(0),     /=!,...,</  (Al) 

with  the  initial  conditions 

Xi(0)=XiQ.  (A2) 

The  formal  solution  of  the  above  equations  is  given  by 

Xj(t)  =  Z/(XQ,  0  =  CtLXjQt  (A3) 

where 

(A4) 


The  power  spectrum  P/,r(/)  =  l/^l-^/vK/)!    *s  giyen  by 

2 
i 


i=l 

The  finite  time  Fourier  transform  of  Xj(t)  is  defined  as 

[T 

xj,r(f)  =  /    dtaXxo,?K  ^. 
7o 

Substituting  (A3)  in  (A5)  and  simplifying,  we  get 

TXjo-xiQ].  (A6) 


lei2      '  (A7) 
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M  C  Valsakumar  et  al 
The  expression  for  Q  can  be  written  as 


From  (A8)  we  get 

00   /          \ 

(A9) 


«=0 


From  the  formal  solution  (A3)  it  is  easy  to  see  that 

d"  d" 

Ln^(x0,r)=—  Jc/Cxo.OLr     ^    ^o  =  ^(*o,')Uo- 

Assuming  that  the  trajectory  Xj(xo,  t]  of  the  dynamical  system  (Al)  belongs  to  C°°  class, 
it  follows  that  there  exists  an  upper  bound  Mn  for  \dn/dtnXj(xQ,t}\,n  ^  1-  Hence  we 
obtain 

\LnXj(^t]\<Mn    for  all    ?e[0,oo).  (All) 

We  define  a  parameter  M  =  supn(MM)^^.  It  can  be  seen  that 

,    , 

(A12) 


for  all  t  e  [0,  oo).  The  above  series  is  convergent  for  \2-nf  \  >  M(=  27r/c).  The  series  can 
then  be  summed  to  get 


This  illustrates  that  for  |2?r/j  >  M(=  27r/c),  the  power  spectrum  will  have  a  tail 


This  will  manifest  as  a  (7rr/sin7r/r)2  tail  for  a  discrete  time  series  sampled  at  a  time  step 
r  (due  to  aliasing),  consistent  with  our  numerical  results  for  the  power  spectra  of  chaotic 
dynamical  systems. 

Appendix  n 

Let  x(t)  be  a  stationary  stochastic  process  with  an  absolutely  continuous  spectral 
measure.  Let  {xj  =  x(t=JT)'tj  =  Q,...,N-l}  be  a  discretely  sampled  data  set. 
P(f,  N,  T}  can  be  written  as 

/>(/,  AT,  r}  =  Z-  V  V  (^e12*™-*).  (Al) 


iince  the  process  is  stationary  one  can  write 
implifying,  we  get 

/oo 
dfC(f)SN , 
•oo 


yhere 


=  r 


—  c(r(j  —  &)).  Substituting  this  and 


(A2) 


(A3) 


nd  C(/)  is  the  true  (unaliased)  power  spectrum.  P(/,  A7,  r)  can  be  equivalently 
epresented  as  (see  eq.  (22)) 


00  . 

P(f,N,r)=   £  C(f  +  ™)  +  *(/, N, r). 


(A4) 


Thus  it  is  only  the  true  spectral  density  C(f )  that  goes  as  input  to  the  analysis.  By 
lefinition,  a  generic  stationary  stochastic  process  is  characterized  by  a  spectral  measure 
yhich  is  absolutely  continuous,  and  is  nonzero  for  all  frequencies.  It  is  clear  from  (A4) 
hat  if  the  true  power  spectrum  is  nonzero,  so  is  the  aliased  power  spectrum.  This  implies 
hat  for  a  given  A7,  P(/,  N,  r}  of  a  generic  stationary  stochastic  process  consists  of  both  an 
/-independent  term  and  terms  depending  on  N.  In  the  following  analysis  we  calculate  the 
/  dependent  terms  explicitly  to  understand  which  term  dominates  in  the  large  N  limit. 

Let  z  =  27r(/  -/)r,  then 

pWz 

f  2Re 


e'z~l 


2  d 
-ir- 

Ndz 


e'z-l 


Hhis  can  be  evaluated  as 


_  T  sm2(Nz/2] 
N~N  sin2(z/2)  ' 

rhe  power  spectrum  now  takes  the  form 

r 


sin27rr(/-/0  ' 
Noting  that  S#  is  periodic  in  1/r,  the  above  integral  can  be  written  as 

P(f,N,r)  = 


Ar  . 
r     TTT    N  sin 


(A5) 


(A6) 


(A7) 


(A8) 


since  the  spectral  measure  is  absolutely  continuous,  the  power  spectrum  C(/)  is  smooth 
md  one  can  expand  C(f]  as  a  Taylor's  series  in  £/7rr,  to  give  the  power  spectrum  the 
bllowing  form 


P(f,N,r) 


E 

/=-00 


m=0 


/  +  ;)/. 

I       J 
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(A9) 


83 


From  (A8)  we  get 

OO       /          1  \ 

(A9) 


From  the  formal  solution  (A3)  it  is  easy  to  see  that 

d"  d" 

LBx,-(xo,r)=—  Xj(xo,f)Ur    and    LB*,-0  =  —  *,-(xo,OUo- 

Assuming  that  the  trajectory  ;c/(xo,  0  of  the  dynamical  system  (Al)  belongs  to  C°°  class, 
it  follows  that  there  exists  an  upper  bound  Mn  for  \dn/dtnxj(y.Q,t}\,n>  1.  Hence  we 
obtain 

\Lnxj(xQ,t)\<Mn    for  all    re[0,oo).  (All) 

We  define  a  parameter  M  =  supn(Mn}(l/n\  It  can  be  seen  that 

M  V 

(    ' 


for  all  f  e  [0,  oo).  The  above  series  is  convergent  for  \2irf  \  >  M(—  2-7r/c).  The  series  can 
then  be  summed  to  get 


This  illustrates  that  for  \2irf  \  >  M(=  27r/c),  the  power  spectrum  will  have  a  tail 

.  (A14) 


This  will  manifest  as  a  (vrr/sin  7r/r)2  tail  for  a  discrete  time  series  sampled  at  a  time  step 
T  (due  to  aliasing),  consistent  with  our  numerical  results  for  the  power  spectra  of  chaotic 
dynamical  systems. 

Appendix  II 

Let  x(t)  be  a  stationary  stochastic  process  with  an  absolutely  continuous  spectral 
measure.  Let  {*/  =  x(t  =/r);  j  =  0,. .  .,N-  1}  be  a  discretely  sampled  data  set. 
,N,r)  can  be  written  as 

_jV-l    N-l 

Ifcr/rO-*).  (A1) 
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Since  the  process  is  stationary  one  can  write  (*,**)  =  c(r(j  —  k)}.  Substituting  this  and 
simplifying,  we  get 


r°° 
,N,r}  =  \      dfC(f)SN, 

J  —  OO 


where 


cos27rr(/-/)/ 


(A2) 


(A3) 


and  C(/)  is  die  true  (unaliased)  power  spectrum.  ?(/,  AT,  r)  can  be  equivalently 
represented  as  (see  eq.  (22)) 


(A4) 


Thus  it  is  only  die  true  spectral  density  C(f)  tiiat  goes  as  input  to  the  analysis.  By 
definition,  a  generic  stationary  stochastic  process  is  characterized  by  a  spectral  measure 
which  is  absolutely  continuous,  and  is  nonzero  for  all  frequencies.  It  is  clear  from  (A4) 
that  if  the  true  power  spectrum  is  nonzero,  so  is  the  aliased  power  spectrum.  This  implies 
that  for  a  given  N,  P(f,  N,  r)  of  a  generic  stationary  stochastic  process  consists  of  both  an 
^-independent  term  and  terms  depending  on  N.  In  the  following  analysis  we  calculate  the 
N  dependent  terms  explicitly  to  understand  which  term  dominates  in  the  large  N  limit. 

Let  z  =  2ir(f  -f'}r,  then 


T  l+2Re 


JNZ  _l      2  d  T     ewz  -  1 

__  _.,-.,  TfYI    

e'z-l      Ndz       e/z-l 


This  can  be  evaluated  as 


_  T 
N~ 


sin2(z/2) 


The  power  spectrum  now  takes  the  form 


/OO  ' 

d//c(/)^ 
•OO  •<»       M 


in2  AM/"/') 


t*T(f-f'}  ' 
Noting  that  S#  is  periodic  in  1/r,  the  above  integral  can  be  written  as 


/2 


/ 


(A5) 


(A6) 


(A7) 


(A8) 


Since  the  spectral  measure  is  absolutely  continuous,  the  power  spectrum  C(/)  is  smooth 
and  one  can  expand  C(/)  as  a  Taylor's  series  in  £/TTT,  to  give  the  power  spectrum  the 
following  form 


(A9) 


/=— OO 


m=0 
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where 

-  (MO) 


It  is  easy  to  see  that 
/o  =  l 
/2m~0(ij     and    /2m-i=0,     m=l,....  (A12) 

With  the  help  of  these  results  the  power  spectrum  can  easily  be  written  as 


(A13) 

where  £(n)  is  the  Riemann  zeta  function  of  order  n. 

Since  the  first  term  (the  N  independent  part)  in  the  above  equation  is  nonzero  for  any 
absolutely  continuous  nonzero  spectral  measure  and  since  the  coefficient  of  l/N  is  finite, 
we  can  always  find  an  N  such  that  the  first  term  dominates.  This  implies  that  for  a  generic 
stationary  stochastic  process  an'  AT  independent  term  will  dominate  the  power  spectrum  at 
all  frequencies,  if  N  is  sufficiently  large. 

In  the  case  of  deterministic  dynamical  systems,  it  is  known  that  the  power  spectra  of 
regular  deterministic  dynamical  systems  are  discrete,  and  they  do  not  have  AMndependent 
part  at  high  frequencies.  Only  chaotic  systems  show  a  semblance  of  a  continuous  power 
spectrum.  Numerical  results  show  that  chaotic  systems  also  do  not  have  an  N-independent 
part  at  high  frequencies.  Thus  it  is  not  possible  that  the  power  spectra  of  deterministic 
dynamical  systems  are  dominated  by  an  ^-independent  part  at  high  frequency  region. 
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Abstract.  It  is  known  that  the  one-dimensional  discrete  maps  having  single-humped  nonlinear 
functions  with  the  same  order  of  maximum  belong  to  a  single  class  that  shows  the  universal 
behaviour  of  a  cascade  of  period-doubling  bifurcations  from  stability  to  chaos  with  the  change  of 
parameters.  This  paper  concerns  studies  of  the  dynamics  exhibited  by  some  of  these  simple  one- 
dimensional  maps  under  constant  perturbations.  We  show  that  the  "universality"  in  their  dynamics 
breaks  down  under  constant  perturbations  with  the  logistic  map  showing  different  dynamics 
compared  to  the  other  maps.  Thus  these  maps  can  be  classified  into  two  types  with  respect  to  their 
response  to  constant  perturbations.  Unidimensional  discrete  maps  are  interchangeably  used  as 
models  for  specific  processes  in  many  disciplines  due  to  the  similarity  in  their  dynamics.  These 
results  prove  that  the  differences  in  their  behaviour  under  perturbations  need  to  be  taken  into 
consideration  before  using  them  for  modelling  any  real  process. 

Keywords.    Bifurcation;  chaos;  ecology;  one-dimensional  map;  perturbation. 
PACS  Nos    05.45;  80.89.60;  03.20 

1.  Introduction 

The  field  of  dynamical  systems  and,  especially  the  study  of  chaotic  systems,  has  been 
hailed  as  one  of  the  important  breakthroughs  in  science  in  this  century.  Past  three  decades 
have  seen  an  explosion  of  interest  in  the  study  of  nonlinear  dynamical  systems  in  almost 
all  disciplines,  and  chaotic  and  random  behaviour  of  solutions  of  deterministic  systems  is 
now  understood  to  be  an  inherent  feature  of  many  nonlinear  systems.  This  has  been 
possible  due  to  infusion  of  interest  and  techniques  from  a  variety  of  fields,  from  physics 
and  chemistry  to  biology  and  social  sciences  as  most  natural  systems  seem  to  be  complex 
and  nonlinear.  With  renewed  interest  in  low  dimensional  discrete  dynamical  systems  in 
physics  and  other  disciplines  such  as  mathematical  ecology  and  economics,  it  was  shown 
that  even  very  simple  mathematical  models  can  actually  exhibit  very  complicated 
dynamics.  In  his  seminal  paper,  May  [1]  urged  that  simple  one-dimensional  maps  should 
be  part  of  early  mathematical  education  since  on  one  hand  it  is  simple  to  analyse,  yet  on 
the  other  hand,  the  range  of  complex  dynamics  exhibited  by  it  can  introduce  the  concepts 
of  variability,  chance  and  imbalance  as  perfectly  natural  behaviour  expected  from  natural 
systems. 

The  general  form  of  a  one-dimensional  discrete  equation  (map)  is 

(1) 
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where  Xt+\  is  a  function  of  the  discrete  variable  X  at  the  earlier  (i.e.  rth)  iteration  and 
some  parameters.  Historically  [1,  2],  Xt  has  been  studied  as  the  population  density  of 
organisms  (whose  generations  do  not  overlap)  at  the  rth  generation.  If  the  function  f(Xt} 
is  such  that  it  has  the  propensity  to  increase  monotonically  when  X  is  small  and  to 
decrease  monotonically  when  it  is  large  due  to  density-dependent  processes,  then  it 
assumes  a  "hump"-like  shape  with  a  unique  maximum. 

More  than  two  decades  ago  [1,  3-5]  the  numerous  discrete  models  with  single-hump 
growth  functions  existing  in  the  ecological  literature  were  placed  under  a  general  class  of 
unidimensional  maps  based  on  the  similarity  in  their  dynamic  behaviour  with  increasing 
growth  rates.  It  was  shown  that  all  these  maps  display  a  "universal"  bifurcation  structure 
where,  with  the  increase  in  a  parameter  (e.g.,  the  growth  rate),  the  system  dynamics 
changes  from  stable  equilibrium  point  to  chaos  through  the  period-doubling  route,  and  the 
chaotic  regime  contains  an  ordered  sequence  of  periodic  windows  [1,  2,  6].  This 
similarity  in  bifurcation  structure  of  these  one-dimensional  maps  initiated  the  use  of  these 
models  interchangeably  to  describe  experimental  population  data  in  ecology  [7-12]. 

Though  it  is  known  that  all  one-hump  maps  having  the  same  order  of  maximum  belong 
to  the  same  universality  class  [13],  the  precise  conditions  that  determine  if  a  single-hump 
nonlinear  function  would  belong  to  the  "universal"  class  are  not  clear  [14].  Recent 
studies  have  shown  that  slight  perturbations  of  a  map  can  cause  breakdown  or  distortion 
of  universal  behaviour  [14-18],  These  studies,  from  mathematical  ecology  again, 
describe  the  different  responses  of  two  maps  (logistic  and  exponential)  belonging  to  the 
same  universality  class  when  an  additive  disturbance  (can  be  termed  as  perturbation  or 
bias)  is  imposed  on  the  nonlinear  functions. 

In  this  paper  we  study  a  few  representative  maps  that  are  used  commonly  in  theoretical 
studies  and  also  to  model  real  systems  [1,  19,  20]  under  such  perturbation  and  show  the 
similarities  and  differences  of  their  response.  The  major  results  are: 

(a)  the  logistic  map  shows  very  different  response  to  perturbations  compared  to  other 
maps  of  the  same  universality  class  which  have  long  'tail'  or  'plateaus'  at  high  X; 
and 

(b)  the  response  dynamics  depends  on  the  details  of  the  shape  of  the  hump  -  positive 
perturbation  enhances  the  effect  of  the  'tail',  whereas,  negative  perturbation  exposes 
the  gradient  of  the  upper  part  of  the  hump.  Thus,  the  logistic  map  stands  out  among 
other  maps  whose  'plateaus'   and  asymmetric  shapes  give  rise  to  a  variety  of 
interesting,  unexpected  and  unusual  dynamics.  Since  these  latter  type  of  maps  are 
considered  to  be  more  realistic,  one  may  anticipate  many  unexpected  dynamics  in 
nature. 

The  paper  is  organized  as  follows: 

(i)  Three  commonly  used  one-dimensional  maps  and  the  types  of  perturbations  used  for 

this  study  are  introduced. 

(ii)  The  effect  of  the  perturbations  on  the  dynamics  of  these  maps  is  shown, 
(iii)  The  results  are  discussed  with  a  view  to  delineate  the  differences  in  these  maps. 

(i)  One-dimensional  maps 

We  choose  the  following  three  one-dimensional  equations  with  the  specific  single  hump 
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Table  1.    Properties  of  one  dimensional  single  hump  maps. 


Maps 

x-Domain 

r-Range 

Equilibrium  points 
/(*)=* 

Critical  point 

Logistic  map 
Exponential  map 

Bellows  map 

0 
0 

0 

<x<  I 

<X<00 

<  X  <  00 

1 
0 

1 

<  r  <4 
<  r 

<  r 

Oand 
Oand 

Oand 

(r-l)/r 
1 

1/2 
1/r 

,   1/6 

\                    ^ 

functions  for  our  study: 

(I)  Logistic  model: 

Xt+l=rXt(l-Xt).  '     (2a) 

(II)  Exponential  model: 

X,+i=X,exp[r(l-X,)]-  (2b) 

(HI)  Bellows  model: 

*'+1  =  (Tpi*)-  (2c) 

Here  V  and  '£'  are  parameters.  These  three  models  were  chosen  because  they  are 
most  commonly  used,  both  theoretically  and  experimentally  [1,  3-5,  11,  12,  14,  19-22]. 
Though  eq.  (2a)  is  the  simplest  among  the  non-linear  difference  equation  models, 
the  other  two  equations  (2b),  (2c)  have  more  secure  provenance  in  the  biological 
literature  [1,  12,  14,  21].  Table  1  summarizes  the  detailed  features  of  these  models,  and 
figures  (la-f)  show  the  return  maps  (Xt  versus  Xt+\  plot)  and  the  bifurcation  diagrams 
with  increasing  r  for  each  model.  Figure  1  clearly  describes  the  similar  "inverted  U- 
shape"  (hump)  nature  of  /(X)-the  nonlinear  function-in  the  return  maps,  and  the 
commonality  in  their  dynamics  of  progression  from  stable  point,  to  a  hierarchy  of  stable 
cycles  of  period  2n,  to  chaos  with  increasing  r.  These  properties  are  observed  in  all 
models  from  this  class  of  unidimensional  single-hump  equations  [1,  2,  5]. 

Constant  perturbation.  Under  an  additive  constant  perturbation  the  one-dimensional 
equation  (1)  takes  the  form 

Xt+l=f(Xt)+L  =  F(Xt)  (3) 

where  L  is  a  parameter  which  can  take  both  positive  and  negative  values.  The  effect  of  a 
fixed  positive  and  negative  L  onf(X)  is  to  move  the  humps  shown  in  the  return  maps  in 
figures  (la,  Ic  and  le)  above  or  below  the  X-axis  respectively.  As  examples  this  is  shown 
for  the  logistic  model  for  r  =  3.9  in  figure  2a,  and  for  the  Bellows  model  f or  r  =  5  and 
b  =  6  in  figure  2b.  The  continuous  curves  represent  the  unperturbed  return  maps  (eqs  (2a) 
and  (2c))  and  the  dashed  and  dotted  curves  are  for  the  perturbed  maps  (+L  and  -L 
respectively)  in  eq.  (3).  The  exponential  model  is  similar  to  the  Bellows  model. 

It  is  clear  from  figure  2  that  the  perturbations  change  the  intersection  of  the  return  maps 
(dashed  curve  and  dotted  curve)  with  the  Xt+i  =  Xt  line  (solid  line),  thus  changing  the 
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Figure  1.  Return  maps  and  bifurcation  diagrams,  respectively,  for  the  (a,  b)  logistic, 
(c,  d)  exponential,  and  (e,  f)  Bellows  maps.  Unless  otherwise  stated  the  initial 
conditions  are  chosen  to  be  X  at  which /(X)  =  0  in  all  figures. 


number  of  positive  equilibrium  points  and  their  stability.  With  negative  L,  there  are  two 
equilibrium  points  fa,  x2  in  figure  2),  and  the  first  equilibrium  point  (ATI)  is  unstable 
leading  all  iterations  below  that  critical  value  to  zero.  The  dynamics  exhibited  by  these 
maps  depend  on  the  gradient  of  the  hump  at  the  second  equilibrium  point  (jc2).  The  other 
major  change  that  negative  perturbation  introduces  in  the  exponential  and  Bellows  maps 
is  that  these  two  maps  are  no  longer  valid  for  all  X  >  0,  thus  introducing  limits  in  both  r 
and  X  within  which  the  iterations  will  remain  positive.  In  the  case  of  the  exponential  and 
the  Bellows  maps  (see  figure  2b),  positive  L  creates  a  higher  'floor'  as  the  tail  of  the  maps 
are  elevated.  This  does  not  happen  in  the  case  of  the  logistic  map  and  it  continues  to  be 
valid  only  within  a  range  of  X.  In  this  paper  whenever  X  assumes  zero  or  negative  values 
it  is  termed  as  'escape'. 
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Figure  2.  Return  maps  of  the  (a)  logistic,  and  (b)  Bellows  maps.  Continuous  curve  - 
unperturbed  map,  Dashed  curve  -  maps  with  positive  perturbation  (+L).  Dotted  curve 
-  maps  with  negative  perturbation  (— L). 

(ii)  The  dynamics  of  the  perturbed  maps 

The  dynamics  exhibited  by  the  three  models  with  positive  and  negative  L  are  shown 
through  bifurcation  and  stability  diagrams  in  the  r  —  L  parameter  space  (r  —  L 
diagrams).  Interestingly  whatever  be  the  magnitude  of  L  (except  when  it  pushes  the 
system  to  "escape"),  the  dynamical  system  always  returns  to  its  unperturbed  state  on 
withdrawal  of  the  perturbation.  Therefore,  we  have  studied  the  response  of  the  maps  to 
large  ranges  of  L  values.  Figures  3,  4  and  5  are  the  bifurcation  diagrams  showing  the 
dynamics  of  the  perturbed  maps  with  increasing  L  at  two  r  values  corresponding  to  stable 
and  chaotic  dynamics  at  the  unperturbed  state. 

Logistic  map.  Figures  3a-d  show  the  dynamics  exhibited  by  the  logistic  map  with 
increasing  positive  and  negative  perturbations  for  stable  r  (r  =  2.0)  and  for  chaotic  r 
(r  =  3.9)  respectively.  It  is  clear  that  positive  perturbation  induces  (figure  3a)  or 
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Figure  3.    Bifurcation  diagrams  with  L  for  the  logistic  map  for  stable  r  = '. 
(a)  positive  L,  (b)  negative;  and,  L  chaotic  r  —  3.9  (c)  positive  L,  (d)  negative  L. 

maintains  (figure  3c)  chaotic  dynamics  for  all  r,  though  'escape'  occurs  for  very  small 
perturbations  in  a  chaotic  logistic  map.  On  the  other  hand,  negative  perturbation  induces 
stability  in  a  chaotic  map  (figure  3d),  though  'escape'  ensues  for  stable  r  at  relatively  low 
negative  L  (figure  3b).  Thus  the  final  response  of  the  logistic  map  to  both  types  of 
perturbations  is  opposite  and  also  independent  of  r. 

Exponential  map.  Figures  4a-d  show  the  effects  of  increasing  positive  and  negative 
perturbations  in  the  exponential  map  for  stable  (r  =  1.8)  and  chaotic  r  (r  =  3) 
respectively.  In  contrast  to  the  logistic  map  (figure  3a),  a  stable  exponential  map  remains 
stable  under  both  types  of  perturbations  (figures  4a  and  4b),  and  a  small  amount  of 
positive  perturbation  can  induce  stability  in  a  chaotic  map  also  (figure  4c).  The  response 
of  the  chaotic  map  to  negative  perturbation  is  however  unusual.  Though,  like  the  logistic 
map  in  this  case  also  increasing  negative  perturbation  shows  overall  period  reversal  to 
stability  from  chaos,  yet  this  map  shows  an  intervening  band  of  L  values  at  which 
'escape'  occurs,  but  positive  iterates  recur  for  a  range  of  higher  L  beyond  which  iterates 
'escape'  for  all  L.  Interestingly,  though  in  this  map  both  types  of  perturbations  finally 
render  the  map  stable  for  all  r,  'escape'  is  observed  with  — L  only  for  parameter  (r)  values 
corresponding  to  higher  period  (P  >  4)  dynamics. 

Bellows  map.  It  is  clear  from  figures  5a-d  that  the  response  of  the  bellows  map  is  very 
similar  to  the  exponential  map  for  stable  r  (r  =  1.2)  under  negative  L,  and  for  chaotic  r 
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Figure  4.    Bifurcation  diagrams  with  L  for  the  exponential  map  for  stable  r  —  1 . 
(a)  positive  L,  (b)  negative;  and,  L  chaotic  r  =  3.0  (c)  positive  L,  (d)  negative  L. 


(r  —  3.0)  under  both  positive  and  negative  L.  The  stable-unstable-stable  'bubble'  in 
figure  5  a  with  increasing  positive  perturbation  in  a  stable  map  is  a  reminder  of  the  logistic 
map  (see  figure  3a)  except  that  further  period  doubling  bifurcations  are  curtailed  here  and 
period  reversal  gives  rise  to  stable  dynamics  at  higher  L.  It  is  also  interesting  to  note  that, 
like  the  logistic  map  (figure  3b),  stable  bellows  map  (figure  5b)  also  shows  'escape'  for 
small  negative  perturbation.  This  map  has  an  additional  parameter  'b'  which  has  a  role  in 
deciding  the  shape  of  the  hump  and  hence  the  bifurcation  scenario.  We  have  taken  b  =  6 
here  which  imposes  strong  nonlinearity  in  the  maps,  though  similar  behaviour  is  observed 
for  lower  b  (>  2)  also. 

To  get  a  complete  picture  of  the  effect  of  the  constant  perturbations  on  the  dynamics  of 
the  maps  for  a  range  of  values  of  the  nonlinear  parameter  r,  we  have  used  a  combined 
analytical  and  numerical  approach  to  obtain  the  stability  regions  in  the  r  —  L  parameter 
space.  Figure  6  shows  three  r  —  L  diagrams  which  provide  a  description  of  the  different 
dynamical  behaviour  such  as,  stable  fixed  point  (P  =  1),  stable  oscillations  of  period  2 
(P  =  2),  and  complex  oscillations  of  higher  periods  including  chaos  (P  >  4)  that  are 
exhibited  by  the  models  for  different  values  of  r  and  L.  The  space  outside  the  demarcated 
region  constitutes  the  r  and  L  values  that  lead  to  'escape'.  Since  positive  and  negative  L 
moves  the  hump  vertically  up  and  down,  the  range  of  X  and  r  values  for  which  the  logistic 
map  is  valid  also  changes  with  L.  For  negative  L,  the  exponential  and  Bellows  maps  both 
resemble  the  logistic  map  since  they  now  do  not  have  the  extended  tail  and  hence  exist 
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Figure  5.    Bifurcation  diagrams  with  L  for  the  bellows  map  for  stable  r  =  1.2. 
(a)  positive  L\  and,  (b)  negative  L;  and,  chaotic  r  —  3.0  (c)  positive  L,  (d)  negative  L. 


only  within  a  certain  range  of  r.  Therefore  they  also  show  the  'escape'  region  for  higher 
negative  perturbations. 
The  boundaries  in  the  r  —  L  plot  in  figure  6  can  be  obtained  as  follows: 

(a)  The  curve  that  forms  the  lower  bound  for  all  maps  in  the  r  -  L  plot  for  negative  L  is 
obtained  by  finding  the  values  of  L  for  every  r  at  which  a  saddle-node  bifurcation 
takes  place  [20]. 

(b)  The  intervening  'escape'  region  observed  in  the  exponential  and  bellows  maps  at 
higher  r  occurs  for  intermediate  values  of  L  at  which,  F(Xmayi}  —  x\  <  0  where  x\  is 
the  first  unstable  equilibrium  point  (cf.  figure  2b).  Thus,  the  boundary  in  the  r  -  L 
parameter  space  can  be  obtained  by  considering  values  of  r  and  L  which  satisfy  the 
expression 

•F(-^max)  ~  Xi  =  0. 

This  condition  is  never  satisfied  in  the  case  of  the  logistic  map. 

(c)  The  uppermost  curve  for  +L  separating  the  'escape'  region  is  observed  only  for  the 
logistic  map  (figure  6a).  In  the  logistic  map,  there  is  one  L  for  every  r  at  which 
^2(^max)  <  0,  i.e.  the  iterate  falls  outside  the  hump  on  the  JC-axis.  Both  exponential 
and  Bellows  maps  do  not  have  this  region  since  they  have  'tails'  for  high  X  (for  +L) 
which  force  the  iterates  to  be  positive. 
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Figure  6.  Stability  diagrams  in  r-L  para- 
meter space  for  the  (a)  logistic  map, 
(b)  exponential  map,  and  (c)  Bellows  map. 

(d)  The  boundaries  of  the  first  two  period  doubling  bifurcations  (P  =  2  and  P  -  4)  in 
figures  6a-c  are  obtained  by  solving  the  equations 

Fn(X)=X    and     [Fn(X)]r  =  -1,     n=l,2. 

Figure  6  clearly  shows  that  the  response  of  the  logistic  map  to  perturbations  is  very 
different  when  compared  to  that  of  the  exponential  and  the  Bellows  maps.  This  feature 
clearly  indicates  that  all  these  maps  can  not  be  treated  at  par. 

Though  from  above  the  exponential  and  the  Bellows  maps  seem  to  have  very  similar 
response  to  both  positive  and  negative  perturbations,  comparison  of  figures  4a  and  5a 
indicate  finer  differences  between  them.  The  'bubble'  of  instability  in  the  bellows  map 
(figure  5a)  is  a  feature  which  arises  due  to  the  detailed  shape  of  f(X)  near  the  critical 
point.  Figure  7  exemplifies  this  for  the  three  maps.  In  this  figure  the  continuous  curve 
shows  the  slope  of  the  map  functions  /'  (X)  for  all  X  starting  after  the  critical  point  for  the 
r  value  at  which  the  first  bifurcation  from  stability  to  oscillation  occurs.  The  dotted  curve 
represents  the  quantity  {/(X)  -  X}. 
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Figure  7.    Slope  of  the  nonlinear  functions 
Too      (/'  (X))  and  the  curve  for  {/(X)  -  X}  for  the  (a) 
logistic,  (b)  exponential  and  (c)  bellows  maps. 


It  is  clear  from  these  figures  that  the  equilibrium  point  xi  (cf.  figure  2b)  where 
{/(X)  -X}  =0  occurs  is  also  the  point  at  which  /(X)  =  -1.  The  major  point  to 
be  noted  is  that  there  are  two  values  of  X  at  which  /'(X)  =  -1  for  the  Bellows  map 
in  figure  7c  as  opposed  to  the  exponential  map  in  figure  7b.  Here  (figure  7c), 
the  equilibrium  point  is  closer  to  the  critical  point  of  the  map  when  the  first  bifurcation 
(at  r=1.5)  occurs.  A  lower  value  of  r  will  obviously  show  stable  dynamics  in 
the  unperturbed  map  and  the  equilibrium  point  will  be  on  the  left  of  the  first  intersection 
of/'{X)  =  -1.  A  constant  positive  perturbation  (i.e.  +L)  to/(X)  at  stable  r  will  move 
the  equilibrium  point  to  the  right  on  the  X-axis.  The  system  will  then  exhibit  stable- 
unstable-stable  'bubble1  as  L  is  increased  due  to  the  presence  of  the  two  intersections 
of  f(X]  -  -1  in  the  Bellows  map.  Thus  it  is  clear  from  this  figure  that  there  exist  a 
range  of  stable  r  values  in  this  map  where  positive  perturbations  can  give  rise  to  "bubble" 
like  dynamics.  Such  a  situation  never  arises  for  both  the  logistic  and  the  exponential  maps 
due  to  the  shape  of  their  humps  (see  figures  7a,b). 


2.  Discussion 

The  above-mentioned  results  clearly  demonstrate  that  small  perturbations  can  delineate 
the  differences  among  the  maps  which  are  considered  to  belong  to  the  same  universality 


class  and  have  similar  bifurcation  dynamics.  They  can  now  be  classified  based  on  their 
response  to  constant  perturbations  which  is  model-specific.  The  main  features  that  play 
important  roles  in  the  manifestation  of  the  differences  in  dynamics  in  these  maps  under 
perturbations  are 

1.  The  presence  of  the  'tails'  in  the  nonlinear  functions  in  the  exponential  and  Bellows 
maps  at  higher  X.  This  gives  rise  to  higher  'floors'  (plateaus  which  flattens  out  at  high 
X)  with  positive  perturbations,  leading  to  period  reversals  to  stability  [14]. 

2.  The  detail  shapes  of  the  nonlinear  functions  near  the  maximum  at  medium  values  of 
X.  These  are  highlighted  in  the  form  of  'bubbling'  under  positive  perturbations,  or,  the 
minimum  iterate  (F(Xmiai))  falling  below  the  first  unstable  equilibrium  point  under 
negative  perturbations. 

3.  The  symmetric,  purely  concave-down  structure  of  the  hump  in  the  logistic  model  does 
not  allow  any  of  the  above  processes  to  occur  under  constant  additive  perturbations.  In 
fact  the  nonlinear  parameter  r  in  the  logistic  map  can  be  easily  scaled  to  include  the 
effect  of  the  perturbation  (L).  This  can  explain  the  changes  in  the  dynamics  with 
positive  and  negative  perturbation  in  the  logistic  map  in  terms  of  increased  or 
decreased  effective  r. 

These  simple  one-dimensional  discrete  equations  studied  here  have  been  extensively  used 
for  describing  density-dependent  population  growth  of  organisms  having  non-overlapping 
generations  [3,  5,  9,  23,  24],  In  ecological  terms,  the  constant  perturbation  can  be 
considered  as  constant  migration  to/from  a  population.  We  have  shown  that  these  maps 
do  not  behave  similarly  under  perturbations  thus  exposing  the  inherent  non-equivalence 
among  them.  Our  results  have  far-reaching  implications  not  only  in  the  study  of  one- 
dimensional  non-linear  discrete  systems  in  general  but  also  in  the  area  of  population 
dynamics  and  resource  management  studies.  Thus  one  can  say  that  though  all 
unidimensional,  discrete,  single-hump  maps  show  similar  bifurcation  dynamics 
[1,  2,  5],  the  exact  nature  of  the  non-linear  functions  plays  an  important  role  in  their 
different  dynamic  response  to  simple  perturbations. 
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Chaos  in  modulated  logistic  systems 
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Abstract.  This  paper  is  a  review  of  the  work  done  on  the  dynamics  of  modulated  logistic  systems. 
Three  different  problems  are  treated,  viz,  the  modulated  logistic  map,  the  parametrically  perturbed 
logistic  map  and  the  combination  map  obtained  by  combining  two  maps  of  the  quadratic  family. 
Many  of  the  interesting  features  displayed  by  these  systems  are  discussed. 

Keywords.    One  dimensional  maps;  universality;  parametric  perturbation;  scaling  relations. 
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1.  Introduction 

The  complicated  dynamical  behaviour  that  can  arise  in  the  low  dimensional  deterministic 
nonlinear  systems  has  generated  a  lot  of  intense  research  in  the  last  two  decades  [1,2]. 
These  studies  have  shown  that  simple  physical  systems  obeying  simple  laws  can  often 
show  complex  and  chaotic  behaviour.  Since  several  nonlinear  systems  can  be  analysed 
using  one  dimensional  maps,  such  maps  have  come  to  play  an  important  role  in 
understanding  many  of  the  general  features  of  chaotic  systems.  One  of  the  simplest  one 
dimensional  maps,  namely  the  logistic  map,  originally  introduced  in  the  study  of  the 
population  dynamics  of  non  overlapping  generations  in  biology  [3]  has  become  a 
paradigm  for  such  systems.  The  logistic  map  is  a  one-hump  map  of  the  form 

Xn+i=4XXn(l-Xn)  (1) 

defined  in  the  interval  (0, 1)  and  is  characterized  by  a  control  parameter  A  which  is  also 
varied  in  the  interval  (0, 1).  This  system  undergoes  a  transition  to  chaos  via  the  well- 
known  period  doubling  route  and  shows  universal  metric  [4]  as  well  as  structural 
properties  [5]. 

One  of  the  earliest  experimental  observations  of  the  period  doubling  route  to  chaos  was 
in  CO2  laser  [6].  However,  in  many  of  the  experimental  situations  the  control  parameter 
can  have  a  time  dependence  of  its  own.  Therefore,  it  would  be  interesting  to  consider  the 
dynamics  of  a  map  with  a  time-dependent  control  parameter. 

In  one  of  the  earlier  studies,  Kapral  and  Mandel  [7]  investigated  a  non  autonomous 
quadratic  map  wherein  the  control  parameter  was  assumed  to  vary  linearly  with  time. 
They  have  observed  that  this  time  dependence  of  the  parameter  delayed  the  onset  of 
bifurcations  in  the  system.  The  system  also  showed  bistability  and  hysteresis.  Ruelle  had 
suggested  the  study  of  dynamical  systems  with  adiabatically  fluctuating  parameters 
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where  the  control  parameter  has  a  very  slow  variation  in  time  and  this  time  dependence 
itself  might  be  determined  by  a  dynamical  system  [8]. 

In  this  paper  the  dynamics  of  the  modulated  logistic  system  is  presented.  We  use  the 
term  modulated  logistic  system  (MLS)  to  describe  the  following  three  situations: 

(i)  A  logistic  map  in  which  the  control  parameter  has  a  discrete  time-dependence 
determined  by  a  dynamical  system  -  a  modulated  logistic  map  (MLM). 

(ii)  A  logistic  map  wherein  the  control  parameter  is  perturbed  by  the  addition  of  a 
periodic  term  (parametrically  perturbed  logistic  map). 

(iii)  A  combination  map  where  the  logistic  map  is  combined  with  another  map  of  the 
same  quadratic  family. 

The  paper  is  organized  as  folllows.  In  §  2,  we  introduce  the  MLM  and  present  some 
of  the  earlier  results  of  Harikrishnan  and  Nandakumaran  [9-12].  Some  of  the  related 
works  in  this  field  are  also  discussed  in  the  section  [13].  We  assume  that  the  time 
evolution  of  the  control  parameter  is  determined  by  another  logistic  map.  This  coupled 
system  shows  many  interesting  features  such  as  the  locking  of  the  periodicities  of  the 
two  subsystems.  Section  3  deals  with  the  parametrically  perturbed  logistic  map.  Here 
the  control  parameter  of  the  logistic  map  is  perturbed  by  a  sequence  of  periodic 
pulses.  Such  a  system  is  relevant  for  modelling  those  systems  which  are  subjected  to 
periodic  stimuli.  The  dynamics  of  the  system  shows  many  interesting  and  novel  features 
such  as  the  formation  of  bubble  structure  in  the  bifurcation  diagram,  emergence  of 
periodic  behaviour  after  a  chaotic  regime  etc.  The  combination  map  is  presented  in  §  4. 
This  is  a  one  hump  map  obtained  by  adding  a  sinusoidal  map  to  the  logistic  map  (both  the 
maps  belong  to  the  same  universality  class).  It  is  found  that  the  scaling  index 
characterizing  the  behaviour  of  the  Lyapunov  exponent  near  the  onset  of  the  chaotic 
transition  is  different  from  that  of  a  single  one-hump  map.  Section  5  contains  the 
concluding  remarks. 


2.  Modulated  logistic  map 

In  this  section  we  briefly  review  the  results  obtained  by  Harikrishnan  and  Nandakumaran 
on  MLM  [9-12].  An  MLM  is  defined  by  the  following  pak  of  equations: 

Xn+l=4XnXn(l-Xn},  (2) 

l-An)  (3) 


with  0  <  Xrt,  An,  /j,  <  1.  Here  /j,  plays  the  role  of  the  control  parameter.  This  system  is 
analogous  to  the  one  suggested  by  Ruelle  although  in  the  present  case  the  time  evolution 
of  the  A-system  is  not  slow,  but  is  of  the  same  order  as  that  of  the  X-system.  As  jj,  is 
varied  continuously,  the  MLM  undergoes  a  sequence  of  period  doubling  bifurcations. 
However,  the  bifurcation  diagram  for  X  is  very  much  different  from  the  one  for  the  A- 
system  [11]. 

In  order  to  establish  the  universality  properties  of  the  map  we  have  determined  the 
Feigenbaum  constant  6  using  the  method  of  eigenvalue  matching  renormalization  due  to 
Derrida  et  al  [14,  15].  The  basic  idea  of  the  method  is  the  following.  Let  us  represent  the 

Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
100  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


Chaos  in  modulated  logistic  systems 
MLM  as 

(v     ,    X       \  —  T  (Y     \  }  (A\ 

(An+li  An+i)  —  lfj,\An,An).  (4-) 

/  \ 

We  linearize  the  map  Tj^  in  the  neighbourhood  of  an  n-cycle.  With  each  /u  we  associate 
a  jjf  such  that  the  linearization  of  T^'  around  a  point  of  cycle  n  is  identical  to 
linearization  of  T^n'  around  a  point  of  cycle  2n.  This  gives  rise  to  a  relation  connecting  fj, 
and  [jf,  the  fixed  point  of  which  is  given  by  ^  universal  constant  8  is  then  given  by 


8  = 


(5) 


The  details  of  the  calculations  are  presented  in  [10].  In  the  first  order  approximation, 
we  choose  n  =  1,  this  gives  a  value  of  5  =  4.4339...  which  is  consistent  with  the 
Feigenbaum  constant  for  the  logistic  map.  Better  agreement  can  be  obtained  by  choosing 
higher  values  for  n. 

One  of  the  interesting  features  that  we  have  observed  in  MLM  is  that  the  periodicity  of 
X  is  identical  to  that  of  A  except  in  the  range  of  values  0.848  <  0.860  [9,  13]  where  A  has 
a  periodicity  of  2  while  X  has  a  periodicity  of  4.  As  fj,  is  varied  A  period  doubles  and  so 
does  the  variable  X.  This  locking  of  the  periodicities  of  the  two  is  a  special  case  of  the 
more  general  situation  described  by  Batra  and  Varma.  For  a  given  value  of  fj,  they  have 
derived  conditions  under  which  the  X-system  is  in  a  state  of  periodicity  n  or  in  a  state  of 
periodicity  which  is  an  integral  multiple  of  n,  when  the  A-system  is  in  a  stable  state  of 
periodicity  n.  For  the  MLM  the  two  periodicities  are  identical  except  in  the  range  of  (j, 
mentioned  above.  This  result  has  an  important  consequence.  It  shows  that  the  MLM  has  a 
stable  n-cycle  for  the  same  range  of  fj,  for  which  the  logistic  map  has  an  n-cycle.  Thus  the 
periodicity  of  X  is  enslaved  to  the  periodicity  of  A.  This  implies  that  the  ordering  of  the 
cycles  in  MLM  is  the  same  as  that  of  the  logistic  map.  Thus  the  Sarkovskii  ordering  [16] 
which  is  an  universal  structural  property  of  unimodel  maps  is  maintained  in  the  MLM. 
Batra  and  Varma  have  suggested  other  possible  2D-sy stems  that  may  show  Sarkovskii 
ordering.  These  universal  properties  of  MLM  makes  it  suitable  to  model  physical  systems 
in  which  the  control  parameter  is  time-dependent. 

3.  Parametrically  perturbed  logistic  map 

In  this  section  we  consider  a  logistic  map  whose  control  parameter  is  perturbed  by  the 
addition  of  a  periodic  term  [17].  Instead  of  changing  A  continuously  it  is  changed  in  a 
discrete  sequence  of  pulses,  repeated  periodically,  the  envelope  of  the  amplitudes  of  these 
pulses  forming  a  positive  sine  profile.  Maps  where  the  control  parameter  is  perturbed  by  a 
sequence  of  pulses  may  be  relevant  in  the  study  of  certain  biological  systems  subjected  to 
periodic  stimuli  [18].  A  logistic  system  with  a  similar  perturbation  but  with  a  cosine 
profile  that  includes  both  the  positive  and  the  negative  half  cycles  has  been  considered 
earlier  [19],  It  has  been  shown  that  the  map  undergoes  a  transition  from  a  fixed  point  to  a 
state  which  has  the  periodicity  of  the  perturbation.  We  observe  that  the  map  has  some 
novel  features  not  considered  earlier.  Quadrative  maps  with  additive  periodic  forcing  that 
leads  to  bistability  and  the  coexistence  of  multiple  attractors  have  been  studied  by  Sanju 
and  Varma  [20]. 


Figure  1.    Bifurcation  diagram  of  the  parametrically  perturbed  map  for  q  =  3. 
i-o 


o-x 


Figure  2.    Bifurcation  diagram  for  5  =  8. 
The  parametrically  perturbed  logistic  map  is  defined  by  the  equations 


with 


A  =  A0  +  A*  sm(7rcdn)  ,     (un)  mod  1  .  (6) 

This  represents  a  train  of  pulses  whose  periodicity  is  determined  by  u. 

If  u}  —p/q,  the  control  parameter  forms  a  periodic  <y-sequence.  In  what  follows  we 
chose  q=l.u  can  also  be  chosen  as  irrational.  The  dynamics  of  the  map  (6)  is  studied 
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Figure  3.    Bifurcation  diagram  for  q  =  (51/2  —  l)/2. 

numerically  by  fixing  AO  and  varying  the  strength  of  the  perturbation  amplitude  A*.  To 
obtain  a  wide  tunability  for  A*  we  fixed  a  low  value  for  AO  =  0.1.  For  this  value  of  AO,  the 
unperturbed  logistic  map  has  a  stable  fixed  point  X*  =  0.  The  bifurcation  diagrams  are 
drawn  for  q  =  3, 8  and  (51/2  -  l)/2  which  is  the  golden  mean  (figures  1,2  and  3).  For 
q  =  3,  when  A*  is  increased  the  fixed  point  X*  =  0  becomes  unstable  and  the  system 
makes  a  transition  to  a  stable  3-cycle  and  remains  locked  to  it  for  a  very  long  range  of  A*. 
After  this,  period  doublings  follow  in  quick  succession  followed  by  a  chaotic  regime. 
When  A*  is  increased  still  further  the  iterates  leave  the  interval  (0, 1)  and  escape  to 
infinity.  Qualitatively  similar  features  are  seen  for  #  =  8.  However,  for  q  =  8,  the  chaotic 
region  is  followed  by  the  reappearance  of  8-cycles  which  further  period  doubles  and  goes 
to  chaos.  Some  of  the  other  novel  features  observed  in  this  system  have  not  been  included 
in  this  paper  since  they  form  the  contents  of  another  paper  which  has  been  sent  for 
publication.  When  q  =  (51/2  —  l)/2,  the  stable  fixed  point  becomes  unstable  as  A*  is 
increased  and  goes  to  a  quasiperiodic  state  (figure  3)  and  remains  in  that  state  until 
escape.  When  uj  is  chosen  as  irrational  there  are  additional  interesting  phenomena  such  as 
strange  nonchaotic  attractors  [21].  However,  we  do  not  consider  this  in  the  present  case. 

It  is  instructive  to  study  the  superstable  cycles  in  this  map.  For  superstable  cycle, 
x  =  1/2  is  one  of  the  cycle  elements.  Metropolis  et  al  [5]  have  studied  extensively  the 
sequence  of  iterates  starting  from  the  extremum  at  x  —  1/2.  These  sequences  are  often 
called  the  MSS  sequences.  The  superstable  cycles  are  represented  by  a  sequence  of 
symbols  R  and  L  depending  on  whether  the  iterate  falls  on  the  right  (R)  or  the  left  (L)  of 
the  extremum  at  x  =  1/2.  We  have  done  a  similar  analysis  for  the  map  (6)  and  the  results 
are  tabulated  in  table  1  for  various  #- values.  It  is  clear  from  the  table  that  the  map  (6)  can 
support  sequences  that  do  not  fall  under  the  MSS  classification.  In  fact  the  first 
superstable  sequence  for  each  a;  contains  only  the  symbol  L.  In  the  table  the  sequences 
indicated  by  a  star  belong  to  the  MSS  classification. 

The  parametrically  perturbed  map  is  of  importance  since  it  may  be  used  to  model  many 
experimental  situations  such  as  a  laser  system  [22]  where  the  control  parameter  would  be 
modulated  by  periodic  pulses  or  pulse  trains. 
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Table  1.    Super  stable  sequences  of  the  parametrically  perturbed 
logistic  map.  The  super  stable  value  of  A*  is  indicated  by  A*. 


q                                         Sequences 

A* 

3      k  =  3                         L2 

0.6672028 

k  =  6                         L2RL2 

1.008131 

4      fc  =  4                          L3 

0.5871376 

*RL2 

0.6724169 

L2R 

0.9041113 

5      k  =  5                        L4 

0.5268982 

6      fc  =  6                         L5 

0.4899559 

*RL4 

0.5539654 

L4R 

0.7129956 

7      fc  =  7          '                L6 

0.4855321 

*RL5 

0.59889335 

L4R2 

0.6514045 

*RL4R 

0.80803725 

8      k  =  8                        L7 

0.4585666 

*RL6 

0.5080377 

L5R2 

0.5843965 

R3L4 

0.655407 

L4R3 

0.81519066 

L4RLR 

0.92404916 

9      Jk  =  9                        L8 

0.4621206 

RL7 

0.4633298 

R2L6 

0.5403486 

R3L5 

0.7149534 

*RL5R2 

0.720069367 

RL5R2 

0.80574375 

L4R4 

0.85825292 

*RL4R3 

0.9043900 

10      k  =  10                       L9 

0.44201028 

*RL8 

0.48165978 

L7R2 

0.508807 

R3L6 

0.57816015 

L5R4 

0.691275 

*RL5R3 

0.7179589 

RLRL5R 

0.7626641 

RLRLRL4 

0.7773538 

L4R2LRL 

0.8562051 

*  Indicates  the  cycles  that  occur  in  the  MSS  sequences.  If  k  is  the 

period  of  the  cycle  the  sequence  contains  (k  - 

1)  symbols. 

4.  Dynamics  of  a  combination  map 

The  combination  map  is  obtained  by  combining  the  logistic  map  with  a  sinusoidal  map. 
Thus  we  have  a  two-parameter  one  dimensional  map  given  by  [23,24] 

,  A,A)  =  4AXn(l  -XB)  -  Asin(nXn).  (7) 
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Chaos  in  modulated  logistic  systems 


X     -» 


Figure   4.     The   combination   map   for    A  =  1.    The   curves   correspond   to 
A  =  0,0.5,  S/TT^S/TT  and  1  in  that  order  from  top  to  bottom. 


On  the  right  hand  side  both  the  terms  belong  to  the  same  universality  class,  viz,  the 
quadratic  family.  The  motivation  for  studying  the  map  (7)  is  to  see  whether  the  map  still 
has  all  the  characteristics  of  the  quadratic  family.  We  shall  show  that  it  has  the  same 
universal  constants  as  the  logistic  map,  however,  the  scaling  index  of  the  Lyapunov 
exponent  at  the  transition  point  is  different. 

The  map  (7)  has  an  extremum  at  x  =  1/2  which  is  a  second  order  maximum  for 
(A  —  1)  <  A  <  8A/-7T2  while  it  is  a  minimum  for  8A/7T2  <  A  <  X  and  there  is  a  point  of 
inflexion  at  x=l/2  for  A  =  8A/7T2.  Consequently,  /(Xn,A,A)  is  one  humped  for 
(A  -  1)  <  A  <  8A/7T2  and  two  humped  for  8A/7T2  <  A  <  A.  It  can  easily  be  seen  that  the 
parameter  A  must  lie  between  (A  —  1)  and  A  so  as  to  keep  the  iterates  of  (7)  within  the 
unit  interval  (0,  1).  The  function  f(x,  A,  A)  is  plotted  for  various  values  of  A  in  figure  4. 
In  figure  5,  we  give  the  complete  bifurcation  structure  for  the  map  by  keeping  A  =  1  and 
by  changing  A  in  steps  of  0.001.  As  the  parameter  A  is  slowly  tuned,  the  system  retraces 
the  entire  period  doubling  route  to  chaos  in  the  reverse  order  and  finally  settles  down  to  a 
one-cycle  for  A  >  0.2435.  One  can  also  consider  negative  values  of  A  which  would  bring 
the  system  from  a  state  of  periodicity  to  a  state  of  chaos.  By  changing  A,  the  value  AOO  of 
A  at  which  the  system  makes  transition  from  chaos  to  order  and  vice  versa,  is  determined 
by  calculating  the  Lyapunov  exponent  and  by  noting  the  value  of  A  at  which  it  changes 
sign.  The  thick  line  figure  6  represents  the  value  of  A  as  a  function  of  A.  The  lines  parallel 
to  it  (only  two  are  shown  in  figure  6)  represent  the  bifurcation  lines  along  which  each 
period  doubling  occurs.  The  shaded  region  corresponds  to  the  chaotic  regime  of  the 
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Figure  S.    Bifurcation  structure  of  the  combination  map  with  A  =  1. 


0.50 


Figure  6.  The  parameter  space  (A,  A)  of  the  combination  map.  The  thick  line 
represents  the  transition  from  order  to  chaos  while  the  lines  parallel  to  it  are  the 
bifurcation  curves. 


system  where  the  Lyapunov  exponent  is  generally  positive  except  for  windows  of 
periodicity  within  the  chaotic  regime.  The  lowermost  line  represents  the  minimum  value 
of  A,  namely  (A  —  1)  for  each  A.  By  extending  this  line  upwards  we  can  increase  the 
value  of  A  beyond  A  =  1  by  taking  suitably  high  value  of  A.  Also,  since  the  bifurcation 
curves  in  the  (A,  A)  space  are  parallel  lines,  it  is  clear  that  the  Feigenbaum  constant  S 
defined  in  terms  of  the  bifurcation  values  An  for  fixed  A  must  be  the  same  as  that  for  the 
logistic  map  alone. 


Table  2.    The  scaling  index  v  and  accumulation  point  Ac 
for  the  combination  map  considered  in  §  4. 


A 

•A  00 

V 

0.75 

-0.1382968450 

0.9989838 

0.875 

-0.0170178750 

0.9677731 

1.0 

-0.1046909450 

0.9936844 

8 

7.2212675805 

0.9996790 

To  see  whether  the  combination  map  has  all  the  characteristics  of  the  quadratic 
maximum  we  have  investigated  the  scaling  property  of  the  Lyapunov  exponent  as  a 
function  of  |A  —  A^.  For  a  one-hump  map  the  nature  of  this  scaling  near  the  period 
doubling  accumulation  point  has  been  worked  out  by  Hubermann  and  Rudnick  [25].  They 
have  shown  that  the  Lyapunov  exponent  a  follows  the  relation 

cr~  (fl-floo!",  (8) 

where  a^  is  the  accumulation  point  for  the  period  doubling  bifurcations  and  that 

v  =  In  2/  In  6,  (9) 

8  being  the  Feigenbaum  constant.  The  Hubermann-Rudnick  (HR)  relation  (9)  indicates 
that  v  depends  on  z,  the  order  of  the  map,  through  the  value  of  S  which  is  different  for 
maps  belonging  to  different  universality  classes  (different  z)  [26].  Since  8  is  the  same  for 
the  combination  map  one  expects  the  same  value  for  v  for  the  combination  map  also.  To 
test  this  we  have  determined  v  and  A  for  the  combination  map  and  the  results  are  shown 
in  table  2  [24]. 

It  is  clear  from  the  table  that  v  for  the  combination  map  is  almost  double  the  value  of 
1/2  for  the  quadratic  family  as  predicted  by  the  HR  relations  and  also  computed 
numerically  for  the  logistic  map.  Thus,  for  the  combination  map  which  also  has  a 
quadratic  maximum  at  x  =  1/2  does  not  satisfy  the  HR  relations.  Numerical  studies  have 
shown  that  a  series  of  bifurcation  for  the  chaotic  band  take  place  as  A  — >  AQQ  [23]. 
However,  this  cascade  of  band  bifurcations  does  not  take  place  ad  infmitum.  This 
incomplete  nature  of  the  cascade  for  the  combination  map  could  be  a  possible  reason  for 
the  violation  of  HR  relations  for  the  map. 

5.  Concluding  remarks 

In  this  review  we  have  considered  the  dynamics  of  modulated  logistic  systems. 
Specifically  we  have  discussed  three  different  systems,  viz,  the  modulated  logistic  map, 
the  parametrically  perturbed  logistic  map  and  the  combination  map  obtained  by 
combining  two  maps  in  the  same  quadratic  family.  Each  of  these  systems  shows  several 
interesting  features,  such  as  the  enslavement  of  the  periodicities  (in  MLM),  occurence  of 
superstable  sequences  other  than  the  ones  included  in  me  MSS  classification,  the 
reappearance  of  periodicity  after  a  chaotic  regime  (in  the  parametrically  perturbed 
logistic  map)  and  the  violation  of  the  Hubermann-Rudnick  scaling  relations  for  the 
Lyapunov  exponents  (in  the  combination  map). 
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Abstract  The  invariant  density  of  one-dimensional  maps  in  the  regime  of  fully-developed  chaos 
with  uncorrelated  additive  noise  is  considered.  Boundary  conditions  are  shown  to  play  a  significant 
role  in  determining  the  precise  form  of  the  invariant  density,  via  the  manner  in  which  they  handle 
the  spill-over,  caused  by  the  noise,  of  orbits  beyond  the  interval.  The  known  case  of  periodic 
boundary  conditions  is  briefly  recapitulated.  Analytic  solutions  for  the  invariant  density  that  are 
possible  under  certain  conditions  are  presented  with  applications  to  specific  well-known  maps.  The 
case  of  'sticky'  boundaries  is  generalized  to  're-injection  at  the  nearest  boundary',  and  the  exact 
functional  equations  determining  the  invariant  density  are  derived.  Interesting  boundary  layer 
effects  are  shown  to  occur,  that  lead  to  significant  modifications  of  the  invariant  density 
corresponding  to  the  unperturbed  (noise-free)  case,  even  when  the  latter  is  a  constant  -  as  illustrated 
by  an  application  of  the  formalism  to  the  noisy  tent  map.  All  our  results  are  non-perturbative,  and 
hold  good  for  any  noise  amplitude  in  the  interval. 

Keywords.  One-dimensional  maps;  fully  developed  chaos;  invariant  density;  boundary  condi- 
tions; noise. 
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1.  Introduction 

Random  noise  is  almost  inevitably  present  in  systems  of  physical  interest.  Its  effects  on 
the  evolution  of  these  systems  in  time  have  long  been  recognized  as  worthy  of 
investigation,  and  chaotic  dynamics  is  no  exception.  From  the  earliest  days  of  the  modern 
phase  in  the  study  of  deterministic  chaos,  there  has  been  interest  in  analyzing  the 
consequences  of  the  inclusion  of  noise  in  chaotic  flows  and  maps.  The  broad  features  of 
the  resulting  dynamics  are  understood  reasonably  well  (see  [l]-[5]  and  references 
therein). 

In  this  paper,  we  re-examine  the  problem  of  one-dimensional,  fully  chaotic  discrete 
dynamics  with  additive  noise,  with  the  aim  of  highlighting  an  aspect  that  has  not,  in  our 
opinion,  received  the  attention  it  deserves:  the  "spill-over"  of  the  state  variable  beyond 
its  original  domain  (interval)  caused  by  the  additive  noise  -  equivalently,  the  effect  of 
different  boundary  conditions.  Studies  of  noisy  maps  have  for  the  most  part  focussed  on 
the  regimes  preceding  fully-developed  chaos,  where  this  spill-over  is  perhaps  not  so 
significant,  as  the  corresponding  attractors  do  not  comprise  the  entire  interval.  In  contrast, 
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we  restrict  our  attention  to  the  case  of  fully-developed  chaos  -  in  particular,  the  effect  of 
specific  boundary  conditions  on  the  invariant  densities  of  noisy  maps.  In  general,  it  is 
found  that  different  boundary  conditions  lead  to  significant  differences  in  the  form  of  the 
invariant  density.  As  the  latter  quantity  is  a  central  one  in  the  statistical  description  of  the 
systems  concerned,  we  are  led  to  the  conclusion  that  the  conjunction  of  noise  and 
boundary  conditions  must  be  handled  with  care. 

In  §  2,  we  recapitulate  the  formulation  of  the  problem  along  the  lines  best  suited  for 
what  is  to  follow.  In  §  3,  we  consider  the  case  of  periodic  boundary  conditions,  the  one 
most  frequently  adopted  in  the  literature  [5]-[9].  After  a  brief  summary  of  the  formalism 
we  point  out  instances  where  a  complete  solution  is  possible,  provided  certain  conditions 
are  met.  In  §  4,  we  introduce  what  we  believe  is  a  realistic  boundary  condition,  namely, 
re-injection  at  the  nearest  boundary,  derive  the  exact  integral  equation  obeyed  by  the 
invariant  density,  and  analyze  the  structure  of  its  solution.  A  numerical  illustration 
is  provided  for  the  case  of  the  tent  map.  In  §  5,  we  consider  the  generalized  dimension  Dq 
corresponding  to  the  noisy  maps  investigated.  Section  6  contains  our  concluding  remarks. 

2.  Frobenius-Perron  equation  for  a  noisy  map 

Consider  the  one-dimensional  map  xn+\  =/(*„),  XQ  €  [a,b].  We  assume  that  the  map  is 
onto,  i.e.,  we  consider  the  case  of  'fully-developed'  chaos.  Well-known  examples  include 
the  logistic  map  /(*)  =  4*(1  -  x)  and  the  tent  map  f(x)  =  I  -  \2x  -  1|  in  [0,  1],  the 
square-root  cusp  map/(jc)  =  1  -  2\x$  in  [-1,  1],  and  so  on.  The  invariant  density  p°(x) 
of  the  unperturbed  map/  is  given  by  the  Frobenius-Perron  equation  [10] 

pQ(x]  =  PQ[pG(x}}  =      b  dy6(x  -/(y))p°(y).  (2.1) 


We  assume  that  the  operator  P0  is  known  explicitly  (via  the  functional  equation  to  which 
(2.1)  is  reducible),  as  also  the  solution  p°(x). 
Now  consider  the  noisy  map 

*n+i  =/(*„)+&,  (2.2) 

where  (£„)  =  0,  (£Bfn'}  =  (£2)<W  (uncorrelated  noise).  We  want  to  find  an  exact 
expression  for  the  invariant  density  in  this  case,  given  the  distribution  g(£)  of  the  noise. 
Earlier  treatments  either  ignore  the  complications  due  to  the  'spill-over'  of  /(*„)  +  £„  in 
some  realizations  to  values  outside  the  interval  [a,  b],  or  else  assume  [5]-[9]  that  #(£)  is  a 
periodic  function  with  fundamental  interval  [a,b].  (g(f)  is  generally  taken  to  be  a 
Gaussian  with  zero  mean,  or  an  infinite  sum  of  Gaussians  to  ensure  periodicity.)  Let  us 
denote  by  P  the  perturbed  Frobenius-Perron  operator  (i.e.,  the  one  corresponding  to  the 
noisy  map),  and  by  p(x)  the  corresponding  invariant  density,  averaged  over  the 
realizations  of  the  additive  noise.  A  formal  expression  for  the  kernel  of  the  operator  P  is 
well  known,  and  is  obtained  as  follows.  From  (2.2),  it  follows  that  the  noise-averaged 
density  at  time  n  evolves  according  to 


pn+l  =  /  d£  /  dy  6(x  -/(y)  -  0   pn(y) 

J  Ja 


(2.3) 
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Invariant  density  of  one-dimensional  maps 

This  is  an  exact  expression  if  the  noise  distribution  is  periodic,  but  imprecise  if  it  is  not, 
precisely  because  the  spill-over  across  the  boundaries  a  and  b  has  been  ignored  (nor  is  it 
immediately  clear  what  the  effects  of  this  neglect  will  be).  Formally  performing  the 
integration  over  £  using  the  <5-function,  one  gets  for  the  invariant  density  the  perturbed 
Frobenius-Perron  [3,  6,  10] 

/>(*)  =  P[p(x}}  =  f  dy  g(x  -/(v))  p(y).  (2.4) 

Ja 

Comparision  with  the  noise-free  case,  (2.1),  shows  that  the  effect  of  the  noise  is  to  replace 
the  ^-function  kernel  of  PQ  with  the  noise  distribution  function  g. 

3.  Periodic  boundary  conditions 

3.1  Fourier  expansion  method 

The  convolution  form  of  (2.4)  suggests  a  solution  in  terms  of  a  Fourier  expansion,  and 
this  is  what  has  been  done  in  earlier  treatments  [6,  7],  taking  g  to  be  a  periodic  function  of 
its  argument.  As  mentioned  earlier,  this  is  appropriate  when  periodic  boundary  conditions 
are  assumed.  We  may  then  write 


nexP(/U),  (3.1) 

tt=—  OO 

where  kn  =  2irn/(b  —  a}.  The  inversion  formula  is 

b 

U).  (3.2) 


r 

/ 

Ja 


Expanding  p(x)  in  a  similar  fashion,  i.e., 

CO 

p(0  =  (b  -  ay1          ~pn  exp  (ikn®,  (3.3) 


n=— oo 

we  obtain  from  (2.4)  the  relation 

~      X     ~*      ~      C          ~  /'i    A\ 

J7J=— OO 

where 

-i    fb 
Snm  =  (b-a)       I    dy  exp  i[kmy  -  knf(y}].  (3.5) 

Ja 

The  effects  of  the  noise  are  contained  in  {#„}.  The  noise-free  case  is  recovered  by  setting 
gn  =  1  for  all  n,  corresponding  to  replacing  g(£)  by  <$(£).  The  normalization  conditions  on 
g  and  p  imply  that 

fb 

Ja 


Equation  (3.4)  can  be  written  [6,  7]  as  the  following  inhomogeneous  equation  for 


Pn  =  gnSnQ  +      T'  gn-Wm,       («  ^  0)  (3-7) 


where  £)'  is  over  non-zero  integers  m.  In  principle,  iteration  of  (3.7)  yields  the  formal 
solution 

Pn  =  gn  Sn0  +  'gB|m  Snm  5m0  +          '     P  ',gw£m£/  SnM  5m/  Sfl)  H  ----          (3-8) 


for  pn,  and  hence  for  p(x}.  Rapid  convergence  of  the  iteration,  at  least  in  the  fully  chaotic 
cases  considered  here,  helps  numerical  calculations  in  specific  instances  [6,  7].  Clearly,  it 
is  rather  difficult  to  extract  analytical  information  regarding  the  dependence  of  p(x)  on 
the  parameters  of  the  noise  distribution  from  the  formal  solution,  except  under  special 
circumstances.  To  reconstruct  p(x)  in  closed  form,  one  must  find  pn  exactly  and  then  sum 
the  Fourier  series.  In  fact,  this  procedure  is  complicated  enough  even  in  the  noise-free 
case  (gn  =  1)  -  e.g.,  for  the  logistic  map  *n+i  =  4xn(l  -  xn]  on  [0,  1],  5nm  is  expressible 
[6]  in  terms  of  Fresnel  integrals,  but  finding  pn  explicitly  and  summing  the  Fourier  series 
hardly  seems  to  be  the  easiest  way  to  obtain  the  well-known  result  [11] 
p°(x)  =  [T?X(\  -*)P-  However,  there  are  two  situations  in  which  (3.4)  (or  (3.7))  can 
be  solved  easily.  These  are  considered  below. 

3.2  SnQ  =  0;  application  to  the  Bernoulli  and  tent  maps 
If  the  quantity 

Sno  =  (b-  a}~1        dyexp  {-iknf(y}}  *  (3.9) 


happens  to  vanish  for  every  nonzero  integer  n,  then  (3.7)  reduces  to  a  set  of  homogeneous 
equations  for  {pn},  n  ^  0.  The  determinant  of  this  set  of  equations  will  be  nonzero  in 
general,  so  that  only  the  trivial  solution  pn  =  0,  n  =£  0,  exists.  This  means  that 

p(x)=p\x}  =  (b-arl  (3.10) 

in  this  case,  for  any  (periodic)  noise  distribution  g(£).  This  is  what  happens  for  the 
general  Bernoulli  map 

oan  ,  0  <  xn  <  I/a  ,_  _  . 

-l),     I/a  <*„<!,  t3'11) 


where  a  >  1.  The  same  result  holds  good  for  the  continuous  counterpart  of  (3.11), 
namely,  the  tent  map  in  which  the  second  segment  of  the  map  reads 
*n+i  =  a(l  -  xn)/(a  -  !).«  (For  a  =  2,  these  maps  are  the  usual  Bernoulli  map  and  the 
symmetric  tent  map,  respectively).  Another  example,  involving  a  map  that  is  not 
piecewise  linear,  is  the  antisymmetric  square-root  cusp  map  in  [-1,  1]  given  by 


0<jcn<l. 
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We  find  in  this  case 

/•i 

Snm  =  (-!)"        dycos  TT  (my- 2«v^),  •  (3-13) 

Jo 

so  that  SnQ  =  0.  This  leads,  as  before,  to  the  constant  normalized  density 
p(x]  —  p°(x)  =  1/2  for  both  the  unperturbed  map  and  the  noisy  map.  In  general:  under 
periodic  boundary  conditions,  additive  white  noise  does  not  alter  the  unperturbed 
invariant  density  p°  when  the  latter  is  a  constant.  We  shall  see  later  that  this  conclusion 
does  not  hold  good  if  the  boundary  conditions  are  changed. 

3.3  Snm  =  ±#71, -m>  SnQ  =  ^S-n,o'  application  to  the  cusp  map 

There  is  another  situation  in  which  an  explicit  solution  for  p(x)  is  feasible  in  the  above 
approach,  even  if  p°(x)  is  not  a  constant  (i.e.,  Sno  ^  0).  This  occurs  when  (i)  Snm  is  even 
[odd]  in  the  index  m,  (ii)  SnQ  is  odd  [even]  in  the  index  n,  and  further  (iii)  gn  =  #_„,  i.e.,  gn 
is  real  (which  happens  when  g(£)  is  symmetric  about  (b  —  a)/2).  It  is  evident  that  if 
conditions  (i)-(iii)  are  satisfied,  all  terms  beyond  the  first  on  the  right-hand  side  of  the 
iterative  solution  (3.8)  vanish  identically.  Moreover,  (Pn  =  SnQ  in  this  case  (since  the  noise- 
free  case  corresponds  to  gn  =  1).  We  are  therefore  enabled  to  arrive  at  the  solution 

~    ~     C      ~     ~0  fa   ]AJ\ 

Therefore 

oo 

p(x)  =  (b-a)~l  ^2  Sn  SnQ  exp  (iknx).  (3.15) 

n—— oo 

Inserting  the  definitions  of  gn  and  Sno,  and  using  the  relation 

)  =  6(z),  (3.16) 


«=— oo 

we  find  the  solution 

r-fc 


p(x)  =  (b-  a}-1  I  d£  /  dy  g($  6(x  -/(y)  -  £).  (3.17) 

J  a  Ja 

A  concrete  example  of  the  situation  just  described  is  provided  by  the  symmetric  square- 
root  cusp  map 


l-2\xn\       *€[-!,!].  (3.18) 

This  map  is  of  importance  as  a  model  of  intermittent  chaos,  arising  from  the  marginal 
stability  of  the  fixed  point  at  the  left  boundary  x  =  —  1.  This  is  reflected  in  its  invariant 
density  [12] 

p°(x)  =  (l-*)/2,  .(3.19) 

which  shows  how  the  probability  is  'piled  up'  towards  x  =  —  1.  We  have  in  this  case 


Snm  =  (-1)"  /  dy  cos  (7r/ny)  exp  (2imijy), 
Jo 


(3.20) 
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ia  wvt-n  111  m. 


/     t  \n  /•___%  —  1  ci  oi\ 

«0  ~  1-1}    (.17171)      ,  ^.Zl) 

which  is  odd  in  n.  Therefore  the  conditions  just  discussed  are  met,  and  we  have  p°  =  5«o, 
for  n  ^  0.  Together  with  pj  =  1,  this  leads  on  summing  the  Fourier  series  to  the  recovery 
of  the  known  result  (3.19)  for  the  invariant  density  p°(x)  of  the  unperturbed  map.  The 
point  we  now  make  is  that  this  form  of  the  invariant  density  can  undergo  significant 
modification  in  the  presence  of  noise.  For  a  symmetric  noise  distribution  in  [— 1, 1]  (so 
that  gn  =  #_„),  using  (3.6),  (3.14)  and  (3.21)  we  get  the  exact  result 


1 


.  „  _    sm  n  TTJC 


(3.22) 


The  'antisymmetry'  of  p(x]  about  the  value  \  is  at  once  evident,  as  is  the  fact  that 
p(0)  =  |.  Further,  if  p(-l)  =  p(l),  then  each  of  these  is  also  equal  to  ^  else,  their  mean 
value  is  equal  to  \.  In  order  to  exhibit  analytically  the  form  of  the  solution,  let  us  consider 
first  the  simple  case  of  a  rectangular  or  uniform  noise,  given  by  the  density 


(3.23) 


(a) 


114 


2.5 


1.5 


on 


0.5 


-0.5 


Figure  l(a).    The  rectangular  (uniform)  noise  distribution  K(f)  =  (2nTl9(ri 
for  77  =  0.2. 
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(b)       1 


Figure  l(b).  The  invariant  density  p(x)  for  the  symmetric  cusp  ma.pf(x)  =  I  —  2\xfi 
in  the  presence  of  additive  noise  distributed  as  in  figure  l(a),  under  periodic  boundary 
conditions.  The  dotted  line  depicts  the  invariant  density  (1  —  x)/2  of  the  unperturbed 
(noise-free)  map. 

in  the  fundamental  interval  [—1,1],  where  0  <  77  <  1.  The  amplitude  of  the  noise  is  thus 
limited  by  the  parameter  77.  Carrying  out  the  calculations  required,  we  arrive  at  the  result 


(3.24) 


i  [(2 -iT1) 


The  noise  distribution  (3.23)  and  the  invariant  density  are  shown  in  figures  l(a)  and  (b) 
respectively.  We  note  how  the  noise  suppresses  the  effect  of  the  marginally  stable  fixed 
point  of  the  map  at  x=  —  1.  Equation  (3.24)  is  an  exact  result  (under  the  periodic 
boundary  conditions  assumed)  that  is  valid,  in  fact,  for  any  value  of  the  amplitude  77  of  the 
noise.  As  77  — >  I,  p(x)  tends  to  the  uniform  distribution  p(x)  =  1/2,  as  one  might  expect 
under  the  circumstances. 

The  fact  that  p(x)  (3.24)  continues  to  remain  piecewise  linear  is  actually  an  artefact  of 
the  uniform  noise  distribution  (3.23).  A  smoother  modification  of  the  unperturbed  density 
(1  —  x)/2,  but  one  that  retains  the  general  features  of  p(x]  as  found  above,  is  provided  by 
the  one-parameter  family  of  unimodal,  normalized  noise  densities 


,..     ,       iz(cos  u£  —  cos  u)  r    ..     , 

£(£;^)=^4 — "- ^-          ££    -1    1 

2(sm  //  —  fj,  cos  ^,) 
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(3.25) 
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Figure  2(a).  The  noise  distribution  #(£;/*)  of  (3.25)  in  the  limiting  cases 
fj,  =  ir  (3.27)  and  n  =  0  (3.29).  Also  shown  is  the  case  p,  =  ir/2,  i.e.,  #(£;  ?r/2)  = 
(w/4)  cos( 


where  0  <  /j,  <  TT.  We  then  find  the  corresponding  invariant  density  to  be  given  by 

-  |x|)  -  (1  -  |x|)  sin/zl  /0  -^ 

v          ';  (3.26) 


the  corresponding  invariant  density  being 


,  —  sin  p,} 
Thus  p(-l)  =  p(0)  =  p(l)  =  1/2.  In  the  limiting  case  /z  =  TT,  we  have 

(3.27) 

(3.28) 

(3.29) 

p(x)  =  \(\  -x}  +  Jx(|x|  - x2}.  (3.30) 

In  this  case,  p(-l)  and  p(l)  remain  equal  to  1  and  0  respectively.  The  noise  distributions 
of  (3.27)  and  (3.29)  are  shown  in  figure  2(a),  while  the  corresponding  solutions  for  p(x) 
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The  limit  /i  =  0  is  exceptional.  It  corresponds  to  the  parabolic  distribution 


leading  to  the  invariant  density 


Figure  2(b).  The  .corresponding  invariant  densities  (3.26)  for  the  noisy,  symmetric 
cusp  map,  given  by  (3.28)  for  p  =  TT  and  by  (3.30)  for  (j,  =  0.  Also  shown  is  the  case 
//=7r/2,  for  which  p(x)  =  i(2  —  x  —  cos(7rx/2))  (x  >  0)  and  p(x)  =  \  (cos (TO/2)  -  x) 
(x  <  0). 


are  depicted  in  figure  2(b).  The  'evening  out'  effect  of  the  noise  is  quite  evident.  It  must 
be  borne  in  mind,  though,  that  these  results  correspond  to  relatively  large  stochastic 
perturbations  of  the  original  dynamics:  the  standard  deviation  of  the  noise  for  the 
distribution  (3.29)  is  l/v%  «  0.45,  while  that  for  (3.27)  is  <J(l/3)  -  (2/Tr2)  w  0.36.  The 
Lyapunov  exponent  A,  which  is  equal  to  1/2  for  the  unperturbed  map  [12],  is  also  altered 
in  the  presence  of  noise:  we  have  In  [f(x}\  =  —  \  In  |jc|,  so  that  the  second  term  on  the 
right  in  eq.  (3.22)  does  not  contribute  to  A,  being  odd  in  x. 


3.4  Functional  equation  under  uniformly  distributed  noise 

We  present  in  brief  a  formalism  that  avoids  Fourier  series  and  leads  to  a  functional 
equation  for  the  invariant  density,  for  an  onto  map  in  the  fundamental  interval  [a,  b] 
perturbed  by  additive  noise  £,  under  periodic  boundary  conditions.  This  may  be  a  useful 
approach  in  numerical  calculations,  especially  in  situations  in  which  the  noise  is  not  just  a 
small  perturbation. 
Returning  to  (2.3),  we  insert  the  factor  1  =  Jflfc  du  6(u  -/(y))  on  the  right-hand  side: 

Pn+l(x)  =  /d£  f  dy  f  du8(X-f(y)-$6(u-f(y))  g(Q  pn(y) 

J  J  a  J  a 
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V  Balakrishnan  et  al 

-b 


=  I  d£  /    du  6(x  —  u  —  £) 


du8(u-f(y)}  pn(y) 
PQ\pn(u)]t 


(3.31) 


using  the  definition  of  the  unperturbed  Frobenius-Perron  operator  PQ.  Therefore  the 
invariant  density  is  given  by 


p(x]  = 


Po[p(u)]. 


(3.32) 


For  brevity,  let  us  introduce  the  notation  PQ[P(U)\  =  p(u).  (Of  course,  fP(x)  =  p°(x] 
itself).  Thus 


(3.33) 


For  the  sake  of  definiteness  (so  as  to  be  able  to  write  all  formulas  explicitly),  we  consider 
henceforth  the  case  of  uniformly  distributed  noise,  as  in  (3.23),  with  0  <  77  <  (b  —  a)/2. 
For  periodic  boundary  conditions,  the  support  of  the  ^-function  in  (2.3)  must  be 
(/(y)  +  £)  mod(fc  -  a),  i.e.,  the  support  of  6(x-u-  f  )  in  (3.33)  is  w  =  (*-£) 
mod  (b  —  a).  Writing  6mod(x  —  u  —  £)  to  keep  track  of  this  fact,  we  have 


~  u  - 


-i   f  f 

p(x)  —  V'ri]     I  d£#(r?  ~  l£l)  / 

J  Ja 

Carrying  out  the  integration  over  £,  we  find: 

-i  r  fx+ri          fb  i 

(2rj)     \          dup(u}+  dup(u]  ,     for  a  <  x  <  a  +  r/, 

\Ja  Jx—n+b—a  J 


(3-34) 


p(x)  = 


(277) 


-1 


du  p(u)  , 


for  a+  rj  <  x  <  fe-  77, 


/     dup(u}-\-  I  dup(u] 

JX—TI  Ja 


(3-35) 

Equation  (3.35)  must  be  solved  numerically  to  find  p(jc). 

It  is  evident  that  p(x}  is  continuous  at  all  points  hi  the  interval,  including  the  points 
a  +  r)  and  b-rj.A  convenient  way  of  writing  the  functional  equation  (3.35)  is  hi  terms  of 
the  derivative  of  p(:t):  we  have 


(335) 


(3.37) 


I  P(X  ~  *?)  ~  P(X  +  77  -  b  +  a]  ,     b  -  ri  <  x  <  b. 
The  discontinuity  of  p'(x)  at  x  =  a  -f  TJ  and  x  -  b  -  r)  is  given'  by 
disc  p'(a  +  r})  =  -disc  p'(b  -rf)  =  (2^  \p(b)  -  p(a)}. 


Pramana  -  .7.  Phvs..  Vol.  48.  Nn.  1.  Innnnrv  100?  n><»*+  T* 


This  relation  may  be  verified  for  the  cusp  map  using  the  solution  (3.24)  already  found  for 
the  invariant  density. 

4.  Sticky  boundaries:  Re-injection  at  the  nearest  boundary 

4.  1   General  equation  for  the  invariant  density 

While  periodic  boundary  conditions  constitute  a  convenient  way  of  handling  the  spill- 
over problem  in  a  noisy  map,  other  solutions  are  also  possible  [5].  These  include 
reflecting  boundary  conditions,  or  working  in  the  infinite  domain  (—00,00)  so  that 
natural  boundary  conditions  can  be  employed.  Yet  another  approach  is  to  tailor  the  noise 
distribution  by  making  it  state-dependent,  such  that  the  probability  of  an  escape  of  the 
dynamical  variable  out  of  the  interval  is  zero.  This  means  that  the  noise  is  actually 
multiplicative  rather  than  additive,  with  a  density  g(£,  x)  satisfying  g(£,  #)  =  #(£,  b}  —  0. 
One  way  of  implementing  this  is  to  assume  g(£,Jt)  to  be  of  the  form  [13] 


0,  otherwise. 

While  it  is  conceivable  that  this  and  other  such  prescriptions  might  be  justified  on 
physical  grounds  in  certain  instances,  the  very  modelling  of  the  stochastic  forces  on  a 
system  by  means  of  multiplicative  noise  may  be  quite  inappropriate  in  others,  particularly 
if  the  noise  arises  from  sources  that  are  quite  decoupled  from  the  system  dynamics. 

A  model  that  appears  to  be  rather  more  plausible  on  physical  grounds  is  provided  by 
'sticky'  boundaries:  once  the  dynamical  system  reaches  a  boundary  value,  it  stays  put  at 
that  value,  until  the  noise  moves  it  away.  More  generally,  instead  of  escape  out  of  the 
interval  owing  to  the  addition  of  noise  to  the  evolution,  we  may  assume  re-injection  at  the 
nearest  boundary.  This  avoids  the  arbitrary  clipping  of  the  orbit  of  jcn  when  /(#,,)  -j-  £  goes 
out  of  [a,  b]  by  re-defining  the  noisy  map  as  follows.  Denoting  f(xn]  +  £  by  F(xn,  £„),  we 
have 


a  ifF(jtB,&)<fl,  (4.2) 

b  if  *•(*,,&)>*• 
Therefore  if  the  representative  point  overshoots  the  boundary  at  a  or  at  b,  it  is  replaced 
at  that  boundary,  remaining  trapped  there  until  it  is  re-injected  into  the  phase  space 
by  the  noisy  map.  (This  is  so  even  if  a  or  b  is  a  fixed  point  of  the  unperturbed  map.) 
The  end  points  thus  act  as  temporary  traps  that  prevent  any  loss  of  'measure'  due  to 
overflow. 

As  before,  we  are  interested  in  the  invariant  density  p(x),  rather  than  the  time- 
dependent  density  pn(x).  The  counterpart  of  (2.3)  for  the  map  (4.2)  is,  in  the  limit  n  —  >  oo, 


+  6(x  -  a)0(a  -  F(v,  0)  +  *(*  -  b)0(F&  0  -  b}\p(y}g($.       (4.3) 
We  note  that  the  re-injection  prescription  in  (4.2)  leads,  in  general,  to  ^-functions  in  the 
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density  with  support  at  the  boundary  points.  Hence  p(x)  is  of  the  form 

p(x)  =  a6(x-a}  +  (38(x  -b)+p*  (x)  ,  (4.4) 

where  a  and  f3  are  (dimensionless)  constants,  and  p^(x}  is  the  'regular'  part  of  the 
invariant  density.  Substituting  (4.4)  in  (4.3),  we  get  the  coupled  equations 


,  0) 

(4.5) 
00(F(b,  0  -  ft) 

(4-6) 

P*  W  =      d£*(0  <*S(x  -  F(a,  0)  +  Wx  -  F(fr,  0) 
"  L 

+  I  dy«(*-F(y,0)p.ty)|.  (4.7) 

Jo  J 

As  before,  in  order  to  be  specific,  we  shall  take  g(£)  to  be  the  amplitude-limited  (uniform) 
noise  distribution  of  (3.23).  Further,  let  us  assume  that  the  map  /  is  a  continuous, 
unimodal,  onto  map,  with  f(y)  increasing  monotonically  from  y  =  a  to  y  =  ym,  and 
decreasing  monotonically  from  ym  to  b\f(a]  =f(b]  =  aj(ym)  =  b.  (The  modifications 
required  in  other  cases  can  be  worked  out.)  Let  y\^(z)  denote  the  two  roots  of 
/GO  =zG  [a,b].  We  have  y\(a)  =  a,  y2(a}  =  b,  y\z(b}=ym.  Then,  after  some 
simplification  (the  details  are  described  in  the  Appendix),  we  arrive  at  the  following 
equations  for  a,  j3  and  p*(x): 


y\(a+ri) 


(4.8) 
I      ryzfr-n) 

13  =^n          &y(n-b+f(y)}p*(y\  (4.9) 

*n  Jy\(b-rj) 


-a}-d(x-a-  77)] 


In  the  last  integral  on  the  right,  the  factor  1  =  jj  dw  <5(u  -/(y))  may  be  inserted  as 
before,  to  lead  to 

(Of  +  /?\  1        /"min^.jc-Hj) 

-—  -J[fl(x-a)-fl(x-a-7?)]  +  —  Aup*(u).        (4.11) 

'     '  •^' 
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rb 

a  +  (3+       p*(x)dx=l.  (4.12) 

Ja 

We  note  that  a  >  (3.  Moreover,  in  addition  to  the  ^-function  spikes  at  the  boundaries, 
there  occurs  an  extra  boundary  layer  contribution  on  the  left,  in  (a,  a  +  77).  There  is  a 
discontinuity  in  p(x)  at  x  =  a  +  77,  and  a  discontinuous  change  in  slope  at  x  =  b  -  77,  with 
the  respective  discontinuities  given  by 

(4.13) 


'(fc  -  77)  =  -p,(fr)/(277)  =  -~  f     dw  p.(M).  (4.14) 

^r  Jb-i 


ij 

The  Lyapunov  exponent  is  also  altered,  in  general: 


(4.15) 
We  illustrate  the  foregoing  in  the  case  of  the  tent  map  at  fully-developed  chaos. 

4.2  Application  to  the  tent  map 

The  map  concerned  is  f(x)  =  1  -  |2jc  -  1|,  x  €  [0, 1].  Thus  a  —  0,   b  =  I,  ym  =  3, 
yi(z)  =  z/2,  y2(z)  =  1  -  (z/2).  Further, 


and  the  noise-free  map  has  the  constant  invariant  density  p°(x)  =  1.  The  noisy  map  has  an 
invariant  density  of  the  form 

p(x)  =  a6(x)  +  06(x  -  1)  +  &(x),  (4.16) 

where 

1  (  n/2  Cl  1 

-{         dy(77-2y)p*(y)+  /          dy  (r)-2+2y}p*(y)},          (4.17) 

?7    Jo  J(\-nm 


.  |2y-l|)  p»(y),  (4.18) 

*H  J(l-rj)/2 

and 
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Figure  3(a-b).    (Continued) 
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Figure  3.  Numerical  solution  and  comparision  with  simulation  for  the  invariant 
density  p(x)  —  aS(x)  +  06(1  -  x)  +  p*(x]  of  the  noisy  tent  map  with  noise  amplitude 
T)  =  0.2.  (a)  Solution  after  2  iterations,  starting  with  the  zeroth  approximation 
p#(x)  =  l.  The  solution  essentially  stablizes  (correct  to  3  decimal  places)  in  3 
iterations,  (b)  Solution  after  10  iterations,  (c)  The  result  of  a  numerical  simulation  on 
the  noisy  tent  map  with  the  re-injection  prescription,  (4.2).  The  result  shown  is  an 
average  over  10  simulations  of  106  iterations  each,  with  a  bin  size  of  10~2. 


Equation  (4.19)  yields  the  following,  for  x  in  different  ranges: 

1     f"    f(x+r))/2 

2rj    +2^    Jo 


.  . 
<><*<„:*(,)- 


a 


1-77: 


£ 


•SI 


(4.20) 

3 

(4.21) 
(4.22) 


We  note  right  away  that  p(x)  =  1  or  even  p*(x)  =  constant  is  not  a  solution,  although 
p°(x)  =  1.  Equations  (4.17)-(4.22)  must  be  solved  numerically.  The  broad  features  of 
the  solution  can  be  seen  by  adopting  an  iterative  procedure:  if  we  take  p*(x)  =  1  in  the 
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zeroth  approximation,   we   obtain  the  following  as   the  first  approximation  (after 
normalization): 

3?*)  77 

a=='  ^'  (4>23) 


(4.24) 


Successive  iterations  push  the  'plateau'  in  p*(jt)  further  to  the  right.  The  actual  solution 
does  not  have  any  interval  in  which  p*(x)  is  a  constant,  but  rather  a  range  (77,  1  —  77)  in 
which  its  variation  with  x  is  gentle.  It  varies  linearly  in  the  ranges  (0,77), 
((1  +  77)72,1-77)  and  (1-77,77).  Figures  3  (a)  and  3(b)  show  how  the  solution  for 
Pt(x)  stabilizes  as  the  set  (4.17)-(4.22)  is  solved  iteratively,  in  the  case  77  =  0.2.  The 
discontinuity  in  p*(jc)  at  x  =  77  =  0.2  and  that  in  p((x]  at  x  —  1  -  77  =  0.8  are  also 
verified  to  satisfy  (4.13)  and  (4.14)  respectively.  The  foregoing  form  of  the  invariant 
density  has  been  completely  verified  by  direct  numerical  simulations  of  noisy  tent-map 
dynamics  with  the  prescription  of  re-injection  at  the  nearest  boundary  as  in  (4.2),  as 
shown  in  figure  3(c).  The  Lyapunov  exponent  remains  equal  to  hi  2  in  this  case,  by  virtue 
of  the  fact  that  |/'(jt)j  is  a  constant  and  the  invariant  density  is  normalized  to  unity 
according  to  (4.12). 


5.  Generalized  dimension  Dq 

We  have  commented  already  on  the  manner  in  which  the  Lyapunov  exponent  A  is  altered 
(or  remains  unaltered)  for  the  various  noisy  maps  considered.  In  the  same  vein,  we  may 
ask  how  the  generalized  dimension  Dq  is  altered  by  the  addition  of  noise.  This  quantity  is 
defined  as  [11,  14] 

/w(<0        \    / 
Dq  =  Urn  (q  -  I)'1  In   ;£>;)*     /me,  (5.1) 

\j=l  /  / 

where  -oo  <  q  <  oo.  In  the  present  context,  N(e)  =  (b-  a)/e,  while  the  measure  of  the 
j'th  cell  is  given  by 

ra+je 

M;  =  /  p(x)dx  w  ep(a  +  (j  -  l)e).  (5.2) 

Ja+-le 


Therefore 

W(«)  W(«) 

I>f  =  ^  E  e\p(a  +  (j  -  J)e)]«.  (5.3) 

j=l  7=1 

As  the  sum  on  the  right-hand  side  tends  to  S*[p(x)}qfo,  the  behaviour  of  Dq  is  governed 
by  the  integrability  or  otherwise  of  ffl(x\  i.e.,  by  the  nature  of  the  zeros  and  singularities 
of  p(x)  [14].  It  is  at  once  evident  that  Dq  =  D0  =  1  in  aU  cases  in  which  the  invariant 
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Invariant  density  of  one-dimensional  maps 

density  is  a  constant.  A  simple  non-trivial  case  is  once  again  provided  by  the  symmetric 
cusp  map  (3.18),  for  which  p°(x)  =  (1  -  jc)/2  (3.19).  Owing  to  the  vanishing  of  p°(x]  at 
x  =  1,  we  have,  for  q  >  —1, 

Dq  ->  limfe  -  I)"1  In  e^/lne  =  1,     (q  >  -1).  (5.4) 

For  q  <  -1,  on  the  other  hand,  we  find 

l),     (<?<-!)  (5.5) 


since  /^  «  ep(-l  +  (N  -  i)e)  =  e2/4,  on  setting  a  =  -1,  b  =  1,  We  =  b  -  a  =  2.  The 
question  is:  how  is  this  result  affected  by  the  addition  of  noise? 

In  §  3.3,  we  have  seen  how  the  unperturbed  density  p°(x)  =  (1  —  x)/2  is  modified  by 
various  kinds  of  noise  distributions  g(£)  under  periodic  boundary  conditions.  For  the 
purpose  at  hand,  the  relevant  point  is  that  the  noise  generically  eliminates  the  zero  of  the 
density  at  x  =  1,  without  introducing  any  fresh  zeroes  or  singularities  in  the  interval  (cf. 
(3.24)  and  (3.26)  for  /J-^G).  Therefore  Dq  remains  equal  to  1  for  all  q,  in  contrast  to  the 
noise-free  case.  The  case  p,  =  0,  corresponding  to  the  noise  distribution  (3.29),  is  an 
exception:  the  corresponding  density  p(x)  (3.30)  continues  to  have  a  simple  zero  at  x  =  1, 
so  that  Dq  is  given  by  (5.4)  and  (5.5),  as  in  the  noise-free  case.  The  question  can  be 
analyzed  for  a  general  noise  distribution  g(£)  by  examining  the  representation  (3.22)  for 
p(x)  in  the  vicinity  of  jc  =  1. 

We  turn  now  to  the  effect  on  Dq  of  the  other  boundary  condition  considered  in  this 
paper,  namely,  that  of  re-injection  at  the  nearest  boundary.  Again,  taking  the  tent  map 
f(x)  =  1  —  \2x  —  1|  in  the  unit  interval  as  an  illustration,  we  have  an  invariant  density 
p(x)  of  the  form  a6(x)  +  06(1  -x)  +  p*(*)  (cf.  (4.16)).  Breaking  up  the  interval  [0,  1] 
into  N  parts  with  Ne  =  1,  we  now  have 

M?  =  e/*.((7-i)<0,  2<j<(N-\),  (5.6) 

while 

Aii=a  +  ep,Q),  /i#  =  0  +  e^((tf-i)e).  (5.7) 

As  ex,  /3  >  0,  this  leads  to 


n        r    t       .  ^  ,,,. 

D,-,hm(,-l)     -  ^  -  .  (5.8) 

Although  p*(x)  has  a  finite  discontinuity  at  x  =  77  and  cusps  at  x=  (l+rj)/2  and 
x  =  (1-r?)  (we  consider  uniformly  distributed  noise  with  maximum  amplitude  77),  pi  (x) 
has  no  zeros  or  infinities,  and  is  integrable  in  [0,  1]  for  all  finite  q.  Therefore,  for  q  <  1  the 
factor  eq~l  dominates,  and  Dq  =  1.  On  the  other  hand,  this  factor  tends  to  zero  for  q  >  1, 
and  Dq  =  0.  For  q  =  1,  we  have 


P»V»H/>HSI  _   T    PIivc     Vnl    AA    KT/t    1     Tantiovv  1  QO7  rParf  TV 


V  Baiaknsnnan  et  al 
Using  (5.6)  and  (5.7),  we  find 

JV(e) 

,  (5.10) 


which  reduces  to 

D!  =  /  dxp^x}.  (5.H) 

Jo 

Collecting  these  results  and  using  (4.12),  we  have 

1,  «<1, 

tt-j8,        «  =  1,  (5-12) 


The  generalized  dimensions  of  the  attractor  corresponding  to  the  noisy  tent  map,  under 
the  prescription  of  re-injection  at  the  nearest  boundary,  are  thus  modified  considerably  in 
comparision  with  the  uniform  value  Dq  =  1  for  the  unperturbed  case. 

6.  Concluding  remarks 

We  have  shown  that  the  form  of  the  invariant  density  for  noisy  one-dimensional  maps  in 

the  fully  chaotic  regime  is  quite  sensitive  to  the  boundary  conditions  imposed.  The  latter 

are  necessary  to  prevent  the  arbitrary  omission  or  clipping  of  orbits  that  overflow  the 

interval  when  the  noise  is  added.  While  it  is  well  known  that  very  small  amplitude  noise 

helps  'smoothen'  the  invariant  density  in  general,  our  results  are  valid  for  arbitrary  noise 

amplitude,  and  show  that  the  invariant  density  at  fully  developed  chaos  can  differ 

significantly  from  that  in  the  noise-free  case.  The  effect  of  a  marginally  stable  fixed  point 

can  be  diminished,  as  was  shown  in  the  case  of  the  cusp  map.  On  the  other  hand,  an 

initially  uniform  invariant  density  can  develop  considerable  structure,  as  in  the  case  of  the 

tent  map  with  the  re-injection  prescription  used  in  §  4:  5-function  spikes  occur  at  the  end- 

points  of  the  interval,  and  the  unstable  fixed  point  at  x  =  0  is  the  source  of  an  enhanced 

boundary  layer  in  its  vicinity,  as  well.  Similar  effects  occur  in  the  case  of  other  maps  such 

as  the  cusp  map  and  the  logistic  map.  Our  results  suggest  that  care  must  be  exercised  in 

drawing  conclusions  from  numerical  studies  on  the  aspects  of  chaotic  dynamics 

considered  here,  as  the  combination  of  external  noise  and  boundary  conditions  can,  and 

do,  lead  to  non-trivial  and  sizeable  modifications  of  what  one  obtains  in  the  absence  of 

noise. 

Appendix 

We  begin  with  equations  (4.5)  to  (4.7)  for  a,  (3  and  p«(jc).  Using  the  rectangular 
distribution  (3.23)  for  g(£)  and  setting  /(a)  =/(&)  =  a,  (4.5)  for  a  yields 

a  =  $(«  +  /3)  +1  jT  d£  jf  dy0(a  -/(v)  -  0  p*(y}.  (Al) 
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for  £  <  0.  Now  the  inequality  f(y)  <  z  where  z  G  [a,b]  implies  that  y  e  [a,yi(z)]U 
\y2(z),b].  Therefore 


r 
/ 

Ja 


y\(°-£) 


/       dyp*  00 

Jy-ita-t} 


Changing  variables  to  — £  and  interchanging  the  order  of  integration,  we  have 

7J 


(A2) 


(A3) 


where  /i  and/2  are,  respectively,  the  ascending  and  descending  branches  of/.  Equation 
(4.8)  for  a  follows  immediately.  The  appropriate  branch  of  f(y]  in  the  integrand  of  that 
equation  is  automatically  selected  by  the  range  of  integration. 
Similarly,  (4.6)  yields 


.      (A4) 


As  77  is  certainly  less  than  b  —  a,  the  first  term  on  the  right-hand  side  does  not  contribute 
anything.  The  second  term  reduces  to 


0=1-    [\ 
s  71    i  f\ 

Interchanging  the  order  of  integration, 


rym  rr,  r 

J\\  (b-n]  Jb-fi  M  J  v. 


/>j 
- 


b-f2(y) 


,        (A6) 


where  the  peak  in/(y)  occurs  at  y  =  ym.  Equation  (4.9)  for  /?  follows  immediately. 
Finally,  (4.7)  becomes 


(A?) 


The  fkst  term  in  the  integrand  only  contributes  if  x  lies  between  a  and  a  +  77. 
Equation  (4.10)  therefore  follows. 
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Abstract.  There  exist  several  standard  numerical  methods  for  integrating  ordinary  differential 
equations.  However,  if  one  is  interested  in  integration  of  Hamiltonian  systems,  these  methods  can 
lead  to  wrong  results.  This  is  due  to  the  fact  that  these  methods  do  not  explicitly  preserve  the  so- 
called  'symplectic  condition'  (that  needs  to  be  satisfied  for  Hamiltonian  systems)  at  every 
integration  step,  hi  this  paper,  we  look  at  various  methods  for  integration  that  preserve  the 
symplectic  condition. 
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1.  Introduction 

In  this  paper,  we  consider  numerical  integration  methods  for  Hamiltonian  systems.  In 
particular,  we  would  be  interested  in  long-term  integration  of  these  systems.  In  these 
cases,  it  is  important  to  preserve  the  Hamiltonian  nature  of  the  system  at  every  integration 
step.  Otherwise,  one  can  get  spurious  damping  or  even  chaotic  behaviour  which  is  not 
present  in  the  original  system.  Such  behaviour  can  obviously  lead  to  wrong  predictions 
regarding  the  long-term  stability  of  the  Hamiltonian  system  being  studied. 

In  this  paper,  we  look  at  various  integration  methods  that  overcome  the  above  problem 
[1-20],  Such  integration  methods  go  by  the  name  of  symplectic  integrators.  In  §2,  we 
introduce  the  basic  concepts.  In  §3,  we  consider  the  generating  function  methods  for 
symplectic  integration.  Section  4  is  devoted  to  symplectic  Runge-Kutta  (RK)  and 
Runge-Kutta-Nystrom  (RKN)  methods.  Section  5  introduces  the  Lie  algebraic  methods 
for  integrating  Hamiltonian  systems.  A  better  formulation  of  the  generating  function 
methods  in  this  language  is  given.  The  jolt  map  factorization  method  for  symplectic 
integration  is  also  considered  in  some  detail.  Finally,  integration  using  solvable  maps  is 
discussed.  The  concluding  remarks  can  be  found  in  §  6. 


2.  Basic  concepts 


Consider  the  following  set  of  2n  first  order  differential  equations: 
dq_dH(q,p)        dp_      dH(q,p) 


(      } 

(    } 
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Figure  1.  Numerical  integration  results  using  a  non-symplectic  fourth-order  Taylor 
series  map  for  the  anharmonic  oscillator.  The  initial  condition  used  is  (q\  (0), 
/>,(()))  =  (0.8,  0.0). 


where  q,  p  e  Rn.  The  initial  conditions  are  given  by  q(to)  =  qo  and  p(to)  =  PQ. 
The  above  system  of  differential  equations  defines  a  Hamiltonian  system  where 
the  Hamiltonian  function  H  is  a  function  of  the  variables  qt,  pi  (i  =  1,2,  ...,n). 
The  variables  qi  and  pf  constitute  the  phase  space  of  the  Hamiltonian  system. 
Typically,  <jj's  are  the  (generalized)  coordinates  and  p,-'s  the  (generalized)  momenta  of 
the  system. 

Consider  the  Poisson  bracket  [,]    of  two  phase  space  functions  f(q,p)  and  g(q>p) 
defined  as 


dz     Of 

ir-ir 

^  \dqi  dpi     dpt 


(2-2) 


Hamiltonian  systems  possess  the  remarkable  property  that  they  preserve  the  fundamental 
Poisson  bracket  [qi,pj],  that  is 


Vi,;. 


(2.3) 


Equivalently,  Hamiltonian  systems  preserve  the   symplectic  2-form  dpAd^.   This 
condition  is  called  the  symplectic  condition. 

In  general  H  is  a  complicated  function  of  #'s  and  p's.  Consequently,  the  resultant 
equations  of  motion  [cf.  eq.  (2.1)]  are  nonlinear  ordinary  differential  equations. 
Generically,  such  systems  of  nonlinear  equations  do  not  admit  an  analytic  solution, 
that  is,  the  system  is  non-integrable.  Hence,  one  is  forced  to  integrate  these  sets  of 
ordinary  differential  equations  numerically.  There  exist  several  standard  methods  for 
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Figure  2.  Numerical  integration  results  using  the  exact  symplectic  map  for  the 
anharmonic  oscillator.  The  initial  conditions  used  are  (<?i(0),/>i(0))  =  (0.8,0.0), 
(1.2,0.0),  (1.6,0.0),  and  (2.0,0.0). 

numerical  integration  of  a  system  of  first  order  ordinary  differential  equations.  However, 
these  are  general  purpose  methods  and  are  not  geared  explicitly  towards  Hamiltonian 
systems.  In  particular,  they  do  not  preserve  the  symplectic  condition  at  every  step.  For 
short-term  integration  this  does  not  lead  to  much  problems.  For  long-term  integration, 
these  non-symplectic  methods  can  be  a  disaster.  Since  the  symplectic  condition  is  not 
preserved,  even  the  qualitative  nature  of  the  solutions  obtained  by  these  methods  can  be 
very  different  from  that  of  the  exact  solution.  For  example,  one  can  get  spurious  damping 
or  chaos  where  there  is  none.  This  can  lead  to  wrong  deductions  regarding  the  long-term 
stability  of  the  Hamiltonian  system  in  question. 

An  example  of  how  things  can  go  wrong  is  shown  in  figure  1.  This  figure  gives  the 
result  of  integrating  the  nonlinear  Hamiltonian  system  defined  by  the  following 
Hamiltonian  using  a  non-symplectic  method 


H  = 


24 


(2.4) 


Looking  at  the  figure,  one  might  conclude  that  chaotic  behaviour  is  present  in  the  system. 
However,  the  exact  solution  of  the  system  exhibited  in  figure  2  shows  that  this  is  not  true. 
One  way  to  avoid  such  a  situation  while  using  non-symplectic  methods  is  to  reduce  the 
step  size  so  that  the  symplectic  violation  is  very  small  at  each  step.  However,  the 
computational  cost  of  such  a  procedure  for  long-term  numerical  integration  would  be 
prohibitive.  Therefore,  one  is  lead  to  look  for  integration  schemes  which  explicitly 
preserve  the  symplectic  condition  at  every  step.  Such  schemes  are  called  symplectic 
integration  methods  and  we  look  at  some  of  them  in  the  following  sections. 
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3.  Generating  function  methods 

Symplectic  integration  methods  were  first  discovered  by  DeVogelaere  [1].  However, 
these  results  were  not  widely  known.  In  1983,  Ruth  [2]  independently  discovered 
symplectic  integration  methods  using  canonical  generating  functions. 

The  basic  idea  behind  this  method  is  as  follows.  For  simplicity,  let  us  consider  a 
Hamiltonian  given  by  the  special  form 

H(q,p)=;A(p}+V(q}.  (3-1) 

Ideally,  we  would  like  to  make  a  canonical  transformation  such  that  the  new  Hamiltonian 
H'  in  the  new  variables  is  identically  zero.  However,  this  is  not  practical.  Hence,  we 
attempt  to  make  H'  zero  up  to  a  given  order  tk  [2,  3] 

H'(c{,p\t}  =  0(^},  (3.2) 

where  q'  and  p'  are  the  new  variables  after  the  canonical  transformation.  Using  the 
equations  of  motion  [cf.  eq.  (2.1)],  we  get 

}.  (3.3) 


Thus,  the  motion  is  simple  in  the  new  coordinates.  Once  this  is  accomplished,  we  can 
invert  the  canonical  transformation  to  get  the  motion  in  the  original  variables  (accurate  to 
order  fc).  Therefore  we  get  a  map  from  g(0),  p(O)  to  q(t),  p(t]  (again  accurate  to  order  k). 
Now  we  can  repeat  the  process  to  go  from  q(t],  p(t]  to  q(2t],  p(2t)  and  so  on.  Thus,  we 
have  an  integration  algorithm  (of  order  fc)  for  the  Hamiltonian  system  which  allows  us  to 
step  forward  in  time  starting  from  the  intitial  conditions.  Of  course,  we  will  typically  take 
the  time  step  t  at  every  stage  to  be  small  so  that  the  error  generated  (which  is  of  order 
f*+1)  is  small.  Since  canonical  transformations  explicitly  preserve  the  symplectic 
condition,  we  in  fact  have  a  symplectic  integrator  of  order  k. 

We  did  not  specify  above  how  the  canonical  transformation  is  to  be  performed.  This  is 
done  using  the  generating  function  method  [21].  We  use  a  generating  function  of  the  new  /* 

coordinates  and  the  old  momenta  [2,  3]  ; 


=  -<?'•  P  +  £(<?',/>,  0-  (3-4) 

Now  we  derive  a  simple  symplectic  integration  method  using  the  technique  described 
above.  Let  the  generating  function  be  specified  by  [2] 

G  =  -H(q',p}t  =  -[A(p]  +  V(q'}}t.  (3.5) 

Using  the  canonical  transformation  equations  appropriate  for  this  generating  function 
[21]  we  get 


Note  that  the  momentum  equation  would  in  general  be  an  implicit  equation  in  p  which 
has  to  be  inverted  to  obtain  p  as  a  function  of  p1.  This  can  be  done  explicitly  only  for 
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terms  or  q  and  p  as  follows  [2J: 

Consequently,  one  gets 

Thus,  we  have  succeeded  in  obtaining  a  symplectic  integration  method  of  order  1  for  our 
simple  Hamiltonian. 

Following  the  same  procedure,  one  can  in  principle  get  symplectic  integration  methods 
of  higher  order.  However,  it  is  not  easy  to  obtain  the  necessary  canonical  transformations. 
In  general,  one  uses  a  series  of  transformations  to  attain  the  final  goal.  For  example,  for 
the  special  Hamiltonian  given  in  (3.1),  a  fourth  order  symplectic  integration  method  is 
given  by  the  following  equations  [3] 

/  =  0,1,2,3,4.     (3.9) 


Here  the  index  /  labels  successive  canonical  transformations.  Moreover,  qW  =  qQ  and 
p(°)  =p0  (the  initial  conditions).  The  unknown  coefficients  C[,  di  are  determined  by  the 
condition  that  the  new  Hamiltonian  in  terms  of  the  final  canonical  variables  is  zero  upto 
order  f4.  One  possible  solution  is  given  below  [3,  5]: 

ci  =  c4  =  a  +  i,  c2  =  c3  =  -a,  d\=d$=2a+\t  d2  =  -4a  -  1,  c?4  =  0, 

(3-10) 

where  «  =  0.1756.... 

The  necessary  canonical  transformations  become  more  and  more  difficult  to  solve  for 
as  one  goes  to  higher  orders.  Further,  for  complicated  Hamiltonians  the  inversion  of  the 
momenta  equations  becomes  impossible  analytically.  One  has  to  invert  them  numerically 
(typically  using  Newton-Raphson  method)  with  its  attendent  convergence  problems. 
Also,  this  slows  down  the  integration  algorithm  considerably.  We  shall  see  that  these 
problems  are  solved  by  using  Lie  algebraic  methods. 

4.  Symplectic  RK  and  RKN  methods 

In  this  section,  we  discuss  Runge-Kutta  and  Runge-Kutta-Nystrom  methods  that  are 
explicitly  symplectic  [14-20].  We  start  with  the  set  of  2n  first-order  differential  equations 
for  a  Hamiltonian  system  [cf.  eq.  (2.1)]  which  we  reformulate  as  follows: 
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where  z  =  (q,p)  is  a  2n-vector  and  /  is  a  vector  function  specified  by  the  right  hand  side 
of  (2.1). 

An  s-stage  Runge-Kutta  method  to  integrate  the  above  system  of  differential  equations 
is  given  by  [20] 


Here  h  is  the  step  size  and  zn,  zn+i  are  the  values  of  phase  space  variables  at  the  end  of  the 
nth  and  (n  +  l)th  integration  steps  respectively.  The  coefficients  a,;  and  bj  have  specific 
values  depending  on  the  RK  method  under  consideration.  The  RK  method  is  said  to  be 
explicit  [20]  if  a^  vanishes  for  i  <  j.  In  this  case,  the  vectors  Zj  can  be  solved  recursively 
without  having  to  solve  any  implicit  equations.  The  method  is  called  diagonally  implicit 
if  ay  vanishes  for  i  <  j  and  implicit  otherwise.  It  can  be  shown  [17]  that  only  implicit  RK 
methods  can  be  symplectic.  These  are  symplectic  if  the  following  condition  is 
satisfied  [17]: 

hay  +  bjdji  -  bibj  =  0,     i,  j  =  1  ,  2,  .  .  .  ,  s. 
An  example  of  a  two-stage  implicit  RK  method  which  is  symplectic  is  specified  by 

1  !      V5  1     V3  1  ,      . 

a\i=a22  =  -,     fll2=--  —  ,     021  =4  +  -g->     bi=b2  =  2-         I4-3) 

This  can  be  shown  to  be  a  symplectic  integrator  of  order  4.  In  fact,  this  is  one  member  of 
a  family  of  symplectic  RK  methods  which  go  by  the  name  of  Gauss-Legendre  methods 
[20].  However,  these  are  computationally  costly  methods.  Iserles[16]  has  derived  more 
efficient  symplectic  RK  methods. 

Next,  we  consider  Runge-Kutta-Nystrom  (RKN)  methods.  For  simplicity,  we  restrict 
ourselves  to  Hamiltonians  of  the  form  H  =  p2/2  +  V(q).  The  equations  of  motion  for 
such  Hamiltonians  can  be  written  as 


where  /  is  the  gradient  of  -V.  The  above  system  can  be  solved  by  an  s-stage  RKN 
method  [20] 


7=1,2,...  ,*,  (4.4) 

(4.5) 

j 
qn+i  =qn+  hpn  +  h2  Y^  bjf(Qj).  (4.6) 


Symplectic  integration  of  nonlinear  Hamiltonian  systems 

This  method  is  symplectic  if  the  following  conditions  are  satisfied  [15,18] 

h  -  h  +  ha  =  0,     1  =  1,2,...,*,  (4.7) 

hay  -  bjaji  -  btbj  +  bfi  =  0,     i  =  1,2,  ...,;-  1;    ;  =  2,3,..  .  ,s.     (4.8) 

Unlike  RK  methods,  explicit  RKN  methods  can  also  be  symplectic. 

The  simplest  symplectic  one-stage  RKN  method  is  the  Stormer-Verlet  method  of  order 
2  specified  by 

an=0,     ci=0.5,     &i=0.5,     bi  =  1. 

The  above  method  is  used  extensively  in  molecular  dynamics  simulations.  Higher  order 
symplectic  RKN  methods  have  been  derived.  The  highest  order  method  currently 
available  is  the  explicit  13-stage  method  of  order  8  obtained  by  Calvo  and  Sanz-Serna 
[19].  Symplectic  RKN  methods  have  also  been  obtained  using  collocation  and  perturbed 
collocation  techniques  [20]. 

5.  Lie  algebraic  methods 

One  of  the  approaches  to  symplectic  integration  is  the  use  of  Lie  algebraic  methods  [3,  5, 
7-13].  Before  going  into  the  details  we  need  to  consider  some  mathematical  prelimi- 
naries. The  common  denominator  in  all  these  approaches  is  the  representation  of 
Hamiltonian  systems  by  symplectic  maps  [22]  which  are  then  iterated  to  obtain  the  time 
evolution  of  the  system. 

We  start  by  defining  certain  mathematical  objects.  For  simplicity,  we  restrict  ourselves 
to  three  degree-of-freedom  systems.  Let  us  denote  the  collection  of  six  phase-space 
variables  #/,  /?,-  (i  =  1,  2,  3)  by  the  symbol  z: 

3)-  (5-1) 


The  Lie  operator  corresponding  to  a  phase-space  function  /(z)  is  denoted  by:/(z):.  It  is 
defined  by  its  action  on  a  phase-space  function  g(z)  as  shown  below 

=  /(*):*(*)  =  [/(*),*(*)]•  (5-2) 

Here  [/(z),  g(z)]  denotes  the  usual  Poisson  bracket  of  the  functions  /(z)  and  g(z)  [cf.  eq. 
(2.2)].  Next,  we  define  the  exponential  of  a  Lie  operator.  It  is  called  a  Lie  transformation 
and  is  given  by 


l.  (5.3) 

n=0 

Powers  of  :  /(z):  that  appear  in  the  above  equation  are  defined  recursively  by  the  relation 


with 

:/(*):°*00=*(z).  (5-5) 

For  further  details  regarding  Lie  operators  and  Lie  transformations,  see  [22]. 


M  [22]  as  follows: 

(5-6) 


at 

Symplectic  maps   are  maps   whose  Jacobian   matrices  M(z)   satisfy  the  following 
symplectic  condition 

M(4/M(z)=7,  (5.7) 

where  M  is  the  transpose  of  M  and  J  is  an  antisymmetric  matrix  defined  as  follows: 


(  0      1      0      0      0      0\ 
-100000 
000100 
00-1000 
000001 

\  o    o    o    o  -i  oy 


(5-8) 


Matrices  M  satisfying  (5.7)  are  called  symplectic  matrices  and  the  corresponding  maps 
M  symplectic  maps.  Symplectic  maps  explicitly  preserve  the  symplectic  condition.  It  can 
be  shown  [22]  that  the  set  of  all  M's  forms  an  infinite  dimensional  Lie  group  of 
symplectic  maps.  On  the  other  hand,  the  set  of  all  real  6x6  symplectic  matrices  forms 
the  finite  dimensional  real  symplectic  group  Sp(6,  R). 

The  above  map  M  gives  the  final  state  z^  of  a  particle  after  one  time  step  as  a 
function  of  its  initial  state  z^: 


(5.9) 

To  obtain  the  state  of  a  particle  after  N  time  steps,  one  has  to  merely  iterate  the  above 
mapping  N  times  i.e. 

ZW=MNZM.  (5.10) 

Thus,  we  have  obtained  an  integrator  that  is  symplectic  (since  M.  preserves 
symplecticity).  Moreover,  we  have  the  values  of  phase  space  variables  after  one  time 
step  as  explicit  functions  of  the  initial  values. 


5.1  Generating  function  methods 

We  reformulate  the  generating  function  method  using  Lie  algebraic  techniques  [3].  For  a 
time-independent  Hamiltonian,  (5.6)  can  be  solved  formally  to  give 

>M(f)=exp(:-fH(z&):).  (5.11) 

If  one  could  solve  this  explicitly,  one  would  get  a  symplectic  integrator  valid  to  arbitrary 
order  since  symplectic  maps  explicitly  preserve  the  symplectic  condition.  However, 
unless  the  Hamiltonian  is  integrable,  one  can  not  evaluate  the  Lie  transformation  on  the 
right  hand  side.  On  the  other  hand,  for  special  Hamiltonians,  it  may  be  possible  to  split 
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*>  (512) 

tHt:)     i=l,2,  ^  '     ' 

where  the  symplectic  map  Mi  can  be  evaluated  exactly  (or  up  to  order  tk,  the  order  of  the 
symplectic  integration  method  that  we  wish  to  derive).  The  above  restriction  is  a 
drawback  of  this  method  (equivalent  to  the  invertibilty  of  momenta  equation  in  §  3).  We 
will  overcome  this  restriction  in  the  next  subsection. 

Under  the  assumption  that  H  can  be  split  as  above,  one  can  construct  symplectic 
integration  methods  as  follows.  Consider 

M(t}=M,(t}M2(t).  (5.13) 

Since  each  symplectic  map  Mi  is  a  canonical  transformation  [22],  AT  is  a  product  of  two 
canonical  transformations.  Therefore,  this  is  just  a  reformulation  of  the  generating 
function  method  considered  in  §3.  Using  the  group-theoretical  Campbell-Baker- 
Hausdorff  (CBH)  formula  [23],  one  obtains 

M(t)  =  Mi(t}M2(t]  =exp(:-tfTi  -tH2-  ti\Hl,H2}/2  +•••:).           (5.14) 
Using  eq.  (5.12),  we  get 

M(t)  =  M(t}  +  0(t2).  (5.15) 

Thus,  M(t)  is  a  symplectic  first-order  approximation  to  M(t)  and  consequently,  we  get 
z(t)  as  a  function  of  z(0)  accurate  to  order  t.  In  other  words,  we  have  succeeded  in 
obtaining  a  first-order  symplectic  integrator.  If  we  apply  this  to  the  H  given  in  (3.1)  with 
HI  —  A(p]  and  H2  =  V(q],  we  get  back  (3.6).  This  again  demonstrates  the  equivalence  of 
this  method  with  the  generating  function  method  described  in  §  3. 

To  obtain  a  fourth-order  symplectic  integrator,  we  start  with  the  relation  [3] 


where 

2(1  +  a)A  =  :  -tff,  :,      (2  +  /3)  =  :  -tH2:. 
If 

4a2-4o:-2 


—  2, 

(a+1)2 

we  get  [3] 


This  is  of  course  a  symplectic  integrator  of  order  4.  This  is  actually  the  same  integrator 
that  we  discussed  in  §  3  in  a  different  form. 

The  above  method  was  further  generalized  by  Yoshida  [7]  to  obtain  symplectic 
integrators  of  arbitrary  even  orders  for  Hamiltonians  of  the  form  given  in  (3.1). 

5.2  Symplectic  completion  of  symplectic  jets 

In  this  approach  [8-10,  12,  13],  symplecticity  is  achieved  by  refactorizing  the  truncated 
symplectic  map  using  the  so-called  'jolt  maps'. 
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We  start  by  factorizing  the  symplectic  map  M  [cf.  eq.  (5.6)]  representing  the 
Hamiltonian  system  using  the  Dragt-Finn  factorization  theorem  [22,  24]  from  Lie 
perturbation  theory: 

M  =  Me:/3:e:/4:  •  •  •  e:/":  -  •  •  .  (5.16) 

Here  M  gives  the  linear  part  of  the  map  and  hence  has  an  equivalent  representation  in 
terms  of  the  Jacobian  matrix  M(0)  of  the  map  M.  at  the  origin  [22]: 

Mzi=Mijzj=(Mz}i.  (5.17) 

Thus,  M  is  said  to  be  the  Lie  transformation  corresponding  to  the  6  x  6  matrix  M 
belonging  to  Sp(6,R}.  The  infinite  product  of  Lie  transformations  exp(:/n:) 
(n  =  3,4,  ...)  in  (5.16)  represents  the  nonlinear  part  of  M.  Here  fn(z)  denotes  a 
homogeneous  polynomial  (in  z)  of  degree  n  uniquely  determined  by  the  factorization 
theorem. 

One  can  not  use  M.  in  the  form  given  in  (5.16)  for  numerical  computations  since  it 
involves  an  infinite  number  of  Lie  transformations.  Using  the  perturbation  theory 
approach,  we  therefore  truncate  M  as  follows: 

M«Me:/3:e:/4:---e:/p:.  (5.18) 

However,  each  exponential  e:^:  in  M  still  contains  an  infinite  number  of  terms  in  its 
Taylor  series  expansion.  One  possible  way  around  this  is  to  truncate  the  Taylor  series 
generated  by  the  Lie  transformations  to  order  P  giving  the  following  truncated  map 
(denoted  by  Mp): 


•••(!  +  :  /,:  +  •••)*  (5.19) 

Here  the  power  series  is  truncated  in  such  a  way  that  the  highest  order  term  generated  is 
zp~l.  The  truncated  map  Mp  is  called  a  symplectic  jet  of  order  P  (for  a  more  precise 
definition,  see  [10]).  Despite  its  name,  MP  is  not  symplectic  because  of  the  truncation  of 
the  Taylor  series.  Therefore,  it  can  lead  to  spurious  damping  or  growth  (as  discussed 
earlier). 

The  basic  idea  behind  the  approaches  followed  [8,  10,  12]  is  to  refactorize  MP  [cf. 
eq.  (5.19)]  as  a  product  of  symplectic  maps  that  can  be  evaluated  exactly.  This  process  of 
refactorizing  MP  goes  by  the  name  of  'symplectic  completion  of  symplectic  jets'. 

We  begin  this  process  by  defining  jolt  maps.  Consider  the  symplectic  map  given  by 
Q-S(Z}-.  wnere  gfy  }s  a  falcon  Of  ^e  pnase  space  variabies  z.  it  is  called  a  jolt  map  if 
:g(z):  is  a  nilpotent  operator  of  rank  2  i.e.  if  the  following  condition  is  satisfied 

'•g(z)?z  =  Q.  (5.20) 

The  function  g(z)  is  then  called  a  jolt  function.  We  note  that  jolt  maps  have  only  two  non- 
zero terms  in  their  Taylor  series  expansions  [cf.  eq.  (5.3)].  Hence  they  can  be  evaluated 
exactly  without  any  truncation.  The  term  jolt  map  was  first  introduced  in  [12].  An 
example  of  jolt  map  [10]  is  given  below: 


Here/(#i ,  qi,  q-$)  is  an  nth  degree  polynomial  in  variables  q\ ,  #2,  and  #3  and  R  is  the  Lie 
transformation  corresponding  to  a  6  x  6  matrix  R  belonging  to  any  subgroup  of  Sp(6,R}. 
R  is  given  by  the  following  relation  [cf.  eq.  (5.17)] 

Rzi  =  RijZj  =   (Rz\.  (5.22) 

Now  we  can  formulate  the  problem  [8].  Given  the  map  jM/>,  find  another  map  J 
specified  by  the  following  product  of  K  jolt  maps 

J  =  Me'  3  4  p  '&'  3  *  p  •  •  •  •  e'  3  4  f  (5.23) 

such  that  this  map  agrees  with  M.p  to  order  P  i.e. 

J  si  MP  to  order  P.  (5.24) 

Here  gn  's  are  (homogeneous)  jolt  polynomials  of  degree  n  given  by  the  following  relation 

g(t)  _  ^(Ojfy^n     j  =  1,2,...,^,  (5.25) 

where  /%'  is  a  real  coefficient.  The  matrices  J?,  belong  to  a  subgroup  of  Sp(6,  R} 
(including  Sp(6,  R)  itself)  and  RI  denotes  the  Lie  transformation  corresponding  to  these 
matrices  [cf.  eq.  (5.22)].  Since  J  is  given  as  a  product  of  finite  number  of  jolt  maps,  it  can 
be  evaluated  exactly  without  any  truncation.  Thus  it  gives  a  symplectic  approximation  to 
M.p  accurate  to  order  P. 

By  comparing  terms  of  same  order  in  M.p  and  J,  the  problem  of  obtaining  a 
symplectic  completion  for  MP  can  be  reduced  to  the  following  general  problem:  Given  a 
nth  degree  homogeneous  polynomial/,  and  K  matrices  /?,,  find  the  coefficients  /3«    s  such 
that  the  following  condition  is  satisfied 
K 
V  fflRicR  =/„.  (5.26) 

/_      -f  n  •*.  I  *f 

Typically,  one  has  more  fin  's  than  there  are  equations.  So,  one  imposes  constraints  on 
these  coefficients.  In  [10],  the  sum  of  squares  of  these  coefficients  is  minimized.  A  more 
sophisticated  approach  can  be  found  in  [12,  13]. 

To  proceed  further,  following  the  approach  given  in  [10],  one  goes  to  the  continuum 
limit  of  the  above  problem.  We  get  the  following  continuum  problem:  Given  an  nth  degree 
homogeneous  polynomial /„  and  a  subgroup  G  ofSp(6,  R)  on  which  invariant  integration  is 
well  defined,  find  the  function  g(u)  such  that  the  following  condition  is  satisfied 

R(u)<ft.  (5.27) 

Here  u  denotes  a  general  element  of  the  group  G  and  R(u)  denotes  the  Lie  transformation 
corresponding  to  u.  All  integrations  are  invariant  integrations  performed  over  the  group 
G.  Again,  an  appropriate  constraint  on  g(u)  is  imposed. 

This  problem  has  been  solved  [10]  by  taking  the  group  G  to  be  St/(3).  The  solution  is 
given  by: 


(«)  =  E 


Here,  £/s  (all  of  which  can  be  shown  to  be  non-zero)  and  0^'s  are  the  coefficients 
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M  =  Afe:/3:e:/4=  •  •  •  e:/":  •  •  •  .  (5.16) 

Here  M  gives  the  linear  part  of  the  map  and  hence  has  an  equivalent  representation  in 
terms  of  the  Jacobian  matrix  Af(0)  of  the  map  M  at  the  origin  [22]: 

Mzi^MijZj  =  (Mz)i.  (5.17) 

Thus,  M  is  said  to  be  the  Lie  transformation  corresponding  to  the  6  x  6  matrix  M 
belonging  to  Sp(6,/?).  The  infinite  product  of  Lie  transformations  exp(:/«:) 
(n  =  3,4, . . .)  in  (5.16)  represents  the  nonlinear  part  of  M..  Here  fn(z)  denotes  a 
homogeneous  polynomial  (in  z)  of  degree  n  uniquely  determined  by  the  factorization 
theorem. 

One  can  not  use  M  in  the  form  given  in  (5.16)  for  numerical  computations  since  it 
involves  an  infinite  number  of  Lie  transformations.  Using  the  perturbation  theory 
approach,  we  therefore  truncate  M  as  follows: 

M«Afe:/3:e:/4:---e:/p:.  (5.18) 

However,  each  exponential  e:jf":  in  M.  still  contains  an  infinite  number  of  terms  in  its 
Taylor  series  expansion.  One  possible  way  around  this  is  to  truncate  the  Taylor  series 
generated  by  the  Lie  transformations  to  order  P  giving  the  following  truncated  map 
(denoted  by  M.p}: 


Mpz  = 

•  ••(!  +  : /,:  +  •••)*•  (5.19) 

Here  the  power  series  is  truncated  in  such  a  way  that  the  highest  order  term  generated  is 
z73"1.  The  truncated  map  JAp  is  called  a  symplectic  jet  of  order  P  (for  a  more  precise 
definition,  see  [10]).  Despite  its  name,  MP  is  not  symplectic  because  of  the  truncation  of 
the  Taylor  series.  Therefore,  it  can  lead  to  spurious  damping  or  growth  (as  discussed 
earlier). 

The  basic  idea  behind  the  approaches  followed  [8,  10,  12]  is  to  refactorize  MP  [cf. 
eq.  (5.19)]  as  a  product  of  symplectic  maps  that  can  be  evaluated  exactly.  This  process  of 
refactorizing  M.P  goes  by  the  name  of  'symplectic  completion  of  symplectic  jets'. 

We  begin  this  process  by  defining  jolt  maps.  Consider  the  symplectic  map  given  by 
Q:S(Z)'  where  g(z)  is  a  function  of  the  phase  space  variables  z.  It  is  called  a  jolt  map  if 
:g(z):  is  a  nilpotent  operator  of  rank  2  i.e.  if  the  following  condition  is  satisfied 

•0(7}  -27  =  0  fS  90^ 

•  oV.~y  •  *  —  "•  \j.z\j) 

The  function  g(z)  is  then  called  a  jolt  function.  We  note  that  jolt  maps  have  only  two  non- 
zero terms  in  their  Taylor  series  expansions  [cf.  eq.  (5.3)].  Hence  they  can  be  evaluated 
exactly  without  any  truncation.  The  term  jolt  map  was  first  introduced  in  [12].  An 
example  of  jolt  map  [10]  is  given  below: 

(5.21) 
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Here  f(qi,  q2,  #3)  is  an  nth  degree  polynomial  in  variables  q\,  qi,  and  #3  and  R  is  the  Lie 
transformation  corresponding  to  a  6  x  6  matrix  R  belonging  to  any  subgroup  of  5p(6,  jR). 
R  is  given  by  the  following  relation  [cf.  eq.  (5.17)] 

Rzi  =  RyZj  =   (Rz\.  (5.22) 

Now  we  can  formulate  the  problem  [8].  Given  the  map  Mp,  find  another  map  J 
specified  by  the  following  product  of  K  jolt  maps 


je+--    •*+Je+...+*-  •e+e+-+a-  ,f  ~ON 

**         *p  °e  3      4          p  '  •  •  •  e'*3     *4         5p  '  (5.23) 

such  that  this  map  agrees  with  Mp  to  order  P  i.e. 

J  ^  Alp     to  order  P.  (5.24) 

Here  #„  's  are  (homogeneous)  jolt  polynomials  of  degree  n  given  by  the  following  relation 


p(«)  _  fiWfo.fJ1      j  —  1    O  JT 

<5«    ~  Pn  Kl"l       I  —  1,  Z,  .  .  .  ,  A., 

where  /3«  is  a  real  coefficient.  The  matrices  /?,  belong  to  a  subgroup  of  5p(6,7?) 
(including  5p(6,  /?)  itself)  and  Rt  denotes  the  Lie  transformation  corresponding  to  these 
matrices  [cf.  eq.  (5.22)].  Since  J  is  given  as  a  product  of  finite  number  of  jolt  maps,  it  can 
be  evaluated  exactly  without  any  truncation.  Thus  it  gives  a  symplectic  approximation  to 
Aip  accurate  to  order  P. 

By  comparing  terms  of  same  order  in  M.p  and  J,  the  problem  of  obtaining  a 
symplectic  completion  for  M.p  can  be  reduced  to  the  following  general  problem:  Given  a 
nth  degree  homogeneous  polynomial  /„  and  A'  matrices  /?,,  find  the  coefficients  /%  s  such 
that  the  following  condition  is  satisfied 

E  #0*0i  =/-  (5-26) 

/=! 

Typically,  one  has  more  /%'  's  than  there  are  equations.  So,  one  imposes  constraints  on 
these  coefficients.  In  [10],  the  sum  of  squares  of  these  coefficients  is  minimized.  A  more 
sophisticated  approach  can  be  found  in  [12,  13]. 

To  proceed  further,  following  the  approach  given  in  [10],  one  goes  to  the  continuum 
limit  of  the  above  problem.  We  get  the  following  continuum  problem:  Given  an  nth  degree 
homogeneous  polynomial  /„  and  a  subgroup  G  ofSp(6,R)  on  which  invariant  integration  is 
well  defined,  find  the  function  g(u)  such  that  the  following  condition  is  satisfied 


/„  =   /  dug(u)R(u}qnv  (5.27) 

Jo 

Here  u  denotes  a  general  element  of  the  group  G  and  /?(M)  denotes  the  Lie  transformation 
corresponding  to  u.  All  integrations  are  invariant  integrations  performed  over  the  group 
G.  Again,  an  appropriate  constraint  on  g(u)  is  imposed. 

This  problem  has  been  solved  [10]  by  taking  the  group  G  to  be  SU(3>}.  The  solution  is 
given  by: 


Here,  £/s  (all  of  which  can  be  shown  to  be  non-zero)  and  0;m's  are  the  coefficients 


obtained  by  expanding  $  and  /„  respectively  in  terms  of  SU(3)  basis  vectors.  Further, 
T^Lm  (u)'s  are  me  complex  conjugates  of  the  usual  D  matrices  that  occur  in  5£7(3) 
representation  theory.  Finally,  d  is  the  dimension  of  the  517(3)  representation  labeled  byj. 

Once  the  continuum  solution  has  been  obtained,  one  can  go  back  to  the  discrete  case  by 
using  discrete  subgroups  of  SU(3)  or  by  quadrature  formulas.  See  refs  [10,  12,  13]  for 
further  details. 

Now  that  a  solution  g(u)  has  been  obtained,  we  would  like  to  reduce  the  number  of  jolt 
maps  involved  in  the  refactorization  so  that  a  fast  numerical  integration  method  is 
obtained.  A  substantial  reduction  can  be  achieved  by  splitting  the  integration  over  5£7(3) 
as  an  integration  over  5O(3)  followed  by  an  integration  over  the  coset  space  SU(3)/SO(3}, 
In  ref.  [10]  it  is  shown  that  the  number  of  jolt  maps  required  by  following  this  approach  is 
less  than  that  required  by  Irwin  [8].  Irwin  uses  the  group  £7(1)  x  17(1)  x  £7(1).  However, 
there  are  still  many  theoretical  problems  regarding  integration  over  coset  spaces  etc.  which 
are  not  fully  resolved.  Work  is  still  proceeding  along  these  lines. 

5.3  Solvable  map  method 

In  this  approach  [9,11],  the  symplectic  map  M.  is  refactorized  using  the  so-called 
'solvable  maps'. 

Solvable  maps  [11]  are  generalizations  of  Cremona  maps.  The  class  of  Cremona  maps 
includes  only  those  symplectic  maps  for  which  the  Taylor  series  expansion  terminates 
when  acting  on  phase  space  coordinates.  The  class  of  solvable  maps  also  includes  those 
symplectic  maps  for  which  the  Taylor  series  expansion  can  be  summed  explicitly.  The  Lie 
transformation  exp(:a<jj+2  +  bqlflp\  :)  is  a  simple  example  of  a  solvable  map  that  is  not  a 
kick.  One  finds  that 

for    /  >  1,  (5.29) 

,  "          (5.30) 


where 

E  =  aq(+2  +  bq(+lPl.  (5.31) 

The  basic  idea  behind  the  solvable  map  method  is  to  represent  each  nonlinear  factor 
exp(:/re:)  in  MP  [cf.  eq.  (5.18)]  as  a  product  of  solvable  maps: 

exp(:/n:)=exp(:gi:)exp(:g2:)---exp(:gM:)     for    n  >  3.  (5.32) 

For  example,  consider  the  Lie  representation  of  a  general  fourth  order  map  in  one 
dimension  [cf.  eq.  (5.18)]: 

M4  =  Mexp(:/3  :)  exp(:/4:)  (5.33) 

where 


/3  =  aiq   +  a2q[pi  +  a2q\p\  +  a4p\  ,  (5.34) 

/4  =  a$q\  +  a6q\p}  +  •  •  •  +  a9p{.  (5.35) 
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This  can  be  represented  in  terms  of  solvable  maps  as  given  below: 

,         (5.36) 


where 

bt  =  at    for    i  =  1,  2,  .  .  .  ,  5    and    i  =  9,  (5.37) 

^6  =  «6-3ai«3,  (5.38) 

b7  =  a-,  -  9aia4/2  -  3a2fls/2,  (5.39) 

(5-40) 


Only  some  preliminary  work  has  been  done  on  this  method.  It  needs  a  stronger  theoretical 
foundation. 

Another  related  approach  is  the  monomial  factorization  method.  See  Refs  [25,  26]  for 
further  details. 

6.  Summary 

In  this  paper,  we  saw  that  if  integration  performed  on  a  Hamiltonian  system  is  not 
symplectic,  one  can  draw  wrong  conclusions  regarding  the  long-term  stability  of  the 
system.  We  looked  at  various  methods  for  symplectic  integration.  There  is  no  single  ideal 
method  yet.  Generating  function  methods  suffer  from  the  drawback  that  implicit 
equations  have  to  be  solved  numerically  during  the  process  of  integration  with  its 
attendent  convergence  problems.  This  can  also  lead  to  reduction  in  the  speed  of  the 
algorithm.  Symplectic  RK  methods  are  again  implicit.  Symplectic  RKN  methods  can  be 
explicit  but  are  slow  for  long-term  integration.  This  is  especially  so  when  one  is 
integrating  complicated  systems  like  particle  storage  rings  with  thousands  of  components. 
Integration  using  symplectic  maps  can  be  very  fast  since  one  has  to  merely  iterate  the 
maps.  Methods  using  Lie  algebraic  properties  (like  jolt  factorization  method  and  solvable 
map  method)  also  give  maps  where  the  final  values  of  variables  are  explicit  functions  of 
initial  values.  For  these  reasons,  these  approaches  look  most  promising  especially  for 
long-term  integration.  However,  some  theoretical  problems  associated  with  these 
methods  are  not  fully  resolved. 
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Painieve  analysis  and  integrability  aspects  of  some 
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Abstract.  A  brief  review  of  the  Painleve  singularity  structure  analysis  of  some  autonomous  and 
nonautonomous  nonlinear  partial  differential  equations  is  discussed.  We  point  out  how  the  Painleve 
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1.  Introduction  and  historical  background 

The  subject  of  completely  integrable  models  is  fascinating.  Decades  of  research  in  this 
area  has  led  to  several  new  physical  and  mathematical  developments  which  are  quite 
beautiful  and  which  unify  various  aspects  of  physical  problems  that  appear  to  be 
desperate.  Mathematically,  we  encounter  new  concepts  such  as  complete  integrability, 
Lax  pair,  solitons  and  so  on  and  several  new  methods  have  been  developed.  Among  them 
the  Painleve  singularity  structure  analysis  is  one  of  the  systematic  and  powerful  method 
in  nonlinear  science  to  identify  the  integrability  case(s)  and  the  complete  integrability 
properties  of  nonlinear  systems.  From  the  physics  point  of  view,  the  models  discussed  in 
this  article  describe  physical  phenomena  in  such  diverse  areas  as  nonlinear  optics, 
hydrodynamics,  condensed  matter,  plasma  physics,  relativistic  theory  and  so  on.  In  this 
review  article  we  will  discuss  the  Painleve  analysis  of  various  nonlinear  models  which  are 
completely  integrable.  These  models  describe  systems  of  nonlinear  differential  equations 
which  can  be  solved  exactly.  Most  of  these  models  would  be  continuum  models  (both  one 
and  higher  dimensions  as  well  as  autonomous  and  nonautonomous)although  we  will  also 
investigate  the  Painleve  analysis  of  the  Toda  lattice  and  some  nonintegrable  systems. 
Before  analysing  the  singularity  structure  analysis  of  these  equations,  first  we  will  briefly 
discuss  the  historical  background  of  Painleve  analysis. 

Determining  whether  or  not  a  given  system  of  nonlinear  ordinary  or  partial  differential 
equations  is  integrable  raises  many  fundamental  issues.  In  particular  there  is  the  issue  of 
what  is  meant  by  'integrability'  and  the  issue  of  how  that  integrability  can  best  be 
determined  without  having  to  resort  to  a  complete  solution  of  the  problem.  For 
Hamiltonian  systems  the  notion  of  integrability  is  well  defined,  i.e.  the  existence  of  as 
many  involutive  first  integrals  as  there  are  degrees  of  freedom.  For  non-Hamiltonian 
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systems  things  are  less  clearcut.  Clearly  the  existence  of  integrals  can  lead  to  a  reduction 

of  the  order  of  the  system  and  hence  to  a  solution  in  terms  of  an  'integration  by 

quadratures'.  However,  this  is  clearly  not  the  whole  story  since  there  are  simple, 

apparently  'integrable'  ,  equations  such  as  the  Painleve  transcendents  which  do  not  have 

algebraic  integrals  and  for  which  an  integration  by  quadratures  is  not  possible.lt  now 

seems  that  it  is  possible  to  identify  many  different  classes  of  integrable  systems  on  the 

basis  of  their  analytic  structure,  i.e.  the  types  of  singularities  exhibited  by  their  solutions 

in  the  complex  domain.  The  techniques  for  doing  this  can  be  applied  to  both  the  ordinary 

and  partial  differential  equations  and  furthermore  can  be  extended  to  provide,  in  many 

cases,  explicit  solutions  to  the  systems  in  question.  It  also  turns  out  that  the  complex 

domain  of  nonintegrable  systems  also  contains  much  valuable  information  and  overall  it 

would  seem  that  the  study  of  analytic  structure  can  provide  a  wide-ranging  and  unified 

treatment  for  a  large  class  of  nonlinear  problems.  Historically,  the  first  application  of 

these  ideas  is  due  to  Kovalevskaya  [1]  in  her  famous  work  on  the  rigid  body  problem. 

This  concerned  the  then  popular  problem  of  trying  to  find  a  general  solution  to  the  Euler- 

Poisson  equations  which  describe  the  motion  of  a  top  spinning  about  a  fixed  point.  Her 

approach  was  mainly  to  determine  the  conditions  under  which  the  only  movable 

singularities,  i.e.  those  singularities  whose  positions  are  initial-condition-dependent, 

exhibited  by  the  solutions  to  the  equations  of  motion,  in  the  complex  plane,  are  ordinary 

poles.  She  found  that  this  only  occured  for  four  special  combinations  of  the  adjustable 

system  parameters  (the  moments  of  inertia  and  the  position  of  the  centre  of  gravity).  She 

was,   able  to   identify   the   known   integrable   cases   and  one  new  one.   Following 

Kovalevskaya  came  the  extensive  work  of  Painleve  [2]  and  co-workers  to  determine 

the  categories  of  second  order  equations  whose  only  movable  singularities  are  ordinary 

poles.  Working  with  the  general  class  of  equations 


where  F  is  analytic  in  x  and  rational  in  y  and  dy/dx,  Painleve  found  that  there  were  50 
types  which  had  the  desired  analytical  property.  Forty-four  of  these  equations  are  solvable 
in  terms  of  known  functions.  The  reamining  six  equations,  referred  to  as  the  Painleve 
transcendents  [3],  have  transcendental  meromorphic  solutions  for  which  convergent 
expansions  are  explicitly  known.  Despite  their  mathematical  interest  it  appeared,  for 
many  years,  that  the  Painleve  transcendents  were  devoid  of  physical  content.  However, 
in  the  last  two  decades  or  so  they  have  started  to  reappear  in  various  important  physical 
contexts  such  as  the  work  of  Wu  et  al  [4]  which  showed  that  two-point  correlation 
function  of  the  Ising  model  satisfied  the  third  Painleve  transcendent.  In  the  last  few 
years  there  is  a  growing  very  large  number  of  literature  on  the  Painleve  property  as  a 
means  of  identifying  integrable  cases  of  nonlinear  ordinary,  differential  difference  and 
partial  differential  equations.  For  example,  this  approach  has  been  used  successfully  to 
predict  integrable  cases  of  a  variety  of  systems  such  as  the  Lorentz  equations  [5],  the 
Henon-Heiles  system  [6],  the  three  particle  Toda  lattice  [7],  many  systems  in  soliton 
theory  [8-14],  etc.  It  has  also  been  observed  that  there  are  integrable  systems  with 
movable  rational  branch  points.  This  has  led  to  the  concept  of  a  so-called  'weak  Painleve 
property'  [8]. 
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For  partial  differential  equations  (PDE)  'complete  integrability'  is  usually  taken  to 
mean  the  existence  of  an  infinite  number  of  conservation  laws.  As  is  well  known  such 
systems  can  sometimes  exhibit  N-soliton  solutions  and  be  solved  by  the  inverse  scattering 
transform(IST)  method  [9].  As  with  ODEs  there  is  also  important  question  of  finding 
direct  tests  of  complete  integrability.  Constructive  approaches  include  using  the  method 
of  differential  forms,  the  use  of  Hirota's  direct  method  [10]  for  constructing  N-soliton 
solutions  and  certain  criteria  based  on  the  properties  of  ODEs  obtained  by  various 
reductions  of  the  PDEs  in  question.  This  latter  approach  has  been  developed  by  Ablowitz, 
Ramani  and  Segur  [11]  and  is  based  on  the  idea  that  all  the  ODEs  obtained  by  similarity 
and  travelling  wave  reductions  from  the  PDE,  should  possess  the  Painleve  property.  This 
conjecture  has  proved  to  be  quite  valuable,  although  there  are  clearly  certain  drawbacks 
such  as  the  difficulty  of  identifying  all  possible  reductions  of  the  PDEs  to  ODEs  and 
inability  of  the  conjecture  to  provide  further  information  about  actual  solutions  to  the 
equation  in  hand.  In  order  to  overcome  these  difficulties  Weiss,  Tabor  and  Carnevale  [12] 
who  have  formulated  a  Painleve  (P-)  type  test  that  can  be  applied  directly  to  PDEs 
without  any  need  for  reductions.  This  approach  does  seem  to  provide  a  valuable  first  test 
and,  in  addition,  seems  to  be  capable  of  yielding  other  important  informations  such  as 
Backlund  transformations,  Lax  pair,  rational  solutions  etc.  The  major  difference  between 
the  P-analysis  of  ODEs  and  PDEs  is  that  the  singularities  of  the  latter  are  in  general  not 
isolated  but  lie  on  some  analytic  manifold,  the  singular  manifold,  determined  by 
conditions  of  the  form 

0(*i,x2j...,*B)=0,  (2) 

where  0  is  analytic  in  the  neighbourhood  of  that  manifold.  To  show  that  a  given  equation, 
say 

qt  =  K(q),q  =  q(x,t),  (3) 

possesses  the  generalized  P-property  requires  a  demonstration  that  the  solutions  may  be 
expanded  locally  in  a  Laurent-like  series  about  the  singular  manifold  in  the  complex 
hyperspace,  i.e.  an  expansion  of  the  form 

oo 

q(x,  0  =  t-*(x,  0        qj(x,  t)#(x,  r),  (4) 


where  a  is  some(integer)  leading  order  and  the  qj(x,t)  a  set  of  expansion  coefficients 
analytic  in  the  neighbourhood  of  the  singular  manifold  0(jc,  t)  =  0.  Such  an  expansion 
will  have  certain  values  of;',  termed  'resonances'  ,  at  which  the  corresponding  qj  should  be 
arbitrary.  One  resonance  always  occurs  at  7  =  —  1  and  corresponds  to  the  arbitrariness  of 
<p  itself  and  the  positions  of  the  other  resonances  are  determined  by  the  nonlinearities  in 
K(q). 

By  Cauchy-Kovalevskaya's  theorem  such  an  expansion  of  the  general  solution  must 
have  sufficient  number  of.  arbitrary  functions  equal  to  that  of  the  order  of  the  PDEs. 
Furthermore,  depending  on  the  form  of  K(q),  the  equation  may  also  have  different 
leading  orders,  each  one  of  which  will  have  its  own  resonance  structure.  We  usually  call 
the  leading  order  corresponding  to  the  general  solution  the  'principle  branch'  and  the 
others  (with  less  arbitrary  functions)  as  'lower  branches'.  If,  at  a  resonance  (of  any 
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branch)  the  associated  %  fails  to  be  arbitrary,  terms  of  the  form  <pj  In  0  must  be  included 
in  the  expansion.  This  makes  the  solution  multi-valued  about  the  singular  manifold  and 
hence  the  P-property  is  lost.There  are  essentially  four  steps  involved  in  the  P-analysis  of 
PDEs:  (i)  Determination  of  the  leading-order  behaviours;  (ii)  Identification  of  the  powers 
at  which  arbitrary  functions  can  enter  into  the  Laurent  series  called  resonances;  (iii) 
Verifying  that  at  each  resonance  values  sufficient  number  of  arbitrary  functions  exist 
without  the  introduction  of  movable  manifolds;  (iv)  Establishing  connections  with  the 
Lax  pair,  Backlund  transformation  (BT),  bilinear  form  and  other  integrability  properties. 

2.  Painleve  analysis  of  autonomous  nonlinear  partial  differential  equations 

2.1  Painleve  analysis  of  modified  Korteweg  de  Vries  equation 

A  simple  illustration  is  provided  by  the  MKDV  equation  [12] 

qt  -  6q2qx  +  qxxx  =  Q.  (5) 


The  leading  order  is  easily  determined  to  be  —  1  with  q$  ~  (j)x.  For  finding  the  resonances, 
we  substitute 

1'  (6) 

into  (5)  and  equating  the  coefficients  of  0;~4  ,  we  get  the  resonance  values  in  the  form 
j  =  -1,3,4.  (7) 

The  resonance  j  =  —  1  represents  the  arbitrariness  of  the  singular  manifold  <f>(x,t)  =  0. 
From  the  other  powers  of  0,  one  finds  that 

j  =  0;     $o-=&;  (8) 


;  =  2;     ^-642^-         +  ^  =  0;  (10) 

j  =  3;#3  arbitrary  if 


j  =  4;  #4  arbitrary.  Clearly  the  'compatibility  condition'  at  j  =  3  is  always  satisfied  by 
virtue  of  the  relations  found  at;  =  2.  Thus  the  expansion  (6)  is  valid  and  we  can  say  that 
the  MKDV  equation  passes  the  P-test.  To  simplify  the  computations  one  can  use  a 
modification  of  this  method  proposed  by  Kruskal  [13].  This  consists  of  expanding  0 
about  the  singular  manifold  in  the  form  <j>(x,  t)=x  +  tf(t),  such  that  <£(V>'(*,  r))  =  0,  and 
setting  qj-qj(t).  For  the  MKDV  equation  this  reduced  approach  yields 
<?o  =  1,  q\  =  0,  q2  =  if>t/6,  4s  arbitrary,  q4  arbitrary.  From  the  point  of  view  of  the  'test' 
alone,  this  modification  is  rather  useful.  However,  to  construct  the  integrability  properties 
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is  easily  shown  to  pass  the  Painleve  test  [12-15].  The  leading  order  is  —2  and  the 
resonances  occur  at  j  =  —1,4,6.  The  truncated  expansion  then  takes  the  form 

<VJ 

4  =^2  log  0  +  42  .  (13) 

with 

<t>x<t>t  +  I2q2<t>2x  +  4&0«r  -  3^  =  0,      (f)xt  +  12^0^  +  0^  =  0,          (14) 
and 

q2)t  +  12^2^2^  +  q2,xxx  =  0.  (15) 


The  set  of  equations  (14)  and  (15)  constitute  an  auto  BT  for  the  KDV  equation.  It  should 
be  noted  that  the  second  logarithmic  derivative  in  (13)  is  reminiscent  of  the 
transformation  used  in  Hirota's  method  [10].  By  making  the  substitution  fa  —  i/l2-  leads  to 

=  0,     ^,  +  (602  +  A)^x  +  V«x  =  0,  •  (16) 


which  are  precisely  the  Lax  pair  for  the  KDV  equation  [9].  One  of  the  most  enduring 
techniques  in  the  study  of  integrable  nonlinear  PDEs  is  Hirota  method  for  constructing  N- 
soliton  solutions  [10].  The  method  proceeds  without  the  knowledge  of  the  1ST  and  in 
some  cases  was  used  to  construct  N  soliton  solutions  before  the  scattering  transform  had 
been  found  [10].  To  get  the  bilinear  transformation  from  the  P-  analysis,  we  have  to 
assume  that  the  constant  term  in  the  expansion  is  equal  to  zero.  i.e.  in  the  case  of  KDV, 
q2  —  0.  Then,  we  get 


//«*  -  4/i/«  +  3/i  +//*  -/,/,  =  0  (17) 

which  can  be  written  in  terms  of  Hirota's  bilinear  operators  as 

(D4x+DxDt)f-f  =  Q,  (18) 

where  ££(/•/)  =  (dx  —  dx'}nf(x)f(xf}  \  x  =  xf.  As  is  well  known  /  can  be  developed  in 
a  series  expansion  which  self  truncates  at  each  order  yielding  1,2,3,...,  N-soliton 
solutions.  It  is  generally  believed  that  if  one  can  construct  N-soliton  solutions  for  N  >  3, 
then  they  will  exist  for  all  N  and  the  system  can  be  deemed  integrable.  It  should  be  noted 
that  many  equations  can  be  reduced  to  bilinear  form  and  one  can  often  construct  one  and 
two  soliton  solutions  for  them  by  this  method.  However,  self-truncation  for  all  N  only 
seems  to  occur  only  for  completely  integrable  systems. 

2.2  Painleve  analysis  of  Pohlmeyer-Lund-Regge  equation 

Lund  et  al  [16]  have  shown  that  the  dynamics  of  relativistic  vortices  (equivalenfly,  strings) 
interacting  through  a  scalar  field  has  led  to  a  set  of  two  coupled,  Lorentz-mvariant, 
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nonlinear  equations  in  two  independent  variables.  The  equations  are  in  the  form 

^  -  sin  $  cos  $  -  ^—  &xVt  =  0,  (19) 

COSJ  <3? 

(*,  tan2  3>}x  +  (^  tan2  V\  =  0.  (20) 

Eqs.  (19-20)   are  conditions   for   embedding   a   2-dimensional   surface   in   a  three 
dimensional  sphere  which  itself  embedded  in  a  four  dimensional  Euclidean  space. 
Eqs.  (19)  and  (20)  were  also  derived  by  Pohlmeyer  [17]  through  a  study  of  the  nonlinear 
a-  models  of  field  theory. 
For  convenience,  we  use  the  transformation  given  by  Getmanov  [18]  in  the  form 


).  (21) 

Under  this  transformation,  eqs.  (19)  and  (20)  are  transformed  into 

.     ffxfffff  /,      j      \2\      n 

•««+l  —  i  —  \2~3(l~  I  4  I  )  =0- 


To  apply  P-analysis,  we  define  q  =  a  and  q*  =  b.  Eq.  (22)  in  terms  of  these  change  of 
variables  takes  the  form  [19] 

,  N  bxbta  ,  ,     , 

-a(l-ab)=Q,     £*,  +  -  -  --^(l-o£»)  =  0.  (23) 


xt  ,       *, 

1  —  ab  1  —  ab 

Leading  order:  a  =  /3  =  —  1  with  «o^o  =  —  V'f-  Resonances:  y  =  —  1,  0,  1,  2.  Collecting  the 
coefficients  of  (f>~4  and  0~4,  we  obtain 

a0ki+fliko  =  0.  (24) 

From  eq.  (24)  it  is  clear  that  either  a\  or  b\  is  arbitrary.  In  a  similar  way  proceeding 
further  by  collecting  the  coefficients  of  4>~3  and  <^~3,  we  get 

(aob2  -  boa2}i]jt  =  ^Ot^i  +  ^O«IM  (^o«2  -  00*2)^  =  ^Or«i  +  «o^i/-  (25) 

Using  (24),  it  is  easy  to  show  that  the  functions  a2  or  bi  is  arbitrary.  Thus  the  general 
solution  (a(x,i),b(x,t}}  of  eq.  (22)  admits  the  sufficient  number  of  arbitrary  functions 
without  the  introduction  of  any  movable  critical  manifold,  thus  satisfying  the  P-  property 
and  hence  the  system  is  expected  to  be  integrable.  As  eq.  (22)  satisfies  the  required 
condition  to  be  integrable,  we  now  proceed  to  obtain  the  associated  integrability 
properties  of  the  system.  Now  to  establish  the  integrability  properties,  the  series 
representation  is  truncated  at  the  constant  level  term  (a,-  =  bj  =  0,  j  >  1)  as 


where  «0  and  bQ  satisfy  eqs  (23).  Equation  (26)  can  also  be  treated  as  an  auto  BT  of 
eq.  (22).  Then  to  find  the  bilinear  form,  we  put  a\  =  b\  =  0  and  then  by  defining 

flo      G    '          bQ      G* 

a  =  T  =  ¥'     ^  =  T  =  "F  (27) 

(p      F  (p       F  v     ' 
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soiunon  10  it  ana  also  conjectured  JN  soliton  solutions.  lo  construct  the  bilinear  torm  trom 
(28),  we  define 


.  .  (29) 

Under  this  transformation  eqs  (28)  are  transformed  into  the  bilinear  equations 

AcCf  •/  +  **•  *)  =0,  (DxDt~  I)/  -  g*  =  0,  DxDt(f  •/-£*•#)  +  2g*g  =  0. 

(30) 

Once  the  bilinear  forms  are  known,  then  one  can  generate  the  soliton  solutions  by 
expanding  the  dependent  variables  in  terms  of  power  series. 

2.3  Some  nontrivial  equations  and  Painleve  conditions) 

It  would  seem  that  all  the  known  integrable  PDEs  pass  the  Painleve  test.  Some  nontrivial 
examples  include 

I.  Hirota-Satsuma  equation  [14] 

qt  -  X(6qqx  +  $«*)  =  -6ppx,     p,  +  3qpx  +  pxxx  =  Q  (31) 

which  is  found  to  pass  the  P-test  only  when  A  =  i 
n.  Unidirectional  Zakharov  equation  [20] 


(\q  \}x-  (32) 

HI.  Nonlinear  Schrodinger-Maxwell  Bloch  equations  [21] 

qx  =  ikqtt  -ig\qf  q  +  2a2{p),      Pt  =  iup  +fqr),      77,  =  2f(qp*  +  q*p)  . 

(33) 

Painleve  condition:—  2f2k  =  g. 
TV.  Higher  order  nonlinear  Schrodinger  equation  [22] 

iqz  +  a\qn  +  0.2  \  q  |2  q  -  ie[a3qttt  +  o^V),  +  otsq(qq*)t]  =  0.  (34) 


Painleve  conditions:^  =  2ai,  #3  :  0:4  :  as  =  1  :  6  :  —  3. 

2.4  Non-Painleve  property  of  the  integrable  extended  nonlinear  Schrodinger  equation 

Recently  Liu  et  al  [23]  have  shown  that  the  extended  nonlinear  Schrodinger  equation  is  a 
completely  integrable  system  and  constructed  the  N-soliton  solutions  through  Hirota's 
bilinear  method.  The  equation  is  of  the  form 

fyx  +    qtt+          q  ~  I6q"'  +  2iai        2    *  +  iai        2    *  =  0> 
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iat  Jr~axx+±a2b  +  il-Sa^  +  2ot.\abax  +  aia2bx]  =  0,  (36) 

-  ibt  +  \bja-  +  \ab2  -  i[-Sbm  +  1a\abbx  +  aib2ax]  =  0.  (37) 


Leading  Order:  a  =  /3  =  —  1;  ao^o  =  ^.  Resonances:  7  =  —  1,  0,  3,  3,  3,  4.  The  resonance 
'—  1'  corresponds  as  usual  to  the  arbitrariness  of  the  singular  manifold  and  '0' 
corresponds  to  the  fact  that  either  ao  or  &o  is  arbitrary  as  seen  from  the  leading  order 
coefficient.  As  the  system  admits  more  resonances  at  7  =  3,3,3  than  the  required 
arbitrary  functions,  we  conclude  that  eq.  (35)  fails  to  satisfy  the  Painleve  test. 

In  the  cases  considered  so  far  reveals  that  if  the  P-property  is  satisfied,  then  the 
equation  is  completely  integrable.  Thus  it  is  conjectured  that  this  property  indicates  the 
solvability  of  a  field  equation.  On  the  other  hand  we  cannot  conclude,  in  general,  that  a 
PDE  which  is  completely  integrable  has  the  P-property.  Examples  are  the  Harry  Dym 
equation  qt  =  q^q^  and  the  nonlinear  diffusion  equation  qt  =  (q~2qx}x  [14].  Both 
equations  are  integrable.  The  first  can  be  solved  by  the  1ST  and  the  second  can  be 
linearized  to  the  linear  diffusion  equation. 


2.5  Painleve  analysis  of  nonintegrable  equations 

Painleve  analysis  is  also  very  useful  to  understand  the  dynamics  of  the  nonintegrable 
evolution  equations.  For  example,  for  the  real  Newell-Whitehead  equation  [25] 

qt  =  fc  +  q  -  2?3  (38) 

the  leading  order  is  —1  and  the  resonances  are  —1  and  4.  For  simplicity,  we  define  new 
variables  w  =  <t>t/(f>x  ,  v  =  tfrxx/fa  and  from  the  arbitrary  analysis,  we  find 

j  =  0  :  qQ  =  <^,  (39) 

y=l:9i=i(w-3v),  (40) 


+  lwwx  -  2W  +|w3 

-  v^  +  3  wx  +  2v  -  v3  -  2vwx  -  iw2v)  ,  (42) 


WWx  +  lw-Lw*^  '  (43) 


From  eq.  (43),  it  is  clear  that  the  Laurent  expansion  fails  the  WTC  test  since  q4  is  found 
not  to  be  arbitrary.  We  therefore  have  the  following  "consistency  condition"  (assuming 


(44) 
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Painleve  analysis 

There  are  two  possible  ways  to  proceed:  (i)  either,  introduce  logarithmic  psi-series,  i.e. 

(45) 


(ii)  or,  force  (f>  to  satisfy  the  compatibility  condition  at  j  =  4.  This  takes  the  form  of  a 
nonlinear  PDE  which  can  be  solved  exactly.This  approach  gives  a  type  of  conditional  P- 
property  and  can  yield  special  type  of  solutions  meromorphic  in  the  singular  manifold 
which  is  now  specified  as  opposed  to  being  arbitrary.  In  view  of  the  above  facts,  we  can 
conclude  that  the  real  Newell-Whitehead  equation  has  the  "conditional  Painleve 
property".  It  is  also  interesting  to  note  that  some  of  the  special  techniques  developed 
for  integrable  systems  are  also  shown  to  be  applicable  to  nonintegrable  equations.  Let 
us  consider  the  second  example  from  one  parameter  continuous  (Lie)  group  of 
transformations  point  of  view  [26].  For  this,  we  consider  the  nonintegrable  KdV  equation 
in  the  form 

qt  +  qnqx  +  qxxx  =  ®,     n>2.  (46) 

Using  the  invariant  variables 

(x  +  &) 
(•  =    (    ^  °}  (47) 

C      (*  +  /3)1/3  l     } 

and 

,  =       /(O  (48) 

9      (at  +  SfP"  (     ' 

in  eq.  (46),  we  obtain  the  invariant  equation  in  the  form 

r+/r-(—  +c/Uo  (49) 

\n  J 

Inserting 

/~<*(C-Cor  (50) 

in  eq.  (49),  we  find  that  there  is  only  one  possibility 

-2 


Since  n  >  2,  eq.  (49)  will  have  a  movable  branch  point  of  order  —  2/n  provided  (50)  is 
asymptotic  near  £Q.  To  see  this  asymptotic  nature,  we  define 

f  =  v~2'n  (52) 

The  equation  for  v  is 

2  ,  ...     6/2       \     ...     2/2       \/2      A   .,     2  .     2  3     2C  2  /     /> 
-  -vV"+  -  -  +  1  vvV-  -  I  -  +  1  1  1  -  +  2  v'  3-  -v'-  -v3+  —  v2v'=  0 
n  n\n       J  n\n       J\n       J          n        n         n 

(53) 


K  Porsezian 

There  is  a  regular  solution  of  (53),  that  is,  regular  at  (Q,  if  v(£o)  =  0,v'((o)+ 
(2  +  n)(2  +  2n)/n2  (v'(Co))3  =  0.  v"(Co)  is  a  finite  quantity,and  v"'(Co)  is  finite.  Then 
v(£)  is  analytic  at  £0  and  so  (50)  is  asymptotic  near  Co-  Thus  eq.  (49)  is  not  of  Painleve 
type.  For  more  information  about  the  P-analysis  of  nonintegrable  systems,  we  refer  the 
reader  to  refs.  [25,  26]. 


3.  Painleve  property  for  differential-difference  equations 

In  this  section,  we  will  briefly  discuss  the  Painleve  analysis  of  the  discrete  systems.  The 
Toda  lattice  [27] 

^  =  e,fl.-.-fl._fffl.-a*.  (54) 

is  probably  the  best  known  member  of  a  class  of  differential-difference  equations  that  are 
integrable.   To   establish  the  P-property   [27],   we  define  the  variables   qn  =  Qn,f> 

pn  =  e2n-en+i5  ie 

Pn,t  =  Pn(qn  -  #n+l),       tfn,r  =  Pn-l  ~ Pn- 

A  solution  of  these  equations  was  found  by  Toda  on  making  the  substitution 

a2 


which  immediately  leads  to 
and 

Iff  ~1  il2 

Jn+l/n-l        H      __^_,         ,  ,_Rs 

The  one  soliton  solution  corresponds  to  the  choice /„  cosh(o;n  -  (sinh  a)i).  These  results 
suggests  a  truncated  'Painleve  type'  expansion  of  the  form 

n(°)      J1)  JO)       ^(0) 


from  which  one  can  easily  determine  that  q(£]  =  fa^p^  =  -^  andpi:)  =  0Biff,  i.e. 

A  =  £log*.+rf>,     9n  =  |log(tl)+9(')  (60) 

and  that 


Thus,  in  a  sense  we  are  expanding  each  dependent  variable  pn  about  its  own  'singular 


manifold  Junction  <pn.  ine  same  approach  tor  the  two  dimensional  loda  lattice, 

Pn,X  =  Pn(qn  ~  tf«+l),       <ln,t  =  Pn-\  ~  Pn-  (62) 

yields 

(63) 


which  immediately  leads  to 

<Pn-\ 


=5  log 

CM; 


and 


From  the  relations  (63)  and  (64)  one  can  easily  determine  Lax  pairs  for  the  corresponding 
systems.  For  example,  in  the  case  of  the  2-D  lattice  the  scattering  scheme  is  found  to  be 

and  on  substituting  into  (65),  combined  with  cross  differentiation  of  (66),  deduce  that 

d\  (2) 

(3n  =  A  (spectral  parameter),  an  =  —  qn    and  7,,  =  ^-  .  This  type  of  P-analysis  also  yields 

the  BT's  for  the  Toda  lattices  and  reveals  some  amusing  connections  with  the  rational 
solutions  of  the  second  Painleve  transcendent  [27]. 

4.  Painleve  analysis  of  nonautonomous  nonlinear  partial  differential  equations 

The  problem  of  nonlinear  wave  propagation  in  dispersive  and  inhomogeneous  media  has 
been  of  general  interest  and  has  had  wide  range  of  applications,  e.g.  radio  waves  in  the 
ionosphere,  waves  in  the  ocean,nonlinear  optics,  magnetic  systems,  etc,  [31-35].  For  the 
past  few  years,  several  inhomogeneous  nonlinear  PDEs  have  been  studied  from  the 
Painleve  property  and  soliton  point  of  view.  In  a  series  of  papers,  we  have  systematically 
investigated  the  Painleve  analysis  of  many  of  these  equations  and  explained  the 
construction  of  soliton  solutions  [35-40].  In  this  section,  we  will  briefly  discuss  the  P- 
analysis  of  some  nonautonomous  equations.  As  pointed  out  in  section  I,  to  identify  the 
Painleve  condition(s)  of  these  systems,  we  have  to  use  the  general  manifold. 

4. 1  Painleve  analysis  of  inhomogeneous  spherically  symmetric  nonlinear  Schrodinger 
equation 

First,  we  will  discuss  the  P-analysis  of  the  inhomogeneous  spherically  symmetric 
nonlinear  Schrodinger  equation  of  the  form  [35] 

+  (f  ^«i+2/l«  P  1 


+  [2  jf /,<  \q  |2  dr>  +  4(n  -  1)  jT  £    q  f  d/]  q  =  0.  (67) 
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The  above  equation  can  be  derived  from  the  inhomogeneous  spherically  symmetric 
continuum  Heisenberg  ferromagnet  in  arbitrary  (n-}  dimensions  through  moving  helical 
space  curve  [35].  Equation  (67)  contains  the  following  integrable  models: 

(i)  n  =  0,  /  =  constant:  nonlinear  Schrodinger  equation 

tyi  +  «w  +  2|tf  \2q  =  0(-oo<x<  oo).  (68) 

(ii)  n  =  1,  /  =  ax  +  ft:  deformed  nonlinear  Schrodinger  equation 

ifc  +  (ctt  +  jS)(fe  +  2|9  2q}+2oi(qx  +  q  f  |  </  f  d/)  =0.  (69) 

\  ./o  / 

(iii)  n  =  2,  /  =  constant  :  deformed  radial  nonlinear  Schrodinger  equation 

' 


2 


=  0.  (70) 


After  removing  the  integral  through  new  variable  R  and  by  denoting  q  =  a  and  q*  =  b, 
eq.  (67)  can  be  written  as 

iflf+/flIT  +  —  -flr- 


(71) 

-  ibt  +fbrr  +~br  -b     +  1Kb  +  Ifrbr  +     /rr  +       ~/r    *  =  0, 

(72) 
arb  +  2^    =  0.  (73) 


Leading  order:  cc  =  j3=-l,  7=  -2,  fl0&o  =  ^J,  ^o  =  -/$•  Resonances:  _/  =  —!, 
0,2,3,4.  From  the  detailed  arbitrary  analysis  we  find  that  eqs.  (71-73)  are  free  from 
movable  critical  manifolds  only  when/  is  of  the  form: 


where  e\  and  £2  are  integration  constants.Hence,  the  inhomogeneous  spherically 
symmetric  nonlinear  Schrodinger  equation  is  expected  to  be  integrable  in  arbitrary  (n—  ) 
dimensions  only  when  the  inhomogeneity  is  of  the  form  (74).  Working  with  the  truncated 
expansions 


the  following  overdetermined  systems  of  equations  are  obtained:  0(0~3,  0~3,  0~3): 

0r2,     *b  =  -/0?.  (76) 


Painleve  analysis 


(M       j-ri- 
2fr  -\ /  jflO^r  ~ /(2<ZQr0r  +  ^O^rr)  "t~  2(<3Q*1  +  *0fll)  =  0, 

(77) 
=  0, 

(78) 

n-  1 

RQr  -  Ri(j>r  -  2fraob0  -f[(aobo)r  —  (a0fri  +  aifro)<?M  -  2 fa0 bo  —  0. 

(79) 


wdr  +  Wrr  +     /  fl     +2(aQR2  +  fli/Ji)  =  0,  (80) 

fti^i)  =  0,  (81) 


~  2    y-/(oo*i  +  «i  +  fro)  =  0. 

(82) 

By    noting  that   (a0  fro/  +  «0f  fro)  =  —  2<^r0rt   and  using  the  expression  for    0,   from 
eqs.  (77-78)  one  may,  after  some  simplifications,  obtain 


r"-1^ 

Treating  the  constant  of  integration  as  the  spectral  parameter  A  and  identifying  OQ  —  ii/'i. 
£>o  =  11$,  a\  =  q,  fri  =  q*,  and  (f)r  =  —  iifjifa,  gives  the  scattering  problem 

The  time  dependent  part  of  the  problem  may  be  obtained  directly  from  equations  (80-82) 
by  repeated  use  of  (83)  and  (84),  giving 


=  (iR  -  2;A2/>2(M-1))Vi  +    -2Xfqr*-1  +  i(Jq)T  +  f/<?,     (85) 


(86) 

Here/  is  given  in  eq.  (74)  and  the  consistency  of  the  linear  eigenvalue  problem  (84)  and 
(85-86)  is  indeed  the  evolution  equation  (67)  (only  for  the  choice  of  /  given  in  eq.  (74)), 
provided  A  evolves  as 

(87) 


4.2  Painleve  analysis  of  the  generalized  x-dependent  MKDV  equation 
We  consider  the  GMKDV  equation  in  the  form  [31] 


( 


=0  (88) 


In  this  equation  1/1,1/3,  AH  and  /^  are  real  constants  and  q  is  a  real.  For  ^i  =  A*3  =  0, 
eq.  (88)  reduces  to  the  well  known  MKDV  equation.  If  instead  p$  ^  0,  eq.  (88)  is  an 
integro  differential  equation;  it  can  be  reduced  to  a  pure  differential  equation  by  changing 
the  dependent  variable  from  q  to  j^00  q2(xl,  t)&  or  differentiating  with  respect  to  x  after 
dividing  by  qx.  However,  we  can  remove  the  integral  term  by  defining  a  new  dependent 
variable  R  =  J^°  q2tf  ,  t)<&.  Now  eq.  (88)  can  be  rewritten  as  a  set  of  coupled  equations 
in  the  form  [40] 

qt  +  fiiq  +  (1/3  +  At3*)(&w  -  6Rxqx]  +  (MI*  +  v\)qx 

+  /i3(3fe  +  4^-20,10=0,  (89) 

Rx  =  f  (90) 

to  within  an  arbitrary  function  of  t  (which  occurs  when  (90)  is  integrated),which  can 
always  be  removed  by  a  simple  time  dependent  gauge  transformation  of  q.  Leading  order: 
a  =  0  =  -1  with  ql  =  -^  RQ  =  $x. 
Resonances:  j  =  -1,  1,  3,4.  From  the  truncated  series,  we  get 


With  suitable  transformation  for  q0  and  q\  [15,  35,  40],  after  some  simplifications,  we 
obtain 

,  (92) 

(93) 


where  A  =  -4iA3(Ai3;t  +  "3)  -  2i\^(x  /  ffa!)x  +  viq2}  +  i(v\  +  H\x}\  B  = 


/i!*)g,     C  =  4 

[-fc  +  2g3]  -  (1/1  4-  fJ.\x)q  with  A,  =  -/ii  A  -  4^3  A3. 

4.3  Painleve  analysis  of  inhomogeneous  deformed  Kaup  system 

The  deformed  Kaup  system  is  in  the  form  [39] 

%  +  ^q2  +  xqqx  +  xrix  =  0,     rjt  +  2rjq  +  xr\xq  +  j^x  +  3qxx  +  xq^  =  0 

(94) 

Leading  order:  a  =  —  1,  /3  =  -2,  g0  =  ±20^,  TJO  =  —  2^.  Resonances:  j  =  -1,2,3,4. 
Lax  pair: 

ipxx  +  Uii>  =  0,     ^r  =  A^  +  5^  (95) 
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Painleve  analysis 
with 

*-A'+'£  +  2-g.     A-^-a.     B  =  -f  (96) 

The  compatibility  leads  to  eq.  (94)  only  when  A  satisfies 

A,  =  -2/A2.  (97) 

4.4  Painleve  analysis  of  parametrically  driven  Sine-Gordon  equation 

We  consider  the  generalized  sine-Gordon  equation  [41] 

qtt-qxx+f(t)smq  =  0.  (98) 

It  is  well  known  that  the  sine-Gordon  equation  qn  —  qxx  +  sin  q  =  0,  which  is  completely 
integrable,  does  not  pass  the  Painleve  test  directly,  but  after  the  transformation  V  =  elq 
[13,  41].  Under  this  transformation,  eq.  (98)  takes  the  form. 

vvn-vf-vvxx  +  yj  +  lzf(t}(v3  -  v]  =  o.  (99) 

Leading  order:  a  =  —  2,  VQ  =  4(<^  —  $}/f 

Resonances:  j  =  —  1,  2.  At  the  resonance  7  =  2  we  obtain  the  ODE 

<«»> 

As  expected  this  ODE  has  the  P-property.  The  general  solution  is  given  by 

f(t)  =  Ciec*',  (101) 


where  C\  and  Ci  are  integration  constants.Similar  type  of  equation  with  damping  was 
also  investigated  in  [42]. 

4.5  Some  inhomogeneous  equations  and  Painleve  conditions) 
I.  Generalized  KDV  equation:  [43] 

qt+f(t)qto  +  g(t)qm  =  0.  (102) 

Painleve  condition:  g(t]  =/(f)(a0  J'  dsf(s)  +  ^o).  «o  and  b0  are  arbitrary  constants. 
n.  Generalized  KDV  equation  [44]: 

qt  +  qqx  +qxxx+  a(x,  t}q  =  b(x,  0  .  (103) 

Painleve  conditions:  ax  =  Q,at  +  la2  —  bx  =  Q. 
HI.  Cylinderical  KDV  equation  [45] 

qt  +  qqx  +  qxxx+j-  =  0.  .  (104) 

Painleve  condition:  a  =  ^. 
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IV.  Generalized  nonlinear  Schrodinger  equation  [43] 

iqt  +  a(t)qxx  +  P(t)\q\2q  =  V.  (105) 

Painleve  condition:  a(t)  =  P(i)(ai  f  &sf(s)  +  b\),ai  and  b\  are  arbitrary  constants. 

V.  Damped  Nonlinear  Schrodinger  equation  [46] 

iqt  +  q»-  \q  I2  9  =  *(*,  ')<?  +  *(*.  ')•  (106) 

Painleve  conditions:  a  =  x?(&/2  -  jS2)  +  Jcai  +  a0  +  i/?,  &  =  0,  ao(f),  <*i  (0  a"11  0(0  ^ 
arbitrary  real  analytic  functions. 

VI.  Cylinderical  Nonlinear  Schrodinger  equation  [47] 

;«,  +  fe+l?l2«  =  |.  (107> 

VIT.  Inhomogeneous  nonlinear  Schrodinger  equation  [38] 


Painleve  condition:  /  =  ax  +  /?. 

VHI.  Generalized  inhomogeneous  nonlinear  Schrodinger  equation  [37] 

,  ,2  N     /      r  i  ,2 ,  A 

iqt  +  (ax  +  /?)(<?xx  -\-2\q  \  q)  +2a(qx  +  q        \q\   ax  ] 

\  Jo  / 

+  i/i!  M,  +  xV(fc«  +  6  |  q  |2  &)  =  0       (109) 
DC.  Radially  symmetric  nonlinear  Schrodinger  equation  [38]: 

"    +2Ul2^  +  4(n-l)^  /"V|<H2dr'  =  0         (110) 

r  JO 


Painleve  conditions:  n  =  1,2. 

X.  Inhomogeneous  Nonlinear  Schrodinger-Maxwell  Bloch  equations  [36] 

qx  =  *9«  ~  ig  I  g  I2  4  +  a2(x,  0«  +  (?)  ,    Pr  =  iwp  +  fqrj, 

r,t=2f(qp*  +  q*p).  (Ill) 

Painleve  conditions:  —  2/2fc  =  g,  0:2  = 


5.  Painleve  analysis  of  some  higher  dimensional  equations 

5.1  Painleve  analysis  of  Kodomtsev-Petviashvili  equation 
The  KP  equation  [12,  14] 

qtx  +  4x  +  qqxx  +  0qXm  +  qyy  =  Q,     a  =  ±l  (112) 

possesses  the  P-property.  Leading  order:  —2  with  go  =—  2$2  Resonances:  -1,4,5,6. 
Then  equating  the  various  powers  of  <f>  to  zero  we  obtain  sets  of  equations  and  one  can 
easily  verify  that  the  functions  ^4)g5  and  q&  are  arbitrary  and  so  the  P-property  holds. 
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ramieve  analysis 
From  the  truncated  expansion,  the  Lax  pair  is  found  to  be  in  the  form 


/  r*          \ 

(qx-a  /    qydx'  )=  0     (113) 

V  Joo  / 


the  compatibility  of  which  is  the  KP  eq.  (112).  The  Painleve  analysis  of  similar  type  of 
equations  is  also  discussed  in  [48]. 

5.2  Painleve  analysis  of  cylin  derical  Kadomtsev-Petviashvili  equation 
The  cylindrical  Kadomtsev-Petviashvili  equation  is  of  the  form  [49] 

qxt  +  qqxx  +  fx+  g^  +  a(t)qx  +  b(t)qyy  =  0  (1  14) 

Painleve   conditions:   da/  dt  +  2a2  =  0,  db/dt  +  4ab  =  0.   Solving  the   above   condi- 

tions,we  get  a(t)  =  0,  b(t)  =  constant,  or  a(t]  —  l/2(t  -  r0),  b(t)  =  bo/(t  -  tQ}2  . 

5.3  Painleve  analysis  of  Davey-Stewartson  equation 

The  higher  dimensional  version  of  the  nonlinear  Schrodinger  equation  [50] 

+  (\q\2}xx=Q     (115) 


where  cr,-  =  ±1,  i  =  1,  2.  Leading  order:  a  =  (3  =  —  1,  7  =  —  2  with  (OQ  +  b^)  =  2/cr2 
(cri^  +  0^),  RQ  =  —  (2/0-2)  ^l-  Resonances:  —1,0,2,3,3,4.  From  the  truncation,  the 
bilinear  form  is  constructed  as 

(iDt  -  ^D*  +  D]  -  a2\2)g  •  F  =  0(cri£j  +  D2y  -  a2X2}F  •  F  =  -atf  -  G 

(116) 

where  A  =  constant. 

6.  Summary  and  conclusions 

In  this  short  review,  we  have  briefly  discussed  the  Painleve  analysis  and  integrability 
properties  of  some  autonomous  and  nonautonomous  nonlinear  partial  differential 
equations.  For  a  class  of  physically  important  equations  we  have  demonstrated  that 
this  technique  is  one  of  the  very  useful  and  powerful  method  in  nonlinear  science  to 
establish  the  integrability  properties  like  Lax  pair,  Baucklund  transformation,  bilinear 
form  and  so  on.  In  this  article,  I  have  not  included  the  Painleve  analysis  of  the  coupled 
nonlinear  Schrodinger  equations.  For  coupled  systems,  this  analysis  is  found  to  be 
cumbersome  and  the  construction  of  Lax  pair  (from  the  truncation)  is  also  an  unsolved 
problem. 
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Abstract.  We  briefly  review  the  recent  progress  in  obtaining  (2  +  1)  dimensional  integrable 
generalizations  of  soliton  equations  in  (1  +  1)  dimensions.  Then,  we  develop  an  algorithmic 
procedure  to  obtain  interesting  classes  of  solutions  to  these  systems.  In  particular  using  a  Painleve 
singularity  structure  analysis  approach,  we  investigate  their  integrability  properties  and  obtain  their 
appropriate  Hirota  bilinearized  forms.  We  identify  line  solitons  and  from  which  we  introduce  the 
concept  of  ghost  solitons,  which  are  patently  boundary  effects  characteristic  of  these  (2+1) 
dimensional  integrable  systems.  Generalizing  these  solutions,  we  obtain  exponentially  localized 
solutions,  namely  the  dromions  which  are  driven  by  the  boundaries.  We  also  point  out  the 
interesting  possibility  that  while  the  physical  field  itself  may  not  be  localized,  either  the  potential  or 
composite  fields  may  get  localized.  Finally,the  possibility  of  generating  an  even  wider  class  of 
localized  solutions  is  hinted  by  using  curved  solitons. 

Keywords.  Solitons  in  higher  dimensions;  integrability;  coherent  structures;  Painleve  analysis; 
Hirota  method. 

PACS  Nos    03.40;  11.10;  03.50 


1.  Introduction 

The  concept  of  soliton  was  born  from  the  now  famous  numerical  experiments  of  Zabusky 
and  Kruskal  [1]  on  the  stability  properties  of  solitary  wave  solutions  of  the  Korteweg-de 
Vries  (KdV)  equation.  These  results  were  then  analytically  confirmed  by  the  seminal 
work  of  Gardner,  Greene,  Kruskal  and  Miura  [2],  who  essentially  developed  the  method 
of  inverse  scattering  transform  (1ST)  procedure  to  solve  the  Cauchy  initial  value  problem 
of  the  KdV  equation.  Since  then,a  very  large  class  of  (1  +  1)  dimensional  nonlinear 
dispersive  wave  equations  such  as  the  sine-Gordon  (sG),  nonlinear  Schrodinger  (NLS), 
Heisenberg  ferromagnetic  spin  equations  and  so  on  have  been  identified  [3,  4]  which  are 
also  solvable  by  the  1ST  method  exhibiting  the  so  called  soliton  solutions. 
Furthermore,  these  systems  are  understood  to  be  completely  integrable  infinite 
dimensional  dynamical  systems.  These  equations  encompass  a  vast  number  of  topics  in 
(1  +  1)  dimensions  in  many  areas  of  physics  and  other  sciences  and  engineering  [5].  In 
(1  +  1)  dimensions,  the  soliton  systems  admit  solitary  waves  and  these  waves  retain  their 
shape  and  speed  under  collisions.  Mathematically,  the  corresponding  nonlinear  evolution 
equations  are  linearizable  and  linear  spectral  problems  can  be  identified  with  each  one  of 
them.  What  are  the  possible  counterparts  of  such  soliton  systems  in  2  space  and  1  time, 


structures  can  oe  laenimea  nere,  wnenever  appropriate  oounaary  errecis  arc  mcmueu  ui 
the  (2  +  1)  dimensional  systems.  The  plan  of  the  review  is  as  follows.  In  §  2,  we  give  a 
very  brief  sketch  of  the  concept  of  solitons  in  (1  +  1)  dimensions  and  some  of  the 
counterparts  in  (2  +  1)  dimensions.  In  §  3,  we  point  out  how  such  soliton  equations  are 
associated  with  linear  spectral  problems  and  how  their  initial  value  problems  can  be 
solved  using  the  inverse  scattering  transform  method.  In  addition,  we  point  out  how  by 
treating  the  1ST  problem  as  just  a  d-bar  problem  associated  with  analytic  functions,  the 
above  procedure  can  be  extended  to  (2  4- 1)  dimensions.  In  addition,  we  list  some  of  the 
important  (2+1)  dimensional  integrable  nonlinear  evolution  equations  which  have  been 
considered  in  the  recent  literature  and  possible  algorithmic  procedures  to  look  for 
exponentially  localized  solutions.  In  §  4,  we  sketch  the  salient  features  associated  with  the 
Painleve  analysis  and  Hirota  bilinearization  procedure  treating  KdV  as  an  example. 
Exponentially  localized  solutions  for  the  simplest  (2+1)  dimensional  generalization  of 
KdV  is  obtained  in  §  5  indicating  the  influence  of  boundary  effects.  Section  6  contains 
brief  details  of  the  results  for  the  other  (2  +  1)  dimensional  evolution  equations.  Finally 
§  7  gives  a  summary  of  the  results. 

2.  The  concept  of  soliton  and  its  generalization 

2.1  Solitons  in  (1  +  1)  dimensions 

For  linear  dispersive  systems,  it  is  well  known  that  the  associated  evolution  equation 
admits  elementary  wave  solutions  of  trignometric  type,  satisfying  appropriate  dispersion 
relations  u  =  u(k}.  The  Fourier  transform  of  such  elementary  solutions  over  the  fc-space 
then  gives  the  general  solution,  thereby  solving  the  initial  value  problem  [6]. 

In  contrast,  often  the  elementary  wave  solutions  of  nonlinear  dispersive  systems  are 
expressible  in  terms  of  elliptic  functions  (example  cnoidal  waves)  and  in  particular,  one 
has  amplitude-dependent  dispersion  relations,  uj  =  u}(k,A}.  For  appropriate  choice  of 
constants,  one  can  often  obtain  solitary  waves  as  special  solutions  for  these  wave 
equations.  However,  because  of  the  nonlinearity,  Fourier  transform  of  such  elementary 
solutions  will  no  longer  be  a  solution.  On  the  other  hand,one  may  be  able  to  identify  a 
nonlinear  superposition  principle  asympotically. 

Example:  Korteweg  -  de  Vries  equation 

ut  +  6uux  +  u^  =  0.  (1) 

The  solitary  wave  solution  is 

fc2          [k  1 

M  =  «iM  =  -sech2 \-(x - k2t  +  8}  ,  (2) 

L          V-  J 

where  k  and  8  are  constants.  One  can  also  have  a  more  general  solution 


^cosech2X2  ,  .-!- — ^  . 
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Figure  1.    Two  soliton  interaction  in  the  KdV  equation  (see  (3)). 

>,-,  i  =  1,2  (ki,  <5, :  constants).  With  a  little  bit  of  analysis,  one  can 


check  that 


' 


-»       (  1k\  sech2(xi  -  A)     as     Xi  ~*  O(l),  X2  -+  oo 
-oo  1 2kl sech2(x2  +  A)     as     X2  ->  O(l),  xi  -*  o° 


J  2fcJ  sech2(xi  +  A)     as 
|  2fc2sech2(x2-A)     as 


X2  -»  oo 


(5) 
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and  so  one  can  interpret  the  solution  (3)  as  representing  two  solitary  waves  of  differing 
amplitudes  travelling  in  the  same  direction  undergoing  elastic  collisions  and  coming  out 
without  change  of  shape  or  velocity  except  for  a  phase  shift.  The  actual  plot  of  the 
solution  (3)  for  different  times  as  given  in  figure  1  actually  confirms  this  interpretation. 
Such  solitary  waves  are  then  essentially  called  solitons.  In  fact,one  can  obtain  explicit  N- 
soliton  solutions  too  as  is  well  known  [3]  and  also  show  that  the  solution  of  the  Cauchy 
initial  value  problem  for  general  initial  conditions  leads  asymptotically  to  such  Af-soliton 
solutions  in  the  background  of  small  oscillatory  decaying  waves. 

The  above  soliton  picture  holds  good  not  only  for  the  KdV  equation,  but  for  many 
other  ubiquitous  systems  in  (1  +  1)  dimensions;  sine-Gordon,  nonlinear  Schrodinger, 
modified  KdV,  Heisenberg  continuum  spin  chain  equations  are  some  of  the  standard 
examples  [3]. 

2.2  Excitations  in  (2+1)  dimensions 

A  question  naturally  arises  as  to  what  are  the  basic  nonlinear  excitations  in  higher 
dimensions,  especially  in  (2  +  1)  dimensions.  In  recent  years,  at  least  three  basic 
excitations  have  been  identified  in  (2+1)  dimensional  nonlinear  generalizations  of 
(1  +  1)  dimensional  soliton  possessing  systems.  They  are 

(i)  line  solitons  (decaying  everywhere  except  along  certain  lines), 
(ii)  lump  solitons  (algebraically  decaying)  and 
(iii)  dromions  (exponentially  decaying). 

The  line  solitons  are  nothing  but  straightforward  generalization  of  the  soliton  solutions 
to  (2+1)  dimensions.  They  decay  exponentially  everywhere  in  space  except  along 
certain  lines,  where  they  are  bounded.  For  example,  for  the  case  of  the  so  called 
Kadomtsev-Petviashvilli  I  (KPI)  equation, 

(Uf  +  6UUX  +  Uxxx)x  -  lUyy   =  0,  (6) 

which  is  obviously  a  generalization  of  KdV,  one  has  the  line  soliton  solution  (figure  2) 

<5)].  (7) 


Figure  2.    Line  soliton  solution  of  the  KPI  equation  (see  (7)). 
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Figure  3.    Lump  soliton  solution  of  the  KPI  equation  (see  (8)). 


Equation  (6)  has  also  a  lump  soliton  of  the  form 


=  x 


- 


(8) 


which  decays  algebraically  and  is  illustrated  in  figure  3.  Finally,  considering  the  Davey 
Stewartson  equation  I, 

("i  +  ^  =  0, 


6  =  ±1,  (9) 

it  admits  exponentially  localized  solutions  [7,  8],  namely  dromions  which  are  driven 
along  specified  tracks  determined  by  the  boundaries  (dromos  in  Greek)  of  the  form 
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Figure  4.     (1,1)  dromion  solution  of  the  DSI  equation  (see  (10)). 


where 

|=f  —  2put,     r\  =  r\  —  2s\it  (10t>) 

and  p,  s,  j,  k,  I  are  constants.  The  form  of  (lOa)  is  explicitly  shown  in  figure  4. 

In  the  remaining  sections,  we  wish  to  explain  how  such  exponentially  localized 
dromion  possessing  equations  can  be  generated  and  how  such  solutions  may  be  realized. 
To  start  with,  we  demonstrate  this  for  the  KdV  type  equations. 

3.  Linearization  and  IST-(2  + 1)  dimensional  KdV 

It  is  well  known  [3]  that  the  (1  +  1)  dimensional  soliton  systems  are  linearizable  in  the 
sense  that  given  the  nonlinear  evolution  equation 

where  K  is  some  nonlinear  functional,  then  the  compatibility  of  the  two  linear  systems 

where  LI  and  LI  are  linear  differential  operators,  gives  rise  to  the  Lax  equation 

[Li,L2]=0,  (13) 

which  is  equivalent  to  the  evolution  equation  (11).  Example:  KdV: 
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Direct    Scattering 


Time 
Evolution 


Inverse   Scattering 


Figure  5.    The  schematic  representation  of  the  1ST  method. 


d2 


(Schrodinger  operator) 


(14a) 


jr 
ot 

Then  given  the  initial  data  u(x,  0)  with  u  ..r^  0,  (i)  analyzing  the  scattering  problem  for 
Liip  =  0,  the  scattering  data  5(0)  at  time  t  =  0  can  be  obtained,  (ii)  Making  use  of 
Z/27/J  =  0,  one  can  obtain  the  time  evolution  of  S(t)  almost  trivially,  (iii)  Carrying  out  an 
inverse  scattering  transform  procedure  for  the  scattering  data  S(t)  and  by  solving  the 
associated  Gelfand-Levitan-Marchenko  equation,  the  'potential'  M(X,  t)  can  be  uniquely 
determined  and  so  the  initial  value  problem  can  be  solved. 

The  above  three  steps  are  schematically  illustrated  in  figure  5.  Now  the  crucial  point 
is  that  an  extension  of  the  above  procedure  to  (2+1)  dimensions  can  also  be  carried 
out  provided  we  interpret  the  1ST  procedure  as  (i)  a  d-bar  problem  in  complex  analysis 
for  the  direct  scattering  and  (ii)  the  inverse  scattering  part  as  equivalent  to  obtaining 
the  analytic  function  (given  the  d-bar  data)  using  the  generalized  Cauchy  integral 
formula  [9-11]. 

Thus  the  extension  of  the  1ST  analysis  crucially  depends  upon  the  linearization  of 
(2+1)  dimensional  equations  and  identifying  appropriate  Lax  pairs  in  order  to  carry  out 
the  d-bar  analysis.  One  way  of  doing  this  is  to  generalize  the  Lax  pairs  L\  and  LI  as  given 
in  (12)  by  introducing  operators  and  functions  dependent  on  a  second  spatial  variable  y  in 
a  consistent  way  such  that  the  Lax  equation  (13)  gives  the  appropriate  nonlinear  evolution 
equation.  We  demonstrate  this  for  the  case  of  KdV  equation  in  the  following  subsection. 
Similar  analysis  can  be  carried  out  for  other  ubiquitous  soliton  equations  [12]. 
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3.1  Generalization  of  KdV  to  (2+1)  dimensions 

There  are  several  ways  to  generalize  the  Lax  operators  L\  and  LI  but  retaining  the  form  of 
the  Lax  equation  (13).  We  may  identify  at  least  three  interesting  possibilities  in  the  case 
of  the  KdV  equation.  They  are  as  follows. 

Method  1  [13].  One  essentially  includes  first  derivative  operators  of  y  in  L\  while  making 
compensatory  changes  in  LI  in  order  to  obtain  the  evolution  equation  from  Lax  condition. 
For  example  for  the  KdV  Lax  pah-  (14),  one  can  make  the  modifications 

and 

so  as  to  get  the  KPI  and  KPn  equations  (for  a  =  i  and  1  respectively). 

Method  2  [14].  One  keeps  the  operator  LI  unchanged  while  introducing  first  derivatives 
of  y  in  LI\ 

Ll=L(dx}-\  (16a) 

M 
Li  =  dt  +f(X)dy  +  ]TVm(;t,y,06£*.  (16b) 

m=0 

For  KdV,  this  reads 


uxdy  +  3/?w^  +  2(3udx.  (17) 

Then  the  Lax  equation  [Li ,  L^\  =0  leads  to  the  so  called  breaking  soliton  equation: 

ut  =  4/3uuy  +  2f3uxd~luy  -  pu^  =  0,  (18) 

provided  the  eigenvalue  A  in  (17)  evolves  as  H 

"i^ 

A,  =  AAy  (19) 

leading  to  the  breaking  wave  nature  in  the  y  direction. 

Method  3  [15].  One  can  also  look  at  the  possibility  of  introducing  derivatives  in  the 
second  (y)  variable  with  order  greater  than  one.  However  such  a  possibility  in  general 
leads  to  constant  coefficients  of  L\  or  LI  and  so  to  trivial  evolution  equations.  To 
circumvent  this  difficulty  in  the  case  of  (2  +  1)  dimensional  systems,  one  realizes  that  it 
is  enough  to  have  only  a  sufficiently  broad  class  of  solutions  of  the  linear  equations 
£,1-0  =  0  and  Lity  —  0  and  that  the  spectrum  need  not  be  common.  As  a  result,  one  can 
impose  the  weaker  Lax  condition 

(20) 


m  me,  suospace  or  tne  eigenrunction  ip  or  equivaienuy 

[L1,L2]=7Li,  (21) 

where  7  is  some  operator. 

Example:  KdV  generalization 

L!  =  (d2x  -  a2^}  +  u(x,  v,  f)  =  d&,  +  «&  77,  ?),  (22) 

LI  =  dt  +  fc,a  +  fcid  +  Sfcjd  Xfy  +  3^6'^^.  (23) 


From  the  weak  Lax  equation,  one  can  then  obtain  the  Boiti  et  al  generalized  KdV 
equation  [15] 

ut  +  kiu#t  =  3fci  (ud~lu  (24) 


when  £2  =  0  in  (23).  When  ki  ^  0,  the  Nizhnik-Novikov-Veselov  (NNV)  equation  is 
obtained  as 

ut  +  huge  +  ^"^  =  3k2(ud^lUjl)ri  +  3ki  (ud~lu^}^.  (25) 

For  a  more  general  class  of  equations,  see  for  examaple  Ablowitz  and  Clarkson  [3].  In 
this  way,  the  other  soliton  equations  in  (1  +  1)  dimensions  can  also  be  generalized  to 
(2+1)  dimensions  though  the  exact  procedure  may  vary  from  system  to  system  [3], 
Some  of  the  higher  dimensional  systems  which  have  been  studied  extensively  in  recent 
times  include  the  following  evolution  equations  apart  from  the  ubiquitous  K-P  and 
Davey-Stewartson  equations  already  mentioned: 

(1)  Generalized  KdV  [15] 

«»  +  «*«  =  3  ("^"eV  (26) 

(2)  Generalized  Nizhnik-Novikov-Veselov  (NNV)  equation[16] 

ut  +  «&  +  "77771?  +  au$  +  bu^  =  3(ud~lu{)^  +  3(^1MT?)r?.  (27) 

(3)  Breaking  soliton  equation  [12] 

ut  +  au^  +  6auux  +  buxxy  +  4buuy  +  4buxd~luy  =  0.  (28) 

(4)  (2  +  1)  dimensional  generalized  NLS  equation  [17] 

iqt  -(a-  ffiqx  +  (a  +  (S)qm  -  2\q  (a  +  /?)  \q\        +  «i  (»/,  0 


-  (a  -  (3}{  f"   \q\ldrf  +  u2&  t}\}  =  0.  (29) 

W-oo  J  J 

(5)  (2  +  1)  dimensional  generalized  NLS  equation  [18] 

(30) 
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(6)  (2  +  1)  dimensional  generalized  sine-Gordon  equation  [19,  20] 

0,  (31a) 

(31b) 

(7)  (2  +  1)  dimensional  long  dispersive  wave  (2LDW)  equation  [21] 

Xqt  +  qxx-  2qv  =  0,  (32a) 

Ar,  -  r^  +  2rv  =  0,   •  (32b) 

(qr)x=v1l,ail^dx-X9y.  (32c) 

We  will  consider  the  existence  of  exponentially  localized  solutions  in  all  these  systems  in 
the  next  sections. 

4.  Painleve  analysis  and  Hirota  bilinearization 

4.1  Painleve  analysis 

In  order  to  investigate  the  integrability  aspects  of  the  above  type  of  nonlinear  evolution 
equations,  one  can  proceed  to  analyze  the  singularity  structure  of  the  solutions  in  the 
complex  space  of  the  independent  variables,  namely  the  so  called  Painleve  property.  Such 
a  procedure  has  been  found  to  give  considerable  wealth  of  informations  for  finite 
dimensional  nonlinear  systems  [22-24].  Application  to  pdes  has  also  been  well  developed 
and  the  procedure  of  Weiss,  Tabor  and  Carnevale  [25]  is  algorithmic  and  identifies  the 
nature  of  the  singularities  in  the  local  neighbourhood  of  a  noncharacteristic  singular 
manifold. 
Let  us  consider  a  NLEE  of  the  form 

U,  +  K(U)=Q,  (33) 

where  K(u]  is  a  nonlinear  functional  of  u(xi,x2, . . . , Jtm,  t)  =  u(X, t)  and  its  derivatives 
upto  order  N  so  that  (33)  is  an  Nth  order  nonlinear  pde.  Then  one  can  say  that  (33) 
possessess  the  P-property  when  the  following  conditions  are  satisfied. 

The  solution  of  (33)  must  be  single  valued  about  the  noncharacteristic  movable 
singular  manifold.  More  precisely,  if  the  singular  manifold  is  determined  by 

0(x,0=0,     <k,(V)^0,     <j)t(x,t)^Q,     i=l,...,M  (34) 

and  u(x,  t]  is  a  solution  of  the  pde  (33),  then  we  have  the  Laurent  expansion 


u(x,  0  =  [fa  OP  £  u,(x,  0^(jc,  0  (35) 

J=0 

in  a  deleted  neighbourhood  of  the  singular  manifold  (34)  and  m  is  an  integer.  By  the 
Cauchy-Kovalevskaya  theorem,  the  solution  (35)  should  contain  N  arbitrary  functions, 
one  of  them  being  the  singular  manifold  <f>  itself  and  the  others  coming  from  the  u/s. 
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Then  the  WTC  procedure  to  test  the  given  pde  for  its  /'-property  essentially  consists  of 
the  following  three  steps  [25]. 

(i)  Determination  of  leading  order  behaviours. 

(ii)  Identification  of  powers  j  (resonance)  at  which  the  arbitrary  functions  can  enter  into 
the  Laurent  series  expansion  (35)  and 

(iii)  Verifying  that  at  the  resonance  values  j,  a  sufficient  number  of  arbitrary  functions 
exist  without  the  introduction  of  a  movable  critical  singular  manifold.  An  important 
feature  of  the  WTC  formalism  is  that  the  generalized  Laurent  series  expansion  can 
not  only  reveal  the  singularity  structure  aspects  of  the  solution  and  integrability 
nature  of  a  given  pde,  but  can  also  provide  an  effective  algorithm  which  in  most 
cases  successfully  captures  all  its  properties,  namely  the  linearization  (Lax  pair),  the 
Backlund  transformation  (BT),  Hirota  bilinearization,  symmetries  and  so  on  [3].  As 
a  simple  application,  we  illustrate  the  above  aspects  with  KdV  as  an  example.  Any 
other  soliton  system  can  also  be  likewise  analysed  [3,26]. 

For  the  KdV  equation  (1),  we  substitute  the  formal  Laurent  expansion  (35)  around  the 
singularity  manifold  </>(*,  t]  =  0  into  (1)  and  equate  the  powers  of  (j)  to  zero.  One  finds 
that  the  exponent  m  =  4-2  and  that  at  j  =  —1,4,6  arbitrary  functions  can  enter  the 
Laurent  series  (35).  Identifying  the  arbitrariness  of  <j>  withy  =  —  1  recursively,  one  finds 

j  =  0  :  u0  =  -2$,  (36a) 

j  =  I  :  U}  =  2&K,  (36b) 

7  =  2:  <f)x(j)t  +  6M20JCC  -  2u3(f)2x  +  </w  =  0,  (36c) 

7  =  3  :  $#  +  6u2(/)xx  ~  2w3<^  +  (^^  =  0,  (36d) 

7  =  4:  0,(0*  +  6"2&*  ~  2"3$  +  </w)  =  °-  (36e) 


Now,  it  is  clear  that  by  the  condition  (36d),  (36e)  is  always  satisfied  so  that  u^(x^t)  is 
arbitrary.  Similarly,  one  can  derive  the  condition  at  7  =  5  and  prove  that  at  7  =  6,  u$(x,  t) 
is  arbitrary.  As  the  KdV  equation  is  of  third  order,  the  Laurent  series  admits  three  arbitrary 
functions  and  so  the  Painleve  property  is  satisfied. 

Now,  if  the  arbitrary  functions  «4  and  we  are  chosen  to  be  identically  zero  and  if  we 
require  «3  =  0,  then  Uj  =  Q,j  >  3  provided  HI  satisfies  the  KdV  equation.  Thus,  we  obtain 
the  Backlund  transformation  for  the  KdV  in  the  form 

M^Clog^  +  ifc,  (37) 

where  u\  and  u2  solve  the  KdV  and  (f>  satisfies  (36a-e)  with  w3  =0.  By  a  set  of 
transformations,  it  is  possible  to  show  that  the  defining  equations  for  <f>  are  equivalent  to 
linearizing  equations  (12)  and  (14).  One  can  apply  a  similar  procedure  to  any  other  NLEE 
in  (1  +  1)  or  (2+1)  dimensions  and  obtain  its  integrability  property  [24].  For  recent 
applications  in  (2  +  1)  dimensions,  see  for  example  Radha  and  Lakshrnanan  [27-33]. 

4.2  Hirota  bilinearization 

We  have  noted  above  that  the  Backlund  transformation  of  the  form  (37)  can  be  used  to 
bilinearize  the  NLEE.  Indeed  considering  the  vaccum  solution  u2  =  0  in  (37)  one  obtains 
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the  transformation 

u  =  2(108^  (38) 

so  that  the  KdV  equation  can  be  written  in  the  bilinear  form 

&r0  -  <t>x<t>t  +  &«*0  -  4^^  +  30^  =  0.  (39) 

Using  the  so-called  Hirota's  bilinear  operator 

ZWM)  =  (dx  -  d,}m(dt  -  ^XvW^OL^-,,  (40) 

equation  (39)  can  be  rewritten  in  a  compact  form  as 

<f>-<l>=:Q.  (41) 


The  properties  of  the  bilinear  operator  can  be  easily  worked  out.  For  example,  one  has  the 
relations  D£a-l  =  fij'a,  D%(a-b)  =  (-l}mD?(b-a),  D£(a-a)  =0,  m  odd,  and  so  on. 
Using  such  properties,  the  calculations  can  be  simplified  considerably. 
Now,  expanding  $  in  a  formal  power  series  in  e, 


(42a) 

n=Q 

where 

N 

0W  =  £«cp(TK),t»  =  fcfX  +  u^  +  7^  (42b) 

1=1 

and  fc,  w,-,  1?,.  are  constants,  the  JV  soliton  solution  of  KdV  can  be  obtained.  To  see  this, 
one  substitutes  (42)  in  (41)  and  equates  each  power  of  e  separately  to  zero  to  obtain  the 
following  set  of  equations: 

0(1}  :  0  -  0,  (43a) 

°(e)'-^)+&  =  0,  (43b) 

0(e2)  =  &  +&  =  -\(DxDt  +  D*x}^.^\  (43c) 

and  so  on.  The  procedure  is  then  to  use  (43b)  in  (43c)  and  succesively  solve  the  remaining 
equations.  In  practice,  one  finds  the  solution  for  #=  1,2,3  and  then  hypothesize  it  for 
arbitrary  N  which  is  to  be  proved  by  induction. 

For  example,  for  N  =  l,  <j>W  =  exp(rji)  and  from  (43b),  Wl  =  -fcj  and  (d^  +  fij) 
(j>(  >  =  0  so  that  (j>®  =  0  and  ^W  =  0,  i  >  2.  Thus,  the  solution  of  (43b)  can  be  written  as 

^(1)  =  exp^i),     rn  =  kix  -fy  +  rff\  (44) 

Making  use  of  (38)  and  (42a),  it  is  straightforward  to  obtain  the  one  soliton  solutions  of 
KdV  given  by  (2). 
Similarly  for  the  case  N  =  2, 


i  =  1,2  (45a) 
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so  that  the  solution  (42a)  becomes 

9(2)  =  1  +  exp^  )  4-  exp(r/2)  +  exp(?7i  +  %  +  Aw)  ,  (45b) 

where  A  12  is  a  constant.  Again  this  leads  to  the  2-soliton  solution  (3)  of  the  KdV  equation. 
In  an  analogous  fashion,  one  can  proceed  to  find  the  AT-soliton  solution  also.  It  is  also 
useful  to  note  that  with  the  solution  of  the  Hirota  equation  in  a  form  such  as  (45b),  it  is 
quite  easy  to  understand  the  elastic  nature  of  the  soliton  interaction  as  discussed  in  §  2. 
As  noted  above,  all  the  other  known  soliton  equations  in  (1  +  1)  dimensions  can  also 
be  bilinearized  and  the  soliton  solution  obtained  through  the  Hirota  method.  In  the  next 
sections,  we  will  apply  this  method  to  generate  interesting  coherent  structures  in  (2  +  1) 
dimensions. 

5.  Localized  coherent  structures  in  the  (2  +  1)  dimensional  KdV  equation 

As  an  example,  we  consider  the  generalized  KdV  equation  (26)  of  Boiti  et  al  and  obtain  its 
localized  solutions  [28].  We  rewrite  it  as 

ut  +  i*Ke  =  3(wv)e,  (46a) 

us  =  v,.  (46b) 

Note  that  the  potential  v  itself  satisfies  the  equation 

=  K      +  3     fee^1 


5.1  P  -property 

Considering  the  Laurent  expansion 

«  =  $~>(*.y>0^H*)  (48a) 

j=o 

(*>*0tf/f/',  (48b) 

around  the  noncharacteristic  singular  manifold  <f>(x,ytt)  =  0,  0X,  ^,  ,  0,  ^  0,  we  can 
algorithmically  check  that 

a  =  /3=:_2,     M0  =  20^,     v0=2(f>l  (49) 

and  that  one  member  each  of  (1*2,  va),  ("4,  V4)  and  (wg,  vg)  are  (the  coefficient  functions) 
arbitrary  in  addition  to  the  manifold  (j>  itself  without  the  introduction  of  a  critical  singular 
manifold.  Thus,  the  P-property  is  satisfied  here. 

5.2  Hirota  bilinear  form 

One  can  easily  check  that  the  Laurent  series  (48)  can  be  cut  off  at  constant  level  terms  to 
consider  special  solutions  of  the  form 

M  =  ^  +  ^.+  M2,  (50a) 

9        9 
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(50b) 


so  that  if  (u2,  vj)  solve  (46),  so  also  (it,  v).  Thus,  (50)  can  be  effectively  considered  as  an 
auto-Backlund  transformation  and  its  further  properties  can  be  worked  out.  Particularly 
useful  to  us  is  the  vacuum  solution 

M2=V2  =  0.  (51) 

Then  with  the  expressions  (49)  for  (MQ.VQ)  and  (MI,VI)  (from  a  knowledge  of  the  P- 
analysis),  the  transformations  (50)  can  be  simplified  to 

«  =  -20ft  log  <£,  (52a) 

v=-2%log<£.  (52b) 

On  substituting  the  above  into  the  original  evolution  equation  (46),  one  obtains  the 
bilinear  version 

2<jxj>v  -  l<pr,(f>t  +  20$£ft  -  2<^<^  +  60£0ft  -  60^0£ft  =  0.  (53) 

In  the  Hirota  operator  notation,  (53)  can  be  rewritten  as 

t-O.  (54) 


Expanding  0  as  a  power  series, 

^1+e^+eW-",  (55) 

equation  (54)  can  be  written  as  a  set  of  coupled  linear  pdes: 

0(e):     $  +  ^  =  0,  (56a) 

o(e2)  :    ^  +  4|,  =  -KD^+DID,}^-^,  (56b) 

etc. 

5.3  Line  solitons  and  ghost  solitons 

Equation  (56a)  obviously  admits  special  classes  of  solutions  of  the  form 

\  (57) 


where  ^-,  /,-  and  Xj  s  &r&  constants.  One  can  use  these  solutions  for  various  choices  of  TV 
to  obtain  the  so  called  line  solitons  as  in  the  case  of  KdV  equation  as  discussed  in  §  4.2. 
For  example,  with  the  choice  N  =  1,  we  have 


(58) 
and  the  corresponding  solution  to  the  evolution  equation  becomes 

u  =  -28^  log  4  =  -2^,  log(l  +  0(1))  -  -  —  sech2  (^)  ,  (59a) 


,  (59b) 
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where  e  has  been  scaled  out  in  the  final  form  of  the  solution.  These  structures  in  (2  +  1  ) 
dimensions  are  line  solitons,  because  eventhough  they  have  soliton-like  properties,  they 
are  localized  in  the  two  dimensional  space  except  along  the  lines  x\  —  0>  where  the 
solution  is  though  bounded  does  not  go  to  zero  as  x,  y  —  »•  oo.  AMine  solitons  can  also  be 
constructed  following  the  procedure  suggested  in  §  4  and  their  interaction  properties  can 
be  studied  in  the  usual  way. 

But  more  interestingly,  we  observe  the  fact  that  in  (59),  k\  and  l\  are  arbitrary 
parameters.  One  finds  that 

as    k\  —  >  0,     both    w,  v  —*  0,  (60a) 

whereas 


as    /i  ->0,  a->0,  v^sech2(fcif-*?r  +  Xi)-  (60b) 

^6 

Thus  eventhough  the  physical  field  vanishes,  the  potential 

ug*/  +  vi  &0  (61) 


survives.  Comparing  (60b)  and  (61),  one  concludes  that 

lim    \>(f  rt  t\  —  v,  (f  t\  — L  *(*rh(iri£  —  Ir't  4-  v^\  (fO} 

11111      y^S)  '/)  I  I    —    ^1  V.S)     /    — «^  oCL/Ill  ACjq  —  /Ci  I  T~  X]       I  \         / 

77-1—00  2 

is  the  ghost  or  background  soliton  at  the  boundary  77  =  —  oo. 


5.4  Localized  solution:  Dromions 

The  existence  of  ghost  solitons  at  the  boundary  77  =  —  oo  prompts  one  to  look  for  a  more 
general  ansatz  than  (58)  so  that  we  may  choose 


<f>  =  1  +  e*1  +  e*2  +  £e*'  +*2  ,  £  >  0,  (63a) 

where 

Xi=*i£-*if  +  ci,  X2  =  /i7/  +  c2  (63b) 

so  that 


(64) 
+  expXl   +  expeXpXlX2)]2' 

while 


v  =  -2fl^  log  0  =  +  2X2)] 


+  K  exp(xi 

(65) 

One  observes  that  while  the  physical  field  is  now  exponentially  localized  in  the  entire 
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plane,  the  potential  v  is  not.  In  this  case,  when 

u  —  »•  0     as     77  —>  -oo,  (66a) 

(66b) 


so  that  we  can  interpret  that  the  energy  is  continuously  injected  at  the  boundary  77  —  »  —  oo 
in  the  form  of  a  soliton  (with  k  ^  0)  so  that  the  physical  field  can  overcome  the  tendency 
to  disperse  so  as  to  get  a  fully  localized  structure.  Solution  (64)  is  then  called  the  (1,1) 
dromion  solution  to  the  NLEE  (46). 

Multidromion  solutions  can  also  be  obtained.  For  example  a  (2,  1)  dromion  solution  is 
obtained  as 


2(1  -  K)[kili  exp(xi  +  £2)  H-  k2l\  exp(%2  +  Xs)] 
u  = 


[1  +  exp(xi 
where 

(67b) 

•  (67c) 

(67d) 
One  may  obtain  (Af  ,  Af  )  dromions  also  proceeding  in  this  way. 

5.5  Further  general  localized  structures 

Unlike  the  case  of  (1  +  1)  dimensional  soliton  equations,  the  linear  set  of  Hirota 
equations  (56)  can  also  have  more  general  solutions  than  the  one  given  by  (57).  For 
example,  (56a)  admits  general  solutions  of  the  form 

;(7?)],  (68) 

j=i 

where  mj(r})  is  an  arbitrary  function  of  77.  For  example  with  the  choice  N  =  I  in  (68),  we 
can  obtain 


"(&  ^  t)  =  -ikim(  (r])  sech2i[*i£  -  jfejr  +  mi  (77)]  (69) 

where  m\(r])  =  (toni/drj)  is  arbitrary  (as  m}  (77)  is  arbitrary).  Choosing  m^rj)  suitably, 
one  can  obtain  various  types  of  localized  solutions.  For  example,  with  the  choice, 

m(  (77)  =  sech2  (OTJ  +  0)  ,     a,  /3  :  constants  (70a) 

we  have  the  exponentially  locaHzed  solution 

u  =    seca7          sechfc-  (?0b) 
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Many  other  interesting  classes  of  exponentially  localized  solutions  including  oscillatory 
and  rational  functions  can  be  similarly  constructed  for  suitable  choices  of  m,j(rf)  [35]. 

6.  Exponentially  localized  structures  in  other  (2  +  1)  dimensional  NLEEs  [34] 

The  procedure  elucidated  in  §5  for  the  (2+1)  dimensional  generalized  KdV  equation 
can  be  extended  to  other  (2  +  1)  dimensional  systems  discussed  in  §3  as  well.  In  the 
following,  we  summarize  the  main  results. 

6.1  NNV  equation 

Proceeding  as  in  the  case  of  the  generalized  KdV  equation  (46),  we  rewrite  (27)  as 

ut  +  u#£  +  urm  +  aut  +  bun  =  3(uv)^  +  3(w^)7?,  (7  la) 

u*  =  VT,,     UT,  =  q&  (71b) 

the  Painleve  property  can  be  established  [28].  Working  out  the  auto-Backlund 
transformation,  the  bilinearizing  transformation  becomes 

ii  =  -2^  log  6  (72a) 

(72b) 
(72c) 
The  line  solitons  work  out  to  be 


1.2  fnr>  N 

u  =  —  —  sectf,  (73a) 

(73b) 
(73c) 


where 

Xi  =  W  +  /ii?  -  (c*i  +  k^t  -  (d/i  +  l\)t  +  X?  (73d) 


so  that  in  the  limit  k\  — »  0,  u,  v  — >  0,  while  q  survives,  whereas  when  l\  — »  0,  M,  q  — >  0, 
but  v  survives.  Thus  we  have  two  nonparallel  ghost  solitons  in  the  present  case.  Following 
the  procedure  discussed  in  the  previous  section,  we  take  (j>  to  be  of  the  same  form  as  (63a) 
so  that  the  (1,1)  dromion  in  the  present  case  becomes 

,         \  2*i/i(l  —  K)  exp(xi  +  X2)  v      r.  f~A\ 

M(£i  ??)  t)  = o  )     A  >  0  (74) 

[1  +  exp(xi)  +  exp(x2)  +  ATexpUi  +  X2)j 

where  now 

Multidromions  solutions  can  be  similarly  worked  out. 
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6.2  Breaking  soliton  equation 

Rewriting  (28)  as 

«/  +  Q-UXXX  +  buyyy  +  6auux  +  4buuy  +  4buxv  =  0,  (76a) 

uy  =  vx,  (76b) 

one  can  check  algorithmicaily  that  the  Painleve  property  is  satisfied  in  the  nontrivial  case 
only  for  the  choice  a  =  0,  b  ^  0  (b  =  0,  a  ^  0  is  the  trivial  KdV  case)  so  that  we  have  the 
equation  of  motion 

Mt  +  buxxy  +  4buuy  4-  4buxv  =  0,  (77a) 

uy  =  vx.  (77b) 

Note  that  v  itself  satisfies  an  equation  of  the  form 

vr  +  bVxxy  +  4bvvx  +  4bvyd-\x  =  0.  (78) 

One  verifies  that  bilinearization  is  possible  only  for  the  special  case  (77)  and  the  required 
transformation  works  out  to  be 

(79a) 
(79b) 
The  bilinearized  equation  becomes 

(DxDt  +  bD3xDy)(j)-(j)  =  0.  (80) 

Expanding  (f>  in  a  formal  power  series  as  before  and  truncating,  one  obtains  the  line 
soliton  as 

(gla) 

(gib) 
where 

Xi  =  fcuc  +  /iy  -  Wftf  +  x[0).  (81c) 

As  A;  -»  0,  both  w  and  v  -»  0,  while  when  /t  -»•  0,  the  potential  v  ->  0,  but  the  field  M 
survives  as  a  static  ghost  soliton  of  the  form  u  -»  |  fc2  sech2(fcijc  +  x^)  at  the  boundary 

*  =  —CO. 

The  dromion  solutions  can  be  obtained  in  the  usual  way.  For  generating  the  (1,1) 
dromion,  we  take 


K  >  0  (82a) 

Xi  =  kix  +  x?  ,     X2  =  hy  -  b%ht  +  xf  (82b) 
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so  that  the  potential  v  becomes  fully  localized  exponentially, 
,      3 


2  [1  +  exp(xi)  +  exp(x2)  +Kexp(xi  + 
whereas'  the  physical  field 


/    „  ,N  _  3  (1  +  K}k\ expfai  +  X2)  +  *i  exp(xi)(l  • .  »  —  r vw,       (^ 

Ml^C«      y.      t      J         """""  ™  '  '"         ""'"*'  -i. -•-»-.-  -n_....m_,...--  -  I       Q^f.     I 

2  [1  +  exp(xi)  +  exp(x2)  +  ^exp(^i  + 


which  is  though  bounded  does  not  fall  off  to  zero  as  xi  —  »•  °°-  One  can  proceed  further 
[29]  and  obtain  an  (l,N)  dromion  of  the  form 


v        =  , 

(llW)  12' 


Finally,  as  in  the  case  of  the  generalized  (2+1)  dimensional  KdV  equation  (§  5.4),  one 
can  obtain  more  general  exponentially  localized  solution  of  the  form 

•  (86a) 

(86b) 

where  g(£)  and  h(£)  are  arbitrary  functions  in  the  variable  £  =  y  —  fc^  ?.  Choosing 
/x(£)  suitably,  one  can  indeed  obtain  a  very  large  class  of  localized  structures  for  the 
potential  v. 

Thus  in  the  case  of  the  breaking  soliton  equation  (77),  it  is  not  the  physical  field  which 
is  localized  but  it  is  rather  the  potential  which  admits  localized  solutions.  Looking 
from  another  point  of  view,  one  may  take  (78)  along  with  (77b)  as  the  basic  evolution 
equation,  in  which  case  we  have  now  the  new  physical  field  which  admits  localized 
structures. 


6.3  (2+1)  dimensional  NLS  equation 

Considering  the  symmetric  generalized  (2+1)   dimensional  nonlinear  Schrodinger 
equation  (29)  introduced  by  Fokas  [17],  it  can  be  rewritten  as 


iqt  +  (a  +  ftqm  -(a-  flqK  -  2Xq[(a  +  /3)V  -  (a  -  {3)U}  =  0,         (87a) 

tel?  =  V*  (87b) 

\q\l  =  Ur,.  (Sic) 

Equation  (87)  contains  three  important  systems: 

(i)  a  =  (3  =  ^:  Simplest  complex  scalar  equation  in  (2  +  1)  dimensions 
(ii)  a  =  0,  (3  =  1:  Davey-Stewartson  equation  I  (DSI) 
(iii)  a  =  1,  /3  =  0:  Davey-Stewartson  equation  ffl  (DS1H). 
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y  =  -Adw  log  0,     17  =  -  A%  log  0,  (88b) 

one  obtains  the  bilinearized  form 

[/A  -  (a  -  /3)Z>2  +  (a  +  PF$g-<l>  =  0,  (89a) 

DeZV>.0=-2Agg*.  (89b) 

Expanding 

£  =  E£l+£3£3  +  ---,       <£=l+£202+eV4  +  ---  (90) 

and  following  the  usual  procedure,  the  basic  line  solitons  are  obtained  as 

tf(f » *?>  0  =  ^exp(-^)  sech(xu?  +  ip)  expfai/),  (91  a) 

, V, 0  =  -A4sech2(x1J?  +  ^),  .    (91b) 
,  17, 0  =  -A^sech2(xi«  +  V),  (91c) 

Xi  =Pit  +  W-i[(<x-P)p2i-(a  +  P)s\]t  +  Xi)}-  (91d) 

One  identifies  two  nonparallel  ghost  solitons,  one  at  77  =  -oo, 

U  =  -Ap^sech2[xi/?  +  V],     XIR  =  Pi*[C  +  2(a  -  ^pnf],  (92a) 

and  the  other  at  £  =  —  oo. 

V  =  -A4sech2[xi*  + 1//],     xi*  =  *!«[»?  +  2(a  4-  /3)ji/f].  (92b) 

Consequently,the  exponentially  localized  (1,1)  dromion  is  obtained  with  the  choice 

0im  =  1  +jexp(xi  +  x*)  +  fcexpOa  +  ^2)  +  /exP(Xi  +  X*  +  Xz  +  %2)> 

(93a) 
with 

Xi  =  Pit  ~  i(<*  ~  P}p2!t  +  ci ,     X2  =  sw  +  i(a  +  flfit  +  c2,  (93b) 

where  pi,  si,  ci  and  c2  are  constants.  Choosing  g  appropriately,  one  then  obtains 

quo  =  pexp^  +  ^^  +  '{Pi^  +  M+[(Q  +  ^)ki|2-(a-/3)^i|2]}) 
1  +;exp(2p^|)  +*exp(2j!^)  +  /exp(2|plje|  +  j,^])  ' 

(94a) 
where 

|  =  £  +  2(a  -  /3)^i/r,     r)  =  r?  -  2(a  +  p)sut.  (94b) 

Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
182  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


Localized  coherent  structures  of  soliton  systems 

The  multidromions  can  also  be  constructed  by  generalizing  the  above  procedure. 
Similarly  exponentially  localized  breather  solutions  also  follow  by  putting  s\i  =  p\j  =  0 
in  (94a)  to  obtain 


6.4  TTie  (2+1)  dimensional  generalized  NLS  equation  used  by  Strachan  [18] 

The  system  of  equations  (30)  is  completely  integrable  and  satisfies  the  Painleve  property 
[27].  It  can  be  bilinearized  through  the  transformation 

<Z  =  |,     9*=^,     v  =  2Vog0  (96) 

as 

-(f),  -  (97a) 

(97b) 


While  admitting  line  solitons,  it  does  not  admit  two  nonparallel  ghost  solitons  driving  the 
boundaries  in  the  usual  sense.  Thus  the  basic  dromion  solutions  of  the  form  considered  in 
the  previous  examples  do  not  seem  to  arise  in  this  case. 

However,  one  finds  that  a  more  general  form  of  localized  solutions  corresponding  to 
curved  solitons  can  exist.  To  generate  such  localized  solutions,  we  expand  g  and  <fr  in  the 
form  of  a  power  series  similar  to  (90)  to  give  rise  to  the  following  set  of  equations: 

e:ig?}=g<£>  (98a) 

,2:0£)=g(V1)*  (98b) 

and  so  on.  Solving  (98a),  we  obtain 

AT 

j(y>  0  +  q,  (99a) 


where  m/(y,f)  is  an  arbitrary  function  of  (y,  f)  chosen  such  that 

m,(y,  0  =  mj(pj)  =  nij(y  -  ikjt]  (99b) 


and  kj  and  c/  are  complex  constants.  To  construct  one  soliton  solution,  we  take  N  =  \  and 
substitute  g^  in  (98b)  to  give 

4>W  =  exp(Xl  +  X\  +  2^),     exp(2tf)  =  -     .  (100) 


Hence,  the  physical  field  q  and  the  potential  V  assume  the  form 

q  =  kiR  sech(xi/?  +  ^)  exp(ixu),  (lOla) 

V  =  2fclj?(mw)     sech(xi/;  +  V),     Pw  =  y  +  but,     ku  =  Im(ki).     (lOlb) 
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It  is  evident  from  (101)  that  both  the  physical  field  q  and  the  potential  V  remain  finite  on 
the  curve 

C  =  X\R  +  "0  =  k\Rx  +  mlR(y,t)  +clR  =  0  (102) 


and  decay  exponentially  everywhere  (as  x,  y  —  >•  oo)  apart  from  the  curve  C  —  0  given  by 
(102).  We  call  such  line  solitons  as  "curved  solitons".  As  (m\R)p{R  is  arbitrary,  one  can 
choose  it  conveniently  as 

=  sech2(Pl*)  (103) 


so  that  a  one  dromion  solution  for  the  potential  V  becomes 

V  =  2klRsech2(plR}sech2(XiR  +  VO-  (104) 

One  calls  such  localized  solutions  as  "induced  dromions"  as  they  are  induced  by  virtue 
of  the  arbitrary  function  present  in  the  system.  One  can  indeed  construct  a  large  class  of 
localized  solutions  for  the  potential  by  choosing  the  arbitrary  function  properly. 

6.5  (2+1)  dimensional  sine-Gordon  equation 
The  system  (31)  under  the  transformations 

9  =  ~0,,     '  =  -f*«  (105) 

can  be  recast  in  the  form 

0>  (106a) 

0,  (106b) 

(106c) 

Again  the  P-property  can  be  established  algorithmically  [30].  One  then  obtains  the 
bilinearizing  transformation 

,-i,     r  =  J,  (107.) 

p  =  2d,(log4,)+2  I    mi  (?',  i)  dg  +  2  f    m,  (rf,t)  Arf  (107b) 

J—  oo  J—  oo 

so  that  the  Hirota  bilinear  form  of  eq.  (106)  becomes 

^  +  mi(Tj)f)M  =  0>  (108a) 

g-(j>  +  mi  (77,  i)h>(f>  =  0,  (108b) 

=  -2gh,  (108c) 

hDrpd-t  -  gD(Dt<t>-<l>  =  0,  (I08d) 

where  ml(rj,t)  and  m2(^t)  are  certain  arbitrary  functions  which  can  be  identified  with 


Considering  the  2DSGI  (a2  =  +1)  case,  we  note  that  the  presence  of  nonzero  mi  and 
mi  are  essential  for  the  formation  of  line  kink  solitons  as  well  as  localized  solutions. 
Expanding  g,  h  and  0  in  a  suitable  formal  power  series,  one  obtains  the  line  kinks  for  the 
case  in  which  m\  and  mi  are  non-zero  constants  as 


l+exp(2Xl+2«)' 
where  p\  ,  q\  and  S  are  constants  and 

(110) 


Then  we  have 

0&T/,f)=4tairl[«p(;a  +  a)].  (Ill) 

Taking  mi  and  mi  in  the  form 

?  (H2a) 


one  ultimately  obtains  the  (1,1)  dromion  solution  [30,  37] 

0_4tan-i 


6.6  The  (2+1)  dimensional  long  dispersive  wave  (2DLW)  equation 

Considering  the  2DLDW  equation  (32),  we  find  that  again  the  P-property  is  true  and  that 
with  the  transformation 


it  can  be  bilinearized  in  the  form 


Analysis  of  (115)  leads  to  the  line  soliton  solutions  of  the  form 

t-n*  +  JIT;  -  (p?/A)f  +  cQ 


, 

U       } 

l         } 
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while  the  potential  v  is  described  by  the  line  soliton 


v  = 


2i(Xi  4-  tf  +  W),  (H6c) 


where 

p2  p12 

Xi  =  -Pi*  +  si??  -  y*  +  ci,     xi  =  -pi*  +  ^17  +  -j-  1  +  ci, 

exp(2^)  =  7  -  ^  -  jr-  .  (1  16d) 

PV     '      (pi+Pi)(*i  +  *i) 

We  also  note  that  the  composite  field  is  given  by 

»  +  *'.+20-  (117) 


Further  in  the  limit  (  s\  +  ^  )  —  >  0  (or  equivalently  at  the  boundary  77  =  -oo),  one  has  the 
ghost  soliton 

'2  (118) 

The  above  features  indicate  that  it  is  not  the  physical  fields  q  and  r  that  will  get  localized, 
but  it  is  rather  the  composite  field  Q  =  "qr"  which  will  get  localized.  Indeed  with  the 
choice 


(H9a) 

2 

Xi=P\x-?~  +  ci,     X2  =  siT?  +  c2, 
one  obtains  the  (1,1)  dromion  for  the  composite  field  as 


[1  +exp(xi  )  +exp(%2)  +K  exp(xi 


(120) 

More  general  (1,AT)  dromions  can  be  given  by 

-  K)Pl 


<ar\ 

1    ^      [l+ 

Again,  it  should  be  mentioned  that  one  can  indeed  generate  induced  localized  solutions 
for  the  composite  field  by  utilising  the  arbitrary  function  f(q)  present  in  the  system. 

7.  Conclusions 

In  this  paper,  we  have  investigated  the  type  of  basic  nonlinear  coherent  structures  that  can 
arise  in  a  class  of  nonlinear  evolution  equations  in  (2  +  1)  dimensions.  In  contrast  to  the 
ordinary  solitons  in  (1  +  1)  dimensions,  it  was  pointed  out  that  much  richer  structures  can 
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arise  in  (2  +  1)  dimensions:  line  solitons,  lump  solitons,  exponentially  localized 
dromions  and  their  generalizations  and  so  on.  In  particular,  we  have  discussed  in  detail 
certain  algorithmic  methods  of  obtaining  the  localized  dromion-like  solutions,  either  in 
the  field  variables  or  in  the  potential  variables  or  even  in  the  composite  variables.  In  all 
these  cases,  the  relevance  of  boundary  contributions  to  form  localized  structures  was 
stressed. 

The  necessity  of  boundary  contributions  to  form  exponentially  localized  solutions  also 
justifies  the  type  of  NLEEs  we  have  discussed  here,  namely  all  of  them  have  nonlocal 
terms  or  equivalently  extra  fields  associated  with  them.  Simple  minded  (2+1) 
dimensional  (local)  generalizations  of  (1  +  1)  dimensional  soliton  equations  do  not 
seem  to  admit  exact  analytical  localized  structures  in  general.  More  work  needs  to  be 
carried  out  to  understand  the  basic  structures  in  such  systems  and  the  analysis  of  such 
systems  will  constitute  an  important  area  of  development  in  nonlinear  dynamics  in  future. 
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Abstract.  We  briefly  review  the  nonlinear  dynamics  of  diverse  physical  systems  which  can  be 
described  in  terms  of  moving  curves  and  surfaces.  The  interesting  connections  that  exist  between 
the  underlying  differential  geometry  of  these  systems  and  the  corresponding  nonlinear  partial 
differential  equations  are  highlighted  by  considering  classic  examples  such  as  the  motion  of  a 
vortex  filament  in  a  fluid  and  the  dynamics  of  a  spin  chain.The  association  of  the  dynamics  of  a 
non-stretching  curve  with  a  hierarchy  of  completely  integrable  soliton-supporting  equations  is 
discussed.  The  application  of  the  surface  embeddability  approach  is  shown  to  be  useful  in  obtaining 
such  connections  as  well  as  exact  solutions  of  some  nonlinear  systems  such  as  the  Belavin- 
Polyakov  equation  and  the  inhomogeneous  Heisenberg  chain. 
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1.  Introduction 

The  connection  between  differential  geometry  and  certain  nonlinear  partial  differential 
equations  (NLPDE)  has  attracted  the  attention  of  both  physicists  and  mathematicians 
for  over  a  century  now.  Nineteenth  century  geometers  had  shown  such  a  relationship 
between  the  structure  of  minimal  surfaces  (i.e.,  surfaces  whose  mean  curvature  is 
zero)  and  the  hyperbolic  Liouville  equation.  Another  classic  example  is  the  connection 
[1]  between  the  geometry  of  pseudospherical  surfaces  (i.e.,  surfaces  with  negative 
constant  Gaussian  curvature)  and  the  sine-Gordon  equation.  This  result  achieved  a 
special  significance  many  years  later,  when  it  was  realized  that  these  equations 
were  integrable  by  the  powerful  inverse  scattering  transform  (1ST)  method  discovered 
in  1967  by  Gardner  et  al  [2].  This  observation  inspired  a  lot  of  interesting  work  on 
the  relationship  [3]  between  the  differential  geometry  of  special  surfaces  and 
integrable  soliton-supporting  equations  [4].  Around  the  same  time,  the  differential 
geometry  of  moving  curves  [3]  and  their  relation  to  integrable  equations  was  also 
studied.  Interestingly,  this  latter  connection  first  arose  in  the  context  of  a  physical 
application  in  the  field  of  hydrodynamics  [5].  Pioneering  work  by  Hasimoto  [5]  showed 
mat  the  equation  of  motion  of  a  vortex  filament  regarded  as  a  space  curve  was  equivalent 
to  the  well-known,  integrable  nonlinear  Schrodinger  equation  (NLSE)  [4].  Using 
the  Hasimoto  transformation  that  relates  space  curves  and  complex  curvature 
functions,  Lamb  [6]  generalized  the  above  result  by  demonstrating  the  link  between 
the  motion  of  certain  space  curves  and  soliton-bearing  equations.  Soon  afterwards, 
Lakshmanan  [7]  showed  that  the  Landau-Lifshitz  equation  which  describes  the 
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continuum  dynamics  of  the  classical  ferromagnetic  Heisenberg  chain  was  gauge- 
equivalent  to  the  NLSE  . 

In  recent  years,  there  has  been  renewed  interest  in  looking  for  such  connections.  It  has 
been  shown  that  the  dynamics  of  a  non-stretching  planar  curve  can  be  connected  with  the 
integrable,  modified  Korteweg-de  Vries  (MKdV)  hierarchy  [8].  Another  example  is  the 
Belavin-Polyakov  equation  [9]  satisfied  by  a  unit  vector  field,  which  appears  in  a  variety 
of  physical  applications  such  as  the  nonlinear  sigma  model,  the  two-dimensional 
Heisenberg  ferromagnet,  etc.  Its  equivalence  to  the  exactly  solvable  elliptic  Liouville 
equation  has  been  established  [10].  A  surface  embeddability  approach  has  been 
demonstrated  to  be  useful  in  finding  explicit  solutions  to  the  spin  evolution  equation  of  an 
inhomogeneous  Heisenberg  chain  [11].  In  this  paper,  we  briefly  review  some  of  the 
above-mentioned  examples  of  nonlinear  dynamical  systems  which  can  be  described 
geometrically.  The  purpose  is  to  illustrate  how  the  application  of  the  classical  differential 
geometry  of  curves  and  surfaces  can  be  used  to  find  such  surprising  connections  as  well 
as  exact  dynamical  solutions. 

2.  Space  curves  and  surfaces 

We  begin  by  summarizing  some  of  the  salient  features  of  the  space-curve  formalism  and 
surface  theory,  which  will  be  useful  in  the  applications  to  be  considered  in  subsequent 
sections. 

2.1  Time  evolution  of  twisted  space  curves 

A  static  space  curve  in  E3  is  described  either  by  its  parametric  equations  or  by  its  natural 
equations:  K  =  K(s]  and  r  =  r(s),  where  K,  r  and  s  are  the  curvature,  the  torsion  and  the 
arc  length  of  the  space  curve.  Consider  a  curve  which  in  its  parametric  form  is  described 
by  the  position  vector  x  =  x(s).  Let  t(s]  -  dx/ds  be  the  unit  tangent  to  this  curve,  and  n 
and  b  the  unit  normal  and  binormal,  so  that  t,  n  and  b  form  an  orthogonal  triad  on  the 
curve.  They  satisfy  the  well  known  Frenet-Serret  equations  [1] 

ts=Kii,  ns  = -£t  +  rb,  bs  =  -rn  (2.1) 

where  the  subscript  denotes  (d/ds)  and  K  and  r  are  given  by 

K2=ts-ts,  r  =  t-(t,  xtss}/K2.  (2.2) 

Let  this  curve  evolve  in  time  t  in  such  a  way  that  the  local  'velocity'  at  s  is  specified  by 

V  =  (dx(s,  t]/dt)  =  x,  =  at  +  j3n  +  7b.  (2.3) 

The  scalar  functions  a,  /3  and  7  represent  the  tangential,  normal  and  binormal 
components  of  V.  As  we  shall  see,  their  functional  form  will  be  determined  by  the 
physics  of  the  model  under  consideration. 

There  is  another  aspect  of  curve  motion.  As  the  tangent  to  a  static  curve  is  xs  =  t  by 
definition,  it  is  clear  that  in  any  physical  application  involving  moving  curves,  it  is 
possible  to  write  down  the  time  evolution  of  t  from  (2.3)  by  using 

t/  =  Xs,.  (2.4) 
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In  some  other  models  (like  spin  chains),  t,  can  be  written  down  directly.  In  general,  since 
t  is  a  unit  vector, 

tt  =  gn.  +  fcb,  (2.5) 

where  g  and  h  will  be  local  functions  of  the  curvature  K  and  the  torsion  r.  Further,  when 
the  compatibility  conditions 

ttt  =  tir,     ntt  =  nst,     bts  =  bst  (2.6) 

are  imposed,  one  typically  obtains  coupled  nonlinear  partial  differential  equations  for  K 
and  T.  In  many  applications,  these  turn  out  to  be  well-known  exactly  solvable  equations. 
In  such  cases,  the  analysis  shows  that  the  above  property  arises  essentially  due  to  certain 
special  geometrical  features  with  which  the  curve  evolution  is  endowed. 


2.2  Surface  theory 

Consider  a  surface  generated  by  a  position  vector  r(w,  v)  which  is  a  function  of  two 
parameters  (local  coordinates)  u  and  v.  The  metric,  i.e.,  the  first  fundamental  form,  is 
given  in  the  usual  notation  [1]  as  /  =  (dr)2  =  E(du)2  +  2jFdwdv  +  G(dv)2,  where 
E  =  (rM)2,  F  =  rM-rv  and  G  =  (rv)2.  The  unit  normal  to  the  surface  is  defined  by 
n  =  (ru  x  rv)/|rM  x  rv|.  The  extrinsic  curvature  tensor,  i.e.,  the  second  fundamental  form 
is  defined  as  //  =  —  dr-dn  =  L(dw)2  +  2Md«dv  +  Af(dv)2,  where  L  =  n-rMM,  Af  =  n-rMV 
and  N  =  n-rvv.  At  every  point  on  the  surface,  we  can  introduce  a  moving  trihedron  made 
up  of  the  three  linearly  independent  vectors  ru,  rv  and  n.  Since  every  vector  can  be 
linearly  expressed  in  terms  of  these  three  basis  vectors,  one  obtains  the  following  Gauss- 
Weingarten  (GW)  equations  [1]: 

r««  =  rJ]rtt+r21rv+Ln  (2.7a) 

ruv  =  Tluru+r2l2rv  +  Mn  (2.7b) 

i-w  =  r22rM  +  r^2rv  +  Nn  (2.7c) 

(2.7d) 
(2.7e) 


where  the  expressions  for  the  Christoffel  symbols  F»  and  the  coefficients  /?/,  qi,  i=  1,2 
are  given  below  for  ready  reference 

Tjj  =  [GEU  -  2FFU  +  FEV]/2A;     r{2  =  [GEV  -  FGU]/2A 

T122  =  [2GFV  -  GGU  -  FGV]/2A;     1^  =  [2EFU  -  EEV  -  FEU]/2A 

T22  =  [EGU  -  FGV]/2A;     T\2  =  [EGV  -  2FFV  +  FG«]/2A 

pi  =  [MF  -  LG]/A;     P2  =  [LF  -  ME]  /A. 

qi  =  [NF  -  MGj/A;     q2  =  [MF  -  NE]/A,  (2.8) 

where  A  =  (EG  —  F2).  The  Gauss-Mainardi-Codazzi  (GMC)  equations  [1]  are  obtained 
from  the  compatibility  conditions  rMKV  =  ruvu  and  rMVV  =  rvvu.  We  will  write  down  the 

Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  191 


Radha  Balakrishnan 

relevant  equations  when  we  discuss  specific  applications.  The  fundamental  theorem  of 
surfaces  states  that  if  the  surface  parameters  £,.F,  G,L,M  and  N  satisfy  the  GMC 
equations,  then  the  corresponding  surface  r(u,  v)  is  uniquely  determined  except  for  its 
position  in  space.  We  will  see  how  this  can  be  used  in  applications  where  the  GMC 
equations  turn  out  to  be  exactly  solvable  equations,  with  the  surface  parameters 
determined  exactly  as  explicit  functions  of  u  and  v. 


3.  Vortex  filament  motion  and  the  Hasimoto  transformation 

It  has  been  observed  that  vortex  filaments  in  a  fluid  very  often  preserve  their  identity 
when  they  move.  In  this  section,  we  summarize  Hasimoto's  work  [5]  which  shows  that  on 
using  a  local  induction  approximation  (to  be  explained  below),  the  propagation  of  a 
soliton  along  a  filament  can  be  obtained. 

A  thin  vortex  filament  in  a  fluid  can  be  regarded  as  a  space  curve.  The  fluid  is 
incompressible  and  inviscid  and  has  a  non-vanishing  vorticity 

w(x)=vxV(x),  (3.1) 

where  V(x)  is  the  fluid  velocity  at  a  point  with  position  vector  x  on  the  filament. 
Equation  (3.1)  implies  the  following  Biot-Savart  law: 

V(x)  =  [  dVw(x')  x  (x  -  x')/|x  -  xf.  (3.2) 

Thus  the  filament  is  transported  by  the  velocity  induced  by  its  own  vorticity.  For 
simplicity,  it  is  assumed  that  w(x)  is  tangential  to  the  filament  at  every  point.  Since  the 
vortex  has  a  thin  core,  the  volume  integral  in  (3.2)  can  be  replaced  by  a  line  integral  as 
follows: 

V/(x)  -  j  t(x')  x  (x  -  x')/|x  -  xfd/,  (3.3) 

where  V/  is  the  induced  velocity  and  x  =  x(s,  t]  is  the  position  vector  of  a  point  on  the 
filament  curve  at  time  t  .  Because  of  the  thinness  of  the  core,  the  contribution  of  the  local 
portion  of  the  filament  dominates  the  Biot-Savart  integral.  Expanding  x'  in  a  Taylor 
series  around  x(ji,  ?), 


xV,  0  -  x(*,  t)  +  (s1  -  s)xs  +  (l/2)(y  -  j)2x«  +  0(s'  -  s)3.  (3.4) 

Since  x,  =  t  and  x«  =  t,  =  Kn  from  (2.1),  substituting  (3.4)  in  (3.3)  yields 

V7  ~  K(t  x  n)  =  £b,  (3.5) 

Comparing  (3.5)  with  (2.3),  we  see  that  in  this  'local  induction  approximation',  the 
velocity  V/  =  xf  of  the  vortex  filament  has  no  tangential  and  normal  components  but 
only  a  component  along  the  binormal,  proportional  to  the  curvature  K.  Equation  (3.5) 
leads  to 

t,  =  Ksb  -  Km,  (3.6) 


Kt  =  -2Ksr-Krs  .  (3.7a) 

rt  =  [(Kss/K}-T2}s-KKs.  (3.7b) 

On  using  the  Hasimoto  transformation 


(3.8) 

Equations  (3.7)  reduce  to  the  nonlinear  Schrodinger  equation  (NLSE) 

ty,  +  V»«  +  (l/2)MV  =  0  (3.9) 

whose  exact  solutions  are  known  [4],  The  NLSE  is  a  completely  integrable  evolution 
equation,  associated  with  an  infinite  number  of  integrals  of  motion,  strict  Af-soliton 
solutions,  etc.  From  the  soliton  solution,  the  curvature  and  torsion  of  the  filament  can  be 
read  off  using  (3.8  ).  Thus,  the  local  induction  approximation  is  seen  to  lead  to  a  shape- 
and  velocity-preserving  motion  of  a  helical  vortex  filament  in  the  fluid.  It  is  interesting 
that  such  a  motion  of  vortex  filaments  has  been  experimentally  observed  in  certain  types 
of  fluids.  Finally,  this  example  shows  that  the  special  geometry  underlying  the  completely 
integrable  NLSE  is  encoded  in  the  simple  geometric  relationship  xf  =  Kb  [Eq.  (3.5  )]  for 
the  local  velocity  of  a  space  curve  moving  in  E3.  More  recently,  it  has  been  shown  [12] 
that  going  beyond  the  local  induction  approximation  can  lead  to  a  connection  with  more 
general  soliton-bearing  equations. 

4.  Dynamics  of  the  classical  Heisenberg  ferromagnetic  chain 

The  subject  of  quasi-one  dimensional  magnets  with  various  symmetries  is  interesting 
because  such  systems  can  be  fabricated  in  the  laboratory.  In  this  section  we  review  the 
isotropic  ferromagnetic  chain. 

Consider  a  one-dimensional  lattice  with  classical  spins  S,-  of  constant  magnitude  (i.e., 
(S,-)  =  S2  =  1)  at  every  site  i,  with  the  nearest  neighbour  interaction  Hamiltonian 

>0.  (4.1) 

The  spin  evolution  equation  derived  from  (4.1)  is  (dS,-/dr)  =  7S,-  x  (S,-+i  +  S,-_i).  On 
taking  the  continuum  limit  S/  —>•  S(x,  t),  this  reduces  to  the  nonlinear  partial  differential 
equation  (  the  Landau-Lifshitz  equation) 

S,  =  7S  x  S«,  (4.2) 

where  the  subscripts  denote  partial  derivatives.  The  underlying  geometry  of  (4.2)  is  most 
conveniently  analysed  by  identifying  S  with  the  unit  tangent  to  a  curve  [13],  so  that  the 
Landau-Lifshitz  equation  describes  a  moving  space  curve.  (Note  that  the  instantaneous 
spin  configuration  on  the  chain  can  be  mapped  to  a  corresponding  space  curve  at  that 
instant).  Thus  (4.2)  can  be  cast  in  the  form  (after  rescaling  time  t  to  Jt) 

t,  =  t  x  tM.  (4.3) 
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Pioneering  work  by  Lakshmanan  [7]  showed  that  Eq.  (4.3)  is  gauge-equivalent  to  the 
NLSE.  The  following  interesting  connection  between  the  present  interacting  spin 
problem  and  the  (seemingly  unrelated)  vortex  filament  motion  considered  in  §  3  is  worth 
noting:  The  filament  evolution  xf  =  Kb  (Eq.  (3.5))  when  differentiated  with  respect  to  s 
leads  to  (4.3)!  Thus  (4.3)  also  yields  the  same  coupled  equations  (3.7)  found  in  the  fluid 
mechanical  context,  and  thence  to  the  NLSE.  The  soliton  solutions  play  the  role  of 
nonlinear  excitations  in  the  spin  chain  [7].  It  must  be  noted  that  the  solution  of  NLSE 
contains  information  about  the  energy  and  current  densities  [13].  In  this  example  we  have 
seen  that  the  geometry  of  the  NLSE  is  encoded  in  the  special-  time  evolution  of  the 
tangent  vector  as  given  by  (4.3). 

.Before  concluding  this  section,  we  mention  that  there  are  some  extensions  of  the 
Heisenberg  chain  which  can  be  analysed  through  the  space  curve  formalism-for  instance, 
the  inclusion  [14]  of  the  Landau-Gilbert  damping  term  in  the  Hamiltonian.  Integrability 
in  higher  dimensions  for  the  spherically  symmetric  case  has  also  been  recently 
investigated  [15]. 

5.  Dynamics  of  nonstretching  planar  curves 

There  are  many  physical  applications  involving  curve  dynamics  which  preserve  certain 
global  geometric  quantities  such  as  the  length  of  an  open  curve  or  the  area  enclosed  by  a 
closed  curve.  Examples  are  polymers,  cell  membranes  or  closed  vortex  filaments.  An 
interesting  paper  by  Goldstein  and  Petrich  [8]  considers  the  dynamics  of  curves  on  a 
plane.  In  this  case  the  torsion  r  =  0,  and  Eqs  (2.1)  become 

ts  =  Kn,     ns  =  -Kt.  (5.1) 

Now 

x,  =  t/n  +  Wt,  (5.2) 

where  U  and  W  are  the  normal  and  tangential  components  of  the  velocity.  If  these  are 
chosen  to  be  local  functions  of  the  curvature  K  and  its  derivatives,  i.e., 

U=U(K,KS,KSS...)     and    W  =  W(K,KS,KSS..  .), 
we  get  from  (5.2) 

xs/  =  tr  =  (Ws  -  KU)t  +  (Us 
Since  t.tf  =  0,  this  leads  to 


WS  =  KU,     W  =      KUds  +  C,  (5.3) 

where  C  =  constant.  Hence,  in  general, 

t,  =  (Z/,+AW)n.  (5.4) 

At  this  stage,  one  poses  the  following  question:  How  should  a  planar  curve  evolve  in  time 
so  as  to  keep  its  length  L  =  Jds  fixed?  One  way  to  ascertain  Lt  =  0  is  to  ensure  that 
ds/dt  =  0  so  that  the  curve  does  not  'stretch'  as  time  evolves.  This  imposes  the 
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compatibility  conditions  tst  =  tts  and  nst  =  nts.  Using  (2.1),  (5.3)  and  (5.4)  in  these 
yields  the  following  evolution  equation  for  K: 

KUds  +  C    =  -fit/,  (5.5) 


where  the  operator  fi  is  given  by 

n  =  (_a,,  H-  A:2  -f  W;1*)  (5-6) 

with 


d~lKU=   I  KUds  +  C. 


It  is  clear  that  the  evolution  of  the  curve  is  decided  by  the  choice  of  U  and  W  in  (5.2). 
Since  U  and  W  are  related  by  (5.3)  in  general,  and  W  should  be  a  local  function,  either  U 
should  vanish  or  KU  must  be  a  total  derivative.  The  simplest  choice  U  =  £/(1)  =  0  gives 
W  =  WW  =  C  and 

Kt  =  -CKS.  (5-7) 

This  is  a  linear  equation  representing  the  trivial  motion  of  the  curve  along  itself 
with  velocity  C.  (Note  that  the  velocity  x,(1)  =  Ct  has  only  a  tangential  component).  The 
next  choice  U  =  £/(2)  =  -Ks  gives  W  =  W&  =  (±£2)  +  C,  giving  the  modified 
Korteweg-de  Vries  (MKdV)  equation 

Kt^-Klu  +  (3/2)K2Kt  (5-8) 

for  the  choice  C  =  0.  This  is  a  completely  integrable  equation.  The  one-soliton  solution 
K(s,  0  =  v7"  secn  (Vu(s  -  "0)  represents  a  moving  localized  profile  (lump)  traveling 
along  the  curve  with  velocity  u  >  0.  Note  that  the  velocity  x}2)  =  -Ksn  -  (|£"2)t  has 
both  tangential  and  normal  components. 

Proceeding  likewise,  i.e.,  by  choosing  the  right-hand  side  of  each  equation  for  K  to  be 
the  normal  velocity  U  of  the  succeeding  equation,  a  hierarchy  of  equations  is  obtained, 
the  next  one  being 

Kt  =  -Ksssss  -  (15/8)*%  -  (5/2)JKj  -  (5/2}K2Kss  -  WKKSKSS,         (5.9) 

for  the  choice  U  =  lX3>  =  ^w  +  \K2KS. 

Note  that  the  order  of  the  equation  increases  at  each  stage.  If  the  curve  is  closed,  the 
area  enclosed  is  A  =  £  §  (x  x  x^dj,  giving  At  -  f  Uds.  Since  £/  is  a  total  derivative  for 
all  members  of  the  hierarchy,  the  corresponding  evolutions  preserve  not  only  the 
perimeter  but  also  the  area  enclosed.  In  addition,  due  to  complete  integrabiliry,  the 
dynamics  also  possesses  an  infinite  number  of  integrals  of  the  motion  which  are 
polynomials  in  the  curvature  and  their  derivatives.  (There  could  of  course  be  other 
choices  like  U  =  KnKs,  W  —  -Kn+2/(n  +  2),  which  would  also  conserve  these  global 
quantities,  but  it  is  not  known  whether  the  resulting  dynamics  would  be  integrable).  In 
conclusion,  the  recursion  operator  fi  defined  in  (5.6)  has  a  universal  geometrical 
significance  in  defining  the  curvature  evolution  under  arclength-conserving  dynamics. 

It  has  also  been  pointed  out  [6,  16,  17]  that  the  Frenet-Serret  equations  are  equivalent 
to  the  Ablowitz-Kaup-Newell-Segur  (AKNS)  scattering  problem  in  the  1ST  with  zero 
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eigenvalue.  We  conclude  this  section  by  mentioning  that  Langer  and  Perline  [18]  have 
shown  that  the  dynamics  of  a  nonstretching  filament  regarded  as  a  non-planar  space  curve 
can  give  rise  to  an  analogous  recursion  operator  for  the  NLSE  hierarchy,  on  using  the 
Hasimoto  transformation. 


6.  The  Belavin-Polyakov  equation 

The  Belavin-Polyakov  equation  [9] 

my  =  m.x  x  m,     (m)2  =  1  (6.1) 

arises  in  several  physical  applications.  In  these  coupled  nonlinear  partial  differential 
equations,  both  the  independent  variables  x  and  y  could  be  spatial  variables,  or  one  spatial 
and  the  other  temporal,  depending  on  the  physical  context.  The  static  solutions  of  the 
nonlinear  sigma  model  in  (2  +  1)  dimensions,  the  static  two-dimensional  Heisenberg 
ferromagnet,  and  the  Heisenberg  antiferromagnet  in  (1  +  1)  dimensions  [19]  are 
examples  where  the  relevant  configurations  are  classical  unit  vectors  m  satisfying  (6.1). 
Let  us  first  apply  the  moving  space  curve  formulation  to  this  problem  [10].  It  is 
possible  to  identify  m  with  the  tangent  vector  to  a  space  curve  which  'evolves'  either 
spatially  or  temporally,  depending  on  whether  y  is  a  space  variable  or  a  time  variable. 
Thus  (6.1)  can  be  written  in  the  form 

t«  =  t,  x  t  =  -Kb,  (6.2) 

on  using  (2.1).  Writing  nu  =  a\t  +  a2b  and  bf(  =  git  +  #2n,  the  compatibility  conditions 
(2.6)  determine  a\  =  0,  a2  =  (-KS/K),  gi  =  K  and  g2  =  (KS/K}.  Further,  the  coupled 
evolution  equations  for  K  and  r  are 

Ku  —  Kr  (6.3a) 

and 

ru  =  (-Ks/K}s-K2.  (6.3b) 

These  equations  combine  to  give 

(KU(K\  +  (KS/K}S  =  -K2,  (6.3c) 

which  becomes  the  elliptic  Liouville  equation 

fas  +  <t>uu  =  -exp  (20),  (6.3d) 

on  setting  0  =  In  K.  Since  the  exact  general  solution  of  (6.3d)  is  known  [10],  the  exact 
solutions  for  the  curvature  K  =  exp  0  and  T  =  fa  of  the  evolving  space  curve  can  be 
determined.  Thus  the  use  of  classical  differential  geometry  of  space  curves  has  enabled  us 
to  establish  [10]  the  exact  solvability  of  (6.2),  by  connecting  it  to  the  exactly  solvable 
elliptic  Liouville  equation. 

This  connection  can  also  be  found  [10]  using  surface  theory.  By  identifying  the 
solution  rnfcr)  of  (6.1)  with  a  position  vector  r(w,v),  it  is  possible  to  describe  a 
surface.  The  relevant  expressions  for  the  first  and  second  fundamental  forms  have 
been  given  in  §2.  Using  them  in  conjunction  with  eq.  (6.1),  we  see  that  F  =  0  and 
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G  =  (my)  =  (mx  x  m)  =  E.  Thus  the  metric  has  a  conformal  ("isothermal"}  form. 
The  unit  normal  to  the  surface  n  is  defined  as  in  §  2.  From  (6.1),  we  obtain  (m^  x  my)  = 
— Em.  Hence  Jin*  x  m,,)  =  E.  A  short  calculation  shows  that  the  second  fundamental 
form  parameters  are  given  by  L  =  N  =  E  and  M  =  0.  Thus  the  Gaussian  curvature 
K=(LN-  M2}/(EG  -  F2}  and  the  mean  curvature  H  =  \(EN  +  GL-  2FM)/ 
(EG  —  F2)  are  both  constant.  With  m(jc,  v)  as  the  position  vector  creating  the  surface, 
the  Gauss-Weingarten  equations  (Eqs  (2.7))  become 

m^  =  (Ex/2E}mx  -  (Ey/2E}my  -  Em 

mxy  =  (Ey/2E)mx  +  (Ex/2E)YKiy 

myy  =  -(Ex/2E}mx  +  (Ey/2E}my  -  Em 

nx  =  -mx 

ny  =  —my  (6.4) 

We  now  obtain  the  Gauss-Mainardi-Codazzi  equations  [1]  from  the  various 
compatibility  conditions  as  mentioned  in  §  2.  Note  that  nxy  =  nyx  is  automatically 
satisfied,  while  the  conditions  m^  =  m^  and  m^,  =  myyx  both  reduce,  after  a  short 
calculation,  to  the  following  equation  for  the  coefficient  E  of  the  conformal  metric: 

(Ey/2E}y  +  (EX/2E}X  =  -E.  (6.5) 

Defining  (f>  =  |lnE,  the  above  equation  becomes  the  elliptic  Liouville  equation  (6.3d).  Its 
general  solution  can  be  written  down  as 

0(*,v)  -  iln{-4A'(z*X(z)/W)  +B(Z}}2}  (6.6) 

where  z  =  x  +  iy,z*  =  x  —  iy,A'(z*)  =  dA/dz*,  B'(z)  =  dB/dz  and  A,  B  are  arbitrary 
functions.  Hence 

E(*,y)  =  -4A'(z*)B'(z)/[A(z*)  +B(z)f.  •  (6.7) 

The  fundamental  theorem  of  surfaces  states  that  if  the  metric  and  curvature  tensors 
satisfy  the  Gauss-Mainardi-Codazzi  equations  (which  in  the  present  case  is  the  elliptic 
Liouville  equation),  then  the  Gauss-Weingarten  equations  can  in  principle  be  integrated 
to  determine  the  surface  (specified  by  m(jc,  v))  uniquely,  up  to  a  rotation  and  translation 
in  space.  Therefore  (6.1)  is  exactly  solvable.  Some  interesting  special  solutions  have  been 
found  in  Ref.  [19]. 

It  can  be  shown  that  the  basic  equations  (6.3a)  and  (6.3b)  determined  using  the  'direct' 
moving  curve  method  can  be  combined  into  the  following  equation  for  the  complex 
Hasimoto  function  q  =  K exp  (i  J^  rds): 

\q\2ds  =  0.  (6.8) 

This  equation  had  in  fact  been  obtained  earlier  [19]  in  another  context.  A  short 
calculation  verifies  that  its  imaginary  part  yields  (6.3a),  while  its  real  part  gives  (6.3b). 
These  equations  when  combined  become  the  elliptic  Liouville  equation  (6.3d),  as  we 
have  already  shown  above. 
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7.  The  inhomogeneous  Heisenberg  chain 

The  examples  discussed  in  the  previous  sections  show  how  classical  differential 
geometry  can  be  used  as  an  effective  tool  to  find  the  connection  of  a  nonlinear  system 
described  by  a  vector  evolution  equation  with  a  completely  integrable  NLPDE  or 
with  one  whose  exact  general  solution  is  known.  As  mentioned  in  the  introduction, 
while  these  connections  are  indeed  useful  in  predicting  similar  properties  for  the 
corresponding  vector  evolution  equation,  finding  the  explicit  solution  of  the  latter  is  not 
always  an  easy  task.  In  many  physical  situations,  finding  special  exact  solutions  explicitly 
would  be  of  interest  (the  corresponding  nonlinear  system  need  not  necessarily  be  always 
completely  integrable).  In  this  section  we  consider  the  example  of  the  inhomogeneous 
Heisenberg  chain  to  show  how  exact  solutions  can  be  found  using  an  appropriate 
geometric  approach  [11]. 

Several  years  ago,  this  author  proposed  a  model  Hamiltonian  [20] 

S2  =  l,  (7.1) 

where  Si  denotes  the  spin  vector  and  /;  the  site  dependent  (inhomogeneous)  exchange 
interaction  between  the  spins.  The  continuum  evolution  for  S  was  found  to  be 

S,  =  (/SxS,),,     S2  =  l,  (7.2) 

By  associating  S  with  the  tangent  to  a  moving  curve,  it  was  shown  that  the  above  equation 
is  gauge-equivalent  to  a  generalized  nonlinear  Schrodinger  equation  (GNLSE)  of  the 
form 


J~ 


/ 

J— 


=  0.  (7.3) 

Here  q  is  given  essentially  by  the  Hasimoto  function  defined  in  (3.8),  namely, 

(7.4) 

with  the  curvature  K  and  the  torsion  r  of  the  curve  being  given  by 

*2  =  (SX)2,     r  =  S-(SxxSxx)/K,2  (7.5) 

respectively.  There  are  only  two  forms  of  f(x)  for  which  the  conventional  1ST  method  is 
directly  applicable  to  the  above  GNLSE,  viz.,  (i)  when/  is  a  constant  (in  which  case  it 
reduces  to  the  usual  NLSE),  and  (ii)  when/  is  a  linear  function  [21]  of  jc.  All  other  /(jc) 
fall  into  a  special  class  [20].  In  a  interesting  recent  paper,  Cieslihski  et  al  [22]  have  used 
some  ideas  from  Sym's  surface  approach  [3]  to  find  exact  solutions  of  (7.2)  and  (7.3)  for 
several  other  forms  of  the  inhomogeneity  function  f(x)  .  Representing  S  as  the  spatial 
derivative  of  a  vector  r,  i.e.,  setting 

S  =  r*,  (7.6) 

it  can  be  verified  that  the  kinematic;  equation 

rr=/(r,xr«),     r2  =  1.  (7.7) 


implies  Eq.  (7.2).  Assuming  the  surface  generated  by  the  position  vector  r(jc,  r)  to  be 
equipped  with  a  metric  of  the  geodesic  form,  it  was  shown  [22]  that  the  Gauss-Mainardi- 
Codazzi  (GMC)  equations  [1]  for  this  surface  can  be  cast  in  the  same  form  as  (7.3),  upon 
making  appropriate  identifications  between  q  and/  on  the  one  hand,  and  the  coefficients 
of  the  first  and  second  fundamental  forms  (i.e.,  the  metric  and  the  extrinsic  curvature, 
respectively)  on  the  other.  By  specializing  to  surfaces  of  revolution,  it  was  demonstrated 
that  the  equations  for  the  geodesies  on  the  surface  could  be  integrated  explicitly  for 
certain  cases,  and  an  algorithm  was  found  for  obtaining  both  /  and  the  corresponding 
solution  S.  When  the  geodesic  coordinates  were  taken  to  be  x  and  /,  the  solutions  called 
'spins-on-meridians'  could  be  found  for  a  wide  variety  of  time-independent,  bounded, 
positive  functions /(x),  i.e.,  inhomogeneous  ferromagnetic  couplings. 

In  what  follows  we  use  [11]  a  geometric  formulation  different  from  the  above 
approach  in  the  sense  that,  although  we  too  identify  r(x,  t]  with  a  position  vector  that 
generates  a  surface,  we  do  not  compare  the  GMC  equations  to  (7.3),  nor  do  we  use 
the  general  equations  of  geodesies.  Our  strategy  is  to  regard  the  basic  kinematic 
equation  (7.7)  for  r  as  a  constraint  on  the  surface  generated.  Such  a  constraint  is 
expected  to  permit  only  certain  special  geometries  for  the  surface  [3].  Indeed,  for 
the  model  under  investigation,  the  surface  metric  is  shown  to  be  necessarily  of  the 
geodesic  form,  and  further,  certain  coefficients  of  the  two  fundamental  forms  get 
related  through  the  function  /.  Using  these  results  in  the  GMC  equations,  it  becomes 
possible  to  'integrate'  the  latter  and  find  the  expressions  for  L,  M  and  N  (the  coefficients 
of  the  second  fundamental  form)  as  well  as  the  corresponding  function  /  in  terms  of 
the  metric  coefficient  G,  its  ^-derivatives  and  two  (arbitrary)  integration  constants, 
for  time-independent  metrics.  It  is  then  demonstrated  that  the  solution  q  of  (7.3)  can 
also  be  written  in  terms  of  the  above-mentioned  quantities,  simply  by  re-expressing 
the  moving  curve  parameters  K  and  r  in  terms  of  surface  coefficients  by  using  the  Gauss- 
Weingarten  (GW)  equations  [1]  for  the  surface.  Thus  we  see  that  given  an  arbitrary 
metric  coefficient,  the  explicit  solution  for  q  along  with  the  corresponding  /  can  be 
written  down.  This  solution  q  is  a  complex  function  in  general,  and,  as  already  mentioned 
in  §4,  is  interesting  in  its  own  right  since  it  contains  information  [11]  on  the  energy  and 
current  densities  along  the  chain.  Furthermore,  on  inspecting  the  expressions  for  L,M 
and  N  obtained  from  the  GMC  equations,  it  is  readily  seen  that  they  correspond  to 
those  of  surfaces  of  revolution,  [1]  when  one  of  the  two  integration  constants  referred 
to  above  vanishes.  Also,  G1/2  plays  the  role  of  the  generator  of  revolution,  and  the 
explicit  solution  of  r(jc,  t)  can  be  written  down  in  this  limiting  case.  In  this  limit,  our 
expression  for  /(*)  reduces  to  the  result  obtained  [22]  in  the  geodesic  approach,  for  the 
'spins-on-meridians'  solutions.  The  corresponding  solutions  for  q  are  necessarily  real. 
For  certain  common  surfaces  of  revolution  like  the  sphere,  the  torus  and  the  catenoid,  the 
solution  S  of  Eq.  (7.2)  can  be  expressed  solely  as  a  functional  of  the  inhomogeneity 
function/  that  appears  in  the  equation,  essentially  because  the  surface  metric  can  also  be 
written  down  as  a  functional  of/  for  these  surfaces.  We  now  present  a  few  details  of  this 
analysis: 

We  start  with  the  kinematic  equation  for  r(x,  t)  Eq.  (7.7),  and  identify  r(x,  t)  with  a 
position  vector  generating  a  smooth  surface  in  £3,  with  local  coordinates  jc  and  t.  Note 
that  S2  =  r2  =  1  in  the  model.  From  the  evolution,  it  is  readily  verified  that  rx.rt  =  0.  We 
use  the  same  notation  (E.F,G,L,M,N)  for  the  metric  and  the  extrinsic  curvature 
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parameters,  respectively,  as  in  §6.  It  is  easy  to  verify  that  the  metric  is  necessarily 
constrained  to  be  of  the  geodesic  form  given  below: 


(7.8) 
The  unit  normal  n  to  the  surface  is  given  by 

n=(rJcxri)/G1/2.  (7.9) 

As  already  mentioned  in  §  2,  the  GW  equations  for  a  surface  can  be  written  in  terms  of 
L,M,N  and  the  usual  Christoffel  symbols  F|.  For  the  metric  given  in  (7.8),  the  latter 
reduce  to  (see  Eqs  (2.8)): 

r1  —  r2  —  r1  —  n-  "I 

1  n  -  1  11  -  1  12  -  u,  I  (7  IQ] 

r?2  =  i  G./G;     T\2  =  -2  Gx    and  1%  -  \  Gt/G.  J                         ^  ' 

Using  these,  the  GW  equations  (equations  (2.7))  read 

rxx=Ln  (7.1  la) 

rxt=±(Gx/G)rt+Mn  (7.  lib) 

rn  =  -2Gxrx  +  i(G,  /G)r,  +  JVn  (7.  1  Ic) 

n,  =  -Lr,-(M/G)rr  (7.  lid) 

n,  =  -Mr,  -  (AYG)iv  (7.  lie) 

Further,  for  this  (geodesic)  surface  it  can  be  shown  that 

(r*  x  Yxx)  =  L(rx  x  n)  =  (L/G1/2)^.  (7.12) 

Comparing  this  with  the  given  equation  (7.7),  we  get  the  constraint 

/  =  -G1/2/L,  (7.13) 

for  L  ^  0.  In  other  words:  for  a  given  /,  the  evolution  can  have  a  solution  only  if  the 
surface  coefficients  G  and  L  are  related  through/  as  above. 

For  a  surface  with  E  =  I  and  F  =  0,  the  compatibility  conditions  (r«)r  =  (r*),  and 
(rtt)x  =  (rxt}t  yield  the  following  GMC  equations: 

-(LN  -  M2)/G  =  (r2,),  -  (i*)r  +  ri.r2,  -  r^r2, 

-i-rj^-rj^  (7.14a) 

Lt-Mx=  LT\2  +  M(T2n  -  r^  )  -  NT2n  (7.  14b) 

Mt-Nx  =  LT\2  +  M(F22  -  r}2)  -  Arr22  (7.  14c) 
On  using  the  expressions  for  I*,  given  in  (7.10),  and  the  definition 

K=(LN-M2}/G,  (7.15) 
of  the  Gaussian  curvature  K,  Eqs  (7.14)  reduce  to 


(7.16a) 
L,-MX=M(GX/2G}  (7.16b) 

Mt-Nx  =  -L(Gx/2]  +  M(G,/2G)  -  N(GX/2G).  (7.  16c) 
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convenient  form, 

//~il/2\        ' ft"s~<l/2.  f<-i   i  <7\ 

Details  of  the  determination  of  G,  L,  M,  Af  are  given  elsewhere  [11].  The  equations  above 
can  be  analysed  to  yield 


/  =  ±0  (2A0  -<g-  C0c/r2)/(C^-3  -  0«)  (7-18) 

L=  -<t>/f  =  (0«  -  C00-3)/(2A0  -  02  -  C00-2)1/2  .  (7.19) 

M  =  C0cj>-1  (7.20) 

N  =  /(<£„  -  C00-3)  =  ~<K2A0  -  02  -  C0<T2)1/2,  (7.21) 

where 

0  =  G1/2.  (7.22) 

These  expressions  satisfy  the  GMC  equations,  of  course,  since  they  have  essentially 
been  obtained  by  integrating  the  latter  equations.  The  point  to  be  noted  is  that  the 
quantities  L,  M  and  N  as  well  as  /,  depend  on  <p  and  its  derivatives.  Thus,  given  an 
arbitrary  function  <p  (with  the  proviso  that  (2A0  —  </>2  -  C0^~2}  >  0),  we  can  use  (7.18)  to 
(7.21)  to  find  these  quantities  explicitly.  Combining  this  result  with  the  fundamental 
theorem  of  surfaces,  we  conclude  that  for  inhomogeneity  functions  /  determined  from 
(7.18)  in  this  fashion,  exact  solutions  r(jt,f)  of  (7.7)  exist.  In  addition,  the  solution  q 
corresponding  to  such  functions  can  be  found  explicitly.  Explicit  solutions  for  r  can  also 
be  found  for  a  special  sub-class  of/,  as  we  shall  see. 

As  mentioned  in  the  beginning,  the  solution  q  of  (7.3)  can  be  expressed  in  terms  of  the 
curvature  K  and  torsion  r,  the  parameters  of  a  (moving)  curve,  as  in  (7.4).  Using  these 
definitions  together  with  the  GW  equations  (7.11),  we  get 

«=(Sx.S,)1/2  =  (r«T«)=I.  (7.23) 

r  can  also  be  expressed  in  terms  of  the  surface  coefficients  after  some  algebra.  We  obtain 

r  =  S  •  (Sz  x  S^)/«2  -  vx  •  (r«  x  r^)/L2  =  M/G1/2.  (7.24) 

Substituting  these  expressions  in  the  Hasimoto  relation  (7.4)  gives  the  following  solution 
for  q  in  terms  of  L,  M  and  G: 

q  =  \  L  exp  i      MG~l/2dx.  (7.25) 


We  have  already  shown  that  L  and  M  are  given  by  (7.19)  and  (7.20),  for  G  =  G(JC). 
Substituting  these  quantities  in  (7.25)  and  using  (7.22)  we  get 

q  =  ifa»  -  C0^}(2A0  -ft-  C0<rT1/2  exp  \iC0  I  <r2d]  -  (7.26) 

\-       "  -1 


Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  201 


Note  that  the  torsion  r  =  Co^T1  for  this  system.  Summarizing,  we  have  the  following 
result:  For  any  arbitrary  0  (such  that  (2A0  -$-  C0^}  >  0),  the  inhomogeneity 
function/  can  be  found  from  Eq.  (4.11),  and  for  that/,  the  corresponding  solution  q  is 
given  by  eq.  (7.26). 

Next,  we  discuss  solutions  for  r  in  the  two  possible  classes  of  surfaces  corresponding, 
respectively,  to  M  =  0  and  M  ^  0: 

(i)  M  =  0: 

It  is  evident  from  (7.20)  that  C0  must  vanish  when  M  vanishes.  Using  this,  (7.19)  and 
(7.21)  yield 

L  =  <!>xx/(2A0-<t>2x}l/2     and    -N  =  -0(2A0  -  02)1/2.  (7.27) 


It  is  verified  easily  that  these  extrinsic  curvature  coefficients  correspond  to  the  following 
surface  of  revolution  [1]: 

ft  \ 

r=  (       (2A0  -  <j>2x}1/2dx,  (frcost,  4>smt}.  (7.28) 

\  J  / 

Thus  <f>  plays  the  role  of  the  generator  of  a  surface  of  revolution.  Further,  since 
r2  =  S  =  1,  (7.28)  leads  to  2A0  =  1.  By  computing  r^  from  it  we  find  L2  =  (r^-r^)  = 
2A042OC/(2A0  —  <£2),  which  agrees  with  the  expression  for  L  given  in  (7.27),  on  setting 
2A0  =  1.  (The  expression  for  N  can  be  verified  similarly).  Using  A0=  \,  C0  —  0,  in 
Eq.  (7.18),  we  get 

/=±<Kl-^)1/2Mo,  (7.29) 

This  expression  for/  was  also  obtained  by  Cieslihski  el  al  [22],  by  integrating  the 
equations  for  the  geodesies  on  a  surface  of  revolution.  We  have  thus  established  that  their 
result  corresponds  to  the  limit  M  —  C0  —  0  of  the  more  general  expression  for/  given  in 
Eq.  (7.18),  which  we  derived  as  the  inhomogeneity  function  supporting  an  exact 
dynamical  solution  for  a  time-dependent  metric.  For  C0  —  0,  Eq.  (7.26)  reduces  to 


(ii)  M  ^  0: 

This  corresponds  to  Co  7^  0  (see  Eq.  (7.20)).  Since  surfaces  of  revolutions  necessarily 
have  M  =  0,  a  non-vanishing  M  represents  some  other  class  of  surfaces.  The  construction 
of  r  in  accord  with  (7.14)-(7.21)  with  C0  ^  0  is  non-trivial  and  is  an  open  problem. 
However,  the  solution  for  q  can  be  found  from  Eq.  (7.26). 

Finally,  we  present  the  solutions  for  S  in  terms  of  the  Gaussian  curvature  K. 
Equation  (7.17)  for  K  can  be  written  as 

K  =  -0«M  (7.30) 

Using  this  in  (7.29),  we  obtain 

*/=(l-^)1/2-  (7.31) 

Substituting  (7.31)  in  the  expression  for  r  in  (7.28)  and  computing  r^  yields,  with 
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2A0  =  1, 

S  =  r,  =  ((1  -  <£2)1/2,  (j)x cos t, 4>x sinf) 
=  (Kf,  (1  -  AT2/2)1/2cosf,  (1  -#2/2)1/2sin/).  (7.32) 

Hence  S  is  given  as  a  functional  of  the  product  Kf.  For  the  cases  of  a  torus,  a  sphere  and  a 
catenoid,  the  corresponding  solutions  have  been  found  explicitly  in  Ref.  1 1 . 

8.  Concluding  remarks 

Understanding  the  relationship  between  nonlinear  analysis  and  geometry  is  currently  an 
active  field  of  research.  Many  interesting  results  have  not  been  considered  here  due  to 
limitations  of  space.  For  instance,  it  has  been  shown  [23]  recently  that  if  a  non-stretching 
curve  moves  on  an  jV-dimensional  sphere  in  such  a  way  that  its  dynamics  does  not  depend 
explicitly  on  the  radius  of  the  sphere,  the  MKdV  hierarchy,  the  NLSE  hierarchy  and  their 
multi-component  generalizations  arise  in  a  natural  fashion,  for  N  =  2,  N  =  3  and  N  >  3 
respectively.  New  connections  have  been  established  using  the  differential  geometry  of 
constant  mean  curvature  surfaces  [24].  Another  aspect  which  we  have  not  discussed  in 
this  paper  is  that  of  a  geometric  phase  associated  with  moving  curves  [19]  and  solitons 
[25].  Interesting  parallels  have  been  found  between  elastic  dynamics  of  ribbons  [26]  and 
the  appearance  of  such  anholonomic  phases. 

Our  emphasis  in  this  paper  has  been  on  showing  how  physical  applications  can  be 
instructive  in  suggesting  ways  of  obtaining  interesting  mathematical  connections  between 
geometry  and  nonlinearity.  The  seminal  work  of  Hasimoto  [5]  in  fluid  dynamics  and  its 
subsequent  novel  developments  [27]  represent  a  case  in  point.The  added  advantage  of 
analysing  a  physical  system  is  the  possibility  of  observing  such  exact  mathematical 
solutions  (e.g.,  solitons)  in  the  laboratory  [28].  It  is  clear  that  the  underlying  nonlinearity 
in  the  system  plays  a  crucial  role  also  in  deciding  the  topological  character  of  the 
solutions.  For  example,  the  kink  configurations  of  the  sine-Gordon  equation,  which  are 
topologically  non-trivial,  get  replaced  by  (topologically)  uninteresting  plane  wave 
solutions  when  the  equation  is  linearized.  We  close  with  the  remark  that  it  would  be  of 
great  interest  to  understand  the  connection  between  nonlinearity  and  differential 
geometry  using  physical  applications  in  higher  dimensions. 
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Abstract.  Nonlinear  effective  Lagrangian  models  with  a  chirai  symmetry  have  been  used  to 
describe  strong  interactions  at  low  energy,  for  a  long  time.  The  Skyrme  model  and  the  chirai  quark- 
meson  model  are  two  such  models,  which  have  soliton  solutions  which  can  be  identified  with  the 
baryons.  We  describe  the  various  kinds  of  soliton  states  in  these  nonlinear  models  and  discuss  their 
physical  significance  and  uses  in  this  review.  We  also  study  these  models  from  the  view  point  of 
classical  nonlinar  dynamical  systems.  We  consider  fluctuations  around  the  B  =  1  soliton  solutions 
of  these  models  (B,  being  the  baryon  number)  and  solve  the  spherically  symmetric,  time-dependent 
systems.  Numerical  studies  indicate  that  the  phase  space  around  the  Skyrme  soliton  solution 
exhibits  spatio-temporal  chaos.  It  is  remarkable  that  topological  solitons  signifying  stability/order 
and  spatio-temporal  chaos  coexist  in  this  model.  In  contrast  with  this,  the  soliton  of  the  quark- 
meson  model  is  stable  even  for  large  perturbations. 

Keywords.  Nonlinear;  chirai  models;  Skyrme;  quarks;  mesons;  soliton;  baryon;  spatio-temporal 
chaos. 
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1.  Introduction 

Most  of  the  field  theories  of  physical  interest  are  highly  nonlinear  and  though  perturbative 
techniques  are  very  successful  in  elementary  particle  physics,  condensed  matter 
physics  and  several  other  areas,  one  has  to  grapple  with  the  nonlinearities  in  the 
underlying  field  theories,  sooner  or  later.  This  is  especially  true  of  theories  which  have 
no  small  parameter  in  which  a  perturbative  expansion  can  be  made,  like  the  models 
for  strong  interaction  or  strongly  correlated  electron  systems.  So  one  has  to  rely  on  non- 
perturbative  methods  for  these.  It  has  been  found  that  some  of  these  field  theories  have 
a  rich  structure  at  the  classical  level,  ihanks  to  their  nonlinear  nature.  They  possess 
localised,  stable  solutions  with  finite  energy  at  the  classical  level.  These  are  the  soliton 
solutions  in  the  field  theory  parlance.  Examples  are  the  kink  solutions  of  the  Sine-Gordon 
theory,  the  vortex  solutions  of  the  Abelian-Higgs  model  or  Chern-Simons-Higgs 
model  in  2  +  1  dimensions  of  relevance  to  superconductivity,  and  the  't  Hooft-Polyakov 
monopole  solution  of  a  (9(3)  gauge  theory  (or  any  gauge  theory  based  on  a  semisimple 
group  in  which  0(3}  can  be  embedded),  of  relevance  to  proton  decay  and  cosmological 
models  based  on  grand  unified  theories  [1].  All  these  soliton  solutions  have  one  thing 
in  common;  they  have  a  topological  number  or  a  winding  number  associated  with 
them  and  an  associated  conservation  law.  This  accounts  partly  for  their  stability  also. 
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There  are  some  models  with  soliton  solution  which  have  nothing  to  do  with  any  topology; 
the  SLAC  bag  model  [2]  and  the  Friedberg-Lee  model  [3]  are  examples  of  important 
theories  with  non-topological  soliton  solutions. 

Long  before  quantum  chromodynamics  (QCD)  emerged  as  the  theory  of  strong 

interactions,  it  had  been  recognised  that  strong  interactions  are  approximately  invariant 

under  chiral  symmetry.  In  fact,  it  was  known  that  for  describing  hadronic  physics,  the 

5t/(2)  x  SU(2)  symmetry  based  models  are  better  than  the  SU(3)  symmetry  based 

models.  In  1962,  Skyrme  proposed  a  Lagrangian  model  with  purely  pionic  degrees  of 

freedom,  invariant  under  517(2)  x  SU(2],  and  noted  that  the  soliton  solutions  of  this 

theory  fall  into  different  homotopy  classes  characterised  by  a  winding  number  n  [4]. 

Skyrme  identified  the  winding  number  with  the  baryon  number.  It  was  a  very  bold  and 

novel  idea  at  that  time,  to  describe  fermions  from  a  purely  bosonic  field  theory.  Later  it 

•  was  established  that  the  Skyrme  solitons  should  indeed  be  quantized  as  fermions  for  odd 

number  of  colour  charges  [5].  In  general,  one  uses  a  collective  quantization  approach  to 

quantize  the  classical  soliton  solutions  of  the  model  [6].  The  properties  of  the  nucleon 

deduced  in  this  model  are  in  reasonable  agreement  with  the  experimental  values.  The 

Skyrme  model  also  has  a  classical  solution  with  winding  number  2,  with  an  energy 

distribution  characteristic  of  a  diffuse  torus  [7,8].  When  quantized,  the  lowest  state  has 

the  quantum  numbers  of  the  deuteron,  with'  the  correct  size,  though  the  binding  energy  is 

far  too  large.  These  toroidal  solutions  exist  for  winding  number  3,  4, ...  also  [7].  Apart 

from  these,   other  states   are  obtained  when  the  Skyrme   model  is   extended   to 

SU(N),N  >  3  [9, 10].  The  Skyrme  model  has  also  been  applied  to  meson-baryon 

scattering  [11]  and  inter-nucleon  potential  [12]  with  some  success.  We  describe  the 

salient  features  of  the  Skyrme  model  in  §  2. 

Though  the  Skyrmion  picture  of  baryons  is  very  attractive,  it  has  some  problems  both 
at  the  phenomenological  as  well  as  at  the  conceptual  level.  On  the  basis  of  general 
arguments  grounded  firmly  in  the  non-perturbative  structure  of  QCD,  it  appears  that  a 
purely  topological  soliton  picture  of  nucleon  may  not  be  correct.  The  favoured  structure 
for  the  baryon  involves  three  valance  quarks  interacting  chiral-invariantly  with  meson 
fields  [13].  This  leads  us  to  the  chiral  quark-meson  model  (CQM)  [14].  In  this  model  also, 
there  is  a  soliton  solution  with  three  valance  quarks.  Here  the  baryon  number  of  the 
soliton  is  provided  by  the  quarks  explicitly.  Though  there  are  topological  properties 
associated  with  the  meson  fields  in  this  model,  the  soliton  solution  here  is  not  topological 
in  the  same  sense  as  in  the  Skyrme  model.  Phenomenologically,  CQM  is  somewhat  better 
than  the  Skyrme  model.  There  are  some  close  parallels  between  the  Skyrme  model  and 
CQM,  in  that  many  novel  soliton  states  of  the  former  are  present  in  the  latter  also.  For 
instance,  CQM  also  has  a  stable  state  with  the  quantum  numbers  of  the  deuteron  and 
states  with  higher  baryon  number  with  toroidal  energy  distributions  [15],  just  as  in  the 
Skyrme  model.  We  review  this  model  in  §  3. 

Gauge  theories  have  been  studied  from  the  view  point  of  nonlinear  dynamical 
systems,  in  recent  times.  One  considers  spatially  homogeneous  solutions  of  the  Yang- 
Mills  (YM)  theory  and  obtains  a  Hamiltonian  system  (with  a  finite  number  of  degrees 
of  freedom),  which  is  then  shown  to  be  stochastic  [16].  With  the  inclusion  of  Higgs 
fields,  Chern-Simons  term  etc.,  gauge  theories  remain  chaotic  in  the  case  of  spatially 
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YM  theory.  By  considering  a  harmonic  expansion  of  the  fluctuation,  the  problem 
essentially  reduces  to  the  Fermi-Pasta-Ulam  approach  for  coupled  nonlinear  oscillators 
[21].  It  was  shown  that  the  phase  space  near  the  Wu-Yang  static  solution  is  a  stochastic 
region  implying  that  the  spherically  symmetric  time  dependent  YM  system  is  non- 
integrable.  Joy  and  Sabir  have  shown  that  YM  and  the  Yang-Mills-Higgs  (YMH) 
theories  are  non-integrable  by  using  the  Painleve  analysis  and  also  by  a  computation 
of  the  maximal  Lyapunov  exponent  [22].  Systematic  and  detailed  numerical  work  by 
Kawabe  and  Ohta  showed  that  the  classical  YM  theory  is  always  chaotic  and  that 
there  is  no  phase  transition  [23].  However,  when  the  Higgs  fields  are  introduced  and 
the  phase  space  around  the  't  Hooft-Polyakov  monopole  solution  of  the  YMH 
theory  considered,  there  is  a  phase  transition  when  the  perturbation  parameter  is 
varied  [24]. 

It  would  be  of  interest  to  find  whether  chaos  in  the  underlying  theory  of  strong 
interactions,  namely  QCD,  is  revealed  at  the  level  of  the  effective  Lagrangian  models 
based  on  it,  also.  The  stability  property  of  the  Skyrme  soliton  has  been  discussed  in  the 
literature  [25].  When  the  fluctuations  around  the  soliton  solutions  are  small,  they  can  be 
treated  in  a  perturbative  manner  and  satisfy  a  linearized  set  of  equations.  Skyrmions  are 
topological  objects  and  it  is  not  surprising  that  these  studies  confirm  their  stability  under 
small  perturbations.  This  is  what  makes  it  a  meaningful  object,  fit  for  quantization. 
Though  large  fluctuations  may  not  be  relevant  for  the  static  properties  of  an  individual 
isolated  Skyrmion,  they  are  expected  to  play  a  role  when  they  are  subjected  to  external 
fields  or  when  we  consider  interactions  among  them.  This  is  our  motivation  for  studying 
the  fluctuations  in  a  non-perturbative  manner  by  solving  the  full  nonlinear  equations  for 
the  system,  though  at  a  classical  level. 

In  §  4  we  consider  fluctuations  around  the  B  =  1  soliton  solution  of  the  Skyrme  model 
and  determine  their  time  evolution  numerically  [26].  We  also  find  the  time  variations  of 
the  amplitudes  of  the  various  harmonic  modes  of  the  fluctuations.  Our  initial  fluctuations 
correspond  to  excitation  of  only  one  mode.  We  consider  both  fluctuations  whose 
magnitudes  are  small  compared  to  those  of  the  static  solitons,  as  well  as  large  initial 
fluctuations  which  are  comparable  to  the  magnitude  of  the  static  soliton  solution.  In  the 
case  of  the  Skyrme  model,  for  small  initial  fluctuations,  the  amplitude  of  the  initial  mode 
has  a  regular  behaviour,  whereas  the  amplitudes  of  the  other  modes  increase  slowly,  on 
the  average.  The  "radius"  of  the  soliton  oscillates  around  the  static  value.  However,  when 
the  perturbation  is  large,  the  amplitude  of  the  initial  mode  decreases  gradually  and  other 
modes  are  excited  substantially.  The  radius  of  the  soliton  also  tends  to  increase  with  time 
on  the  average.  All  these  indicate  chaos  in  the  system.  So,  both  topological  solitons 
(symbolizing  a  high  degree  of  symmetry  and  regularity)  and  stochasticity  coexist  in  the 
model. 

We  have  studied  the  stability  aspects  of  the  chiral  quark-meson  model  [26]  in  §  5. 
In  contrast  to  the  Skyrme  model,  we  find  that  the  soliton  of  the  chiral  quark-meson  model 
is  very  stable  even  for  large  perturbations.  The  amplitudes  of  the  initial  modes  remain 
constant  for  a  long  time  and  there  is  very  little  excitation  of  the  other  modes.  The  system 
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remarks. 

2.  Baryons  as  topological  solitons  in  the  Skyrme  model 

The  Lagrangian  of  the  Skyrme  model  including  the  pion-mass  is  given  by  [27] 


r[(tf+I/t)-2].  (1) 

Here  U  is  an  517(2)  matrix-  valued  function  of  the  pionic  field  variables  7ra(0  =  1,  2,  3)  of 
the  form 

(2) 

where  ra(a  =1,2,3)  are  the  Pauli  matrices  and  F^  is  the  pion-decay  constant,  whose 
physical  value  is  93  MeV.  e  is  a  dimensionless  coupling  constant  and  m^  is  the  pion  mass. 
Except  for  the  last  term,  Lsicyrme  is  invariant  under  the  chiral  51^(2)  x  SU&  (2) 
transformations  (left-handed  and  right-handed  isospin  transformations): 

U  -»  gUh*  (3) 

where 

g£SUL(2)     and    h  6  5t7R(2).  (4) 

As  things  stand,  the  Skyrme  Lagrangian  is  one  among  a  class  of  nonlinear  cr-models 
describing  the  low  energy  interactions  among  pions,  which  are  known  to  be  invariant 
under  5£7L(2)  x  5£7R(2)  transformations. 

Now,  for  finite  energy  configurations,  U  -*  UQ,  a  constant  unitary  matrix,  as  r  -*  oo. 
By  a  global  rotation  we  can  choose  UQ  to  be  the  identity  matrix.  Hence, 

U  —  >  1     as    r  —  >  oo.  (5) 

Thus,  we  may  think  of  all  points  at  spatial  infinity  as  being  identified  with  a  single  point. 
Because  of  this,  the  configuration  space  /?3  of  coordinates  r  becomes  the  three  sphere  S3, 
on  which  U  is  well  defined.  Now  as  U  £  SU(2),  it  can  be  written  as 

U  =  n0  +  h.n  (6) 

where, 

nj+n2=l.  (7) 

Hence  the  group  517(2)  is  identified  topologically  with  the  three  sphere  53.  Thus  the 
fields  U  are  functions  on  S3  and  takes  values  in  53.  Then  all  the  finite  energy 
configurations  U  fall  into  distinct  homotopy  classes  of  the  map  53  ->  53,  characterized  by 
a  winding  number  n.  The  explicit  expression  for  the  winding  number  n  is, 

n  =  ~  eijk  j  d3*Tr  [tfdtUrftyUtfdkUl.  (8) 
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Figure  1.    The  Skyrme  function  Q(x), corresponding  to  the  hedgehog  ansatz  (B  =  1). 


Skyrme  boldly  conjectured  that  the  static,  localized,  finite  energy  solutions  or  the  solitons 
of  the  model  correspond  to  the  baryons  and  that  the  winding  number  n  should  be 
identified  with  the  baryon  number  B.  The  second  term  in  Lskyrme  (quartic  term),  known  as 
the  Skyrme  term  gives  an  elegant  lower  bound  to  the  energy  of  the  soliton  and  prevents  it 
from  collapsing  to  zero  size.  It  is  the  stabilizing  term. 

The  lowest  energy  soliton  solution  with  n  =  B  =  1,  of  the  model  is  invariant  under  the 
combined  spatial  and  isospin  rotations: 


Then  U  has  the  form 

U  =  exp[iT-H 


cos6(r)  +  rr-rsin0(r). 


(9) 


(10) 


This  is  known  as  the  spherically  symmetric  or  'hedgehog'  ansatz.  The  winding  number 
for  this  ansatz  can  be  shown  to  be 


n=B=-[e(0)-0(oo)]. 


(U) 


0  for  the  B  =  1  soliton,  to  be  identified  with  the  nucleon  satisfies,  0(0)  =  TT  and 
0(oo)  =  0.  With  the  ansatz  in  equation  (10),  the  equations  of  motion  reduce  to: 

sin2  0  sin  20 


«     •    2^/^W 

—  +2  sin2©  0"H 
4  J  2 


•      n^^rZ        Sin  20 

+sin20  0'2  -- 
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where  we  have  introduced  the  dimensionless  variables  x  =  2eF^r  and  /3  =  (m^/4e2F'fy. 
Here,  prime  denotes  differentiation  with  respect  to  x.  The  solution  Q(x)  has  the  form 
given  in  figure  1.  The  size  of  the  soliton  solution  is  «  1  fm. 

The  Skyrme  model  Lagrangian  has  an  extra  discrete  symmetry  (apart  from  the  chiral 
symmetry),  which  has  no  experimental  basis.  This  additional  symmetry  is  removed  by 
adding  the  so  called  'Wess-Zumino  term'  [5].  A  careful  treatment  of  the  chiral  anomalies 
also  leads  to  this  term.  Witten  showed  that  the  'hedgehog'  solitons  should  be  quantized  as 
fermions  for  odd  number  of  colour  charges  when  the  Wess-Zumino  term  is  included  [5]. 
In  general,  one  uses  a  collective  quantization  approach  to  quantize  the  classical  soliton 
solutions  of  the  model  [6].  The  spin  and  isospin  values  of  the  quantum  states  pertaining  to 
the  'hedgehog'  soliton  are  given  by  7  =  /  =  n/2,  n  =  1, 2, ...  with  n  =  1  corresponding  to 
the  nucleons  and  n  =  3  to  the  A  resonance. 

The  static  properties  of  the  nucleon  calculated  from  the  Skyrme  model  are  in 
agreement  with  the  observed  experimental  values  to  an  accuracy  of  25-30%,  in  general 
[28].  However  there  exist  some  problems  at  the  phenomenological  level,  some  of  which 
are  serious. 

There  are  soliton  solutions  with  B  =  n  >  2,  which  are  cylindrically  symmetric 
configurations  in  which  the  meson  fields  are  twisted  n  times  in  the  azimuthal  plane  [7,  8]. 
For  these  configurations,  U  has  the  form 

U  =  cos0(p,z)  +  ZT'ttsm.0(p,z),  (13) 

n  =  (sin  a(p,  z)  cos  n$,  sin  a(p,  z)  sin  n(j),  cos  ce(p,  z)) ,  (14) 

where  p,  z  and  $  are  the  cylindrical  coordinates.  0  and  a  satisfy  the  boundary  conditions, 
0(p,z)->0    as    p2  +  z2-4cx),e(p,-z.)  =  e(p,z),e(0,0)=7r          (15) 
and 

a(p,-z)  =  a(p,z)+7r.  (16) 

The  baryon  number,  which  is  also  the  winding  number  is  given  by 

„     n  6(0,0)  ,     . 

B= i-^  =  n.  (17) 

7T 

There  are  classically  stable  solutions  for  this  form  of  U  with  energy  distributions 
corresponding  to  diffuse  tori.  Upon  quantization,  it  is  found  that  the  lowest  energy  state 
has  the  quantum  numbers  of  the  deuteron  and  can  be  identified  with  it  [8].  The  binding 
energy  is  very  high.  The  size  of  the  soliton  (measured  by  the  root  mean  square  radius)  is 
nearly  half  the  size  of  the  physical  deuteron. 

The  Skyrme  model  has  been  generalized  to  SU(3).  A  straightforward  generalization  of 
the  'hedgehog  ansatz'  for  SU(2)  to  SU(3)  consists  in  taking 

\ 


cos  0  +  zr.r  sin  0      :       0 
U=  ' 


(18) 


for  the  baryon  octet  [6].  Collective  quantization  will  yield  the  baryon  octet  and  the  spin-| 
decuplet,  with  the  Wess-Zumino  term  playing  a  crucial  role  in  giving  the  correct  quantum 
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numbers.  The  D/F  ratio  of  the  meson-baryon  couplings  is  predicted  correctly  in  the 
5/7(3)  model  [29].  However  the  mass  spectrum  of  the  baryons  is  not  satisfactorily 
explained  with  the  direct  generalization  of  the  'hedgehog'  [30].  There  is  an  alternate 
approach  to  the  strange  baryons  in  the  Skyrme  model,  in  which  a  strange  baryon  is 
viewed  as  a  bound  state  of  a  K-meson  with  the  'hedgehog'  soliton  [31].  This  has  some 
phenomenological  advantages. 

There  is  a  hierarchy  of  'spherically  symmetric'  ansatze,  when  we  consider  the  Skyrme 
model  for  Nf  number  of  flavours  with  U  €  SU(Nf]  and  Af  >  3  [6].  For  general  Nf,  there 
are  soliton  solutions  U  characterized  by  the  following  generalized  'spherical'  symmetry. 


(19) 

where  A,-  generate  any  5t/(2)  or  50(3)  subgroup  of  the  flavor  SU(Nf}: 

[Ai,A/]  =  t€,-i/fcAJk.  (20) 

In  other  words,  A,-  are  the  generators  of  the  spin-J  representation  of  517(2)  or 
5(9(3)  C  SU(Nf).  For  a  given  value  of  Nf,  J  can  take  values  J,  l,....,l(//f  -  1).  For 
ATf  =  3,  apart  from  the  solution  with  A/  =  A,-/2  (z  =  l,2,3)  corresponding  to  the 
embedding  of  the  517(2)  hedgehog  soliton  in  SU(3)  as  in  (20),  there  is  another  soliton 
solution  corresponding  to  the  following  50(3)  subgroup  of  5£7(3)[9]: 

AI  =  A7,  A2  =  —  A5     and    AS  =  A2.  (21) 

The  baryon  number  for  U  corresponding  to  this  is  given  by  B  =  0,  2,4  •  •  •  .  The  lightest 
dibaryon  in  this  sector  is  an  517(3)  singlet  with  strangeness  S  =  —  2  and  with  a  mass  of 
the  order  of  2.2  GeV,  while  the  next  set  of  states  are  octets  with  spin  1  or  2.  These  states 
have  been  predicted  by  the  MIT  bag  model  also  [32]. 

When  Nf  =  4,  apart  from  the  two  solitons  characterized  by  At-  =  A,-/2  and  the  one 
described  in  equation  (23),  there  is  another  soliton  solution  [10]  corresponding  to  the 
following  SU(2)  subgroup  of  517(4)  : 


A6  +  i>/3  An  ,     A2  =  i\/3  A2  +  A7 

and      A3  =  1^3  A3  +  A8  +  ^/3  A15  .  (22) 

The  associated  soliton  states  have  B  =  0,  1,  2,  ....  Upon  collective  quantization,  it  is  found 
that  the  low-lying  B  =  I  states  correspond  to  a  4*  representation  of  517(4)  with  spin-  1,  a 
20*  with  spins  i,  f,  §  or  \  and  a  20*'  with  spins  f,  |  or  \  [10]. 

3.  Baryonic  solitons  in  the  chiral  quark-meson  model 

As  has  been  mentioned  already,  the  Skyrmion  picture  of  baryons  is  troublesome  from  the 
point  of  view  of  QCD  and  the  quark  model.  The  favoured  structure  for  the  baryon 
involves  three  valance  quarks  bound  by  effective  gluon  exchanges,  and  interacting  chiral- 
invariantly  with  meson  fields.  A  model  in  which  the  quarks  interact  strongly  with  mesons 
only  in  a  solitonic  configuration  may  be  considered  as  a  simplified  picture  of  the  above. 
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In  its  simplest  version,  the  chiral  quark-meson  model  is  described  by  the  Lagrangian  [33]. 


[/t)_2].  (23) 

Here  \I>  =  I  ,  J  denotes  the  isodoublet  of  quark  fields  u,  d  and  \!>L,R  are  the  left-handed 
and  right-handed  components, 

(24) 

g  is  the  quark-meson  coupling  constant,  and  as  usual  U  =  exp(—  i(t.ti/F^)}.  Except  for 
the  last  term,  the  Lagrangian  is  invariant  under  the  chiral  SU(2)  x  517(2)  transformations 
given  by 

#L-»A#L,     #R-+5#R,     and    U  ->  AUB\  (25) 

where  A  and  B  are  the  left-handed  and  right-handed  SU(2)  transformations.  The  model 
has  the  same  structure  as  the  nonlinear  version  of  the  cr-model  introduced  long  ago  by 
Gell-Mann  and  Levy  [34],  except  that  \£  refers  to  the  quark  fields  here,  whereas  in  the 
old  model  it  refers  to  the  nucleons. 

In  the  mean-field  approximation,  the  quantum  fluctuations  of  the  it  fields  are  ignored. 
Taking  the  cue  from  the  Skyrme  model,  we  assume  a  hedgehog  ansatz  for  n 
corresponding  to  the  B  =  I  soliton  solution  with  3  quarks,  i.e., 

U  =  exp(-z'T.r  0(r))  =  cos  6(r)  —  nr.rsin  6(r).  (26) 

(Here  a  negative  sign  is  taken  in  the  exponential  to  facilitate  easy  comparison  with  the 
literature  on  CQM).  Anticipating  the  result,  the  following  form  for  ^  is  chosen  and  it 
provides  a  self-consistent  solution  to  the  equations  of  motion: 


.  xpHef),  (27) 

ia.rF(r)xJ  ^ 

where  e  is  the  quark  eigen  energy  and  ^  is  a  spin-isospin  singlet,  i.e., 

X  =  ^[M)-|<*T)].  (28) 
This  satisfies 

0,  (29) 


where  r/2  and  <r/2  are  the  isospin  and  spin  generators. 

With  the  ansatz  given  in  (26)-(28),  the  equations  of  motion  for  G,  F  and  6  following 
from  the  space  integral  of  L  in  (23)  are  [33] 

(30) 
)=0,  (31) 
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Figure  2.     The  upper  and  lower  components  (G(x),  F(x))  of  the  quark  function  for 
the  B  =  1  soliton  solution  in  the  chiral  quark-meson  model. 


0" 


26' 


_  m2  sin  6  +       [(G2  -  F2}  sin  6  -I-  2GF  cos  O]  =  0. 

PIT 

(32) 


The  factor  3  arises  in  the  last  equation  due  to  the  fact  that  three  quarks  make  up  the 
soliton  state. 

We  have  solved  equations  (30)-(32)  using  the  numerical  code  COLSYS  subject  to  the 
boundary  conditions  G(oo)  =  F(oo)  =  9(oo)  =  0,  F(0)  =  0  and  6(0)  =  IT.  [Note  that 
sin  6(0)  =  0,  to  ensure  that  6  is  well-defined  at  the  origin].  For  g  less  than  some  critical 
value  gcrit,  no  soliton  solutions  exist.  This  is  not  a  surprising  result  and  merely  indicates 
that  a  certain  minimum  level  of  attraction  is  needed  to  bind  the  quarks.  Numerically,  it  is 
found  that  gcrit  ~  3.95.  One  should  choose  g  so  that  it  leads  to  a  reasonable  energy  for  the 
soliton.  It  is  found  that  gF^  =  500  MeV  or  g  =  5.367  gives  a  satisfactory  result. 

It  is  convenient  to  use  the  dimensionless  variables, 


and     F(x)  = 


(33) 


In  terms  of  these  scaled  variables,  the  normalization  condition  for  the  quark  wavefunction 
is  given  by 

47T  f°°x2Ax(G2(x}+F2(x}}  =  1.  '  (34) 

Jo 

The  quark  wavefunctions  G(x)  and  F(x]  are  plotted  as  functions  of  x  in  figure  2.  6(x)  has 
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essentially  the  same  form  as  in  figure  1.  The  size  of  the  soliton  «  1.75,  in  dimensionless 
units.  This  corresponds  to  w  0.7  fm  as  it  should  be. 

The  state  corresponding  to  the  above  soliton  is  neither  that  of  N  nor  that  of  A,  but  a 
superposition  of  both.  One  has  to  use  appropriate  projection  techniques  to  find  the 
quantum  states  corresponding  to  the  neutron,  proton  and  A  states. 

Till  now,  we  have  worked  with  the  nonlinear  a  model.  One  could  also  work  with  the 
linear  cr-model  [14]  in  which  U  has  to  be  replaced  by  the  matrix  field 

M  =  ^r(cr-irn).  (35) 

Fit 

The  linear  model  is  essentially  given  by  (23),  with  U  replaced  by  M  and  with  the 
inclusion  of  some  additional  terms.  However  there  is  very  little  difference  between  the 
soliton  solutions  of  the  nonlinear  and  linear  versions  of  the  model.  Phenomenologically, 
the  linear  model  is  on  a  slightly  better  footing  (as  it  has  more  parameters)  and  leads  to 
satisfactory  results  for  the  static  properties  of  N  and  A. 

There  are  classically  stable  cylindrically  symmetric  configurations  with  B  =  2,  that 
involve  6  quarks,  in  this  model  also,  as  in  the  Skyrme  model.  For  these  configurations  U 
again  has  the  form  given  in  (13)  and  (14)  (with  n  —  2),  with  the  boundary  conditions 
given  by  (15)-(16).  For  the  B  =  2  toroidal  Skyrmion  (corresponding  to  the  deuteron), 

(1-3  +  2T3)  |  physical  state)  =  0,  (36) 

where  Lj  and  the  Tj  are  the  z-components  of  the  orbital  angular  momentum  and  the 
isospin  generator  respectively.  This  is  simply  the  statement  of  cylindrical  symmetry  in  the 
operational  form,  namely  that  a  spatial  rotation  of  a  about  the  z-axis  can  be  compensated 
for  by  an  isorotation  of  —  2a  about  the  z-axis.  For  the  quark  fields  in  our  model,  the 
relevant  symmetry  operator  is  #3  =  L3  +  TI  +  2T3,  where  83  is  the  z-cornponent  of  the 
spin  angular  momentum.  As  Lj  is  integral  and  £3  and  T-$  are  ±  \,  it  is  obvious  that  the 
lowest  values  of  £3  are  ±|  for  the  quark  fields.  The  most  general  form  for  ^  with 
£3  =  +|  will  be  of  the  form: 


where/1,/2,/3,/4,  gi,  gi,  #3  and  #4  are  functions  of  p  and  z.  Similarly,  we  can  construct 
the  most  general  \3>  with  K$  =  -\.  Plugging  in  this  form  for  \I>  and  U  into  the  equations 
of  motion,  we  get  ten  coupled  nonlinear  partial  equations  (/•,  #,-,  6  and  a).  It  is  not 
obvious  that  there  exist  any  simple  procedure  to  solve  this  set  of  equations.  However,  we 
assume  a  simplified  ansatz  for  \£  which  satisfies  all  the  symmetry  requirements  and 
yields  the  real  variational  minimum.  We  find  that  a  spinor  \I>  of  the  following  form 
minimizes  the  energy 


where  x  is  column  vector  in  spin-isospin  space  which  satisfies 

)x  =  0.  (39) 
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Figure  3.    Contour  plot  of  the  energy  density  function  e(p,  z)  for  the  B  =  2  toroidal 
soliton  in  CQM. 

The  following  two  solutions  x\  and  X2  with  #3  =  +1  and  -\  respectively  satisfy  the 
above  equation: 

1 


Xi  = 


•\/2(l  —  cos  6  cos  a) 


and 


X2  = 


>/2(l  —  cos  6  cos  a) 


[sin  au 


[sin  ad  T  ~ 


-(cos  a 


M  J,  +(cosa 


(40) 


(41) 


It  can  be  easily  checked  that  the  two  solutions  are  degenerate  in  energy. 

For  the  ansatz  given  by  the  equations  (13),  (14),  (38)  and  (40),  we  have  found  that  there 
is  a  solution  by  minimising  the  energy,  using  a  relaxation  procedure  [15].  We  find  the 
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Figure  4.    Contour  plot  of  the  energy  density  function  e(p,  z)  for  B  =  3. 


energy  of  the  configuration  to  be  2019  MeV.  As  the  classical  energy  for  the  B  —  \  state  is 
1119  MeV,  the  classical  binding  energy  is  219  MeV.  In  figure  3,  we  give  the  contour  plot 
of  the  energy  density  function  e(p,z)  which  clearly  indicates  a  toroidal  structure  of  the 
soliton  solution,  (the  variables  in  the  figure  are  scaled  in  units  of  gF^).  A  naive  value  for 
the  mean  square  radius  of  the  charge  distribution  corresponding  to  the  soliton  would  be 
given  by  (r2)  =  2?r /  pdpdz r2  (G2  +  F2}.  We  find-  (r2)1^  to  be  0.934  fm. 

The  symmetries  of  this  toroidal  solution  closely  parallel  those  of  the  corresponding 
object  in  the  Skyrrae  model  and  its  ground  state  has  the  quantum  numbers  of  the 
deuteron,  upon  quantization.  Whereas  the  stability  of  the  solitons  was  due  to  the  presence 
of  the  quartic  term  in  the  Skyrme  model,  here  it  is  due  to  the  quark-meson  couplings. 

For  the  solutions  with  5  =  3(9  quarks)  and  5  =  4(12  quarks)  we  have  used  a 
simplified  ansatz  of  the  form  given  in  (38),  where  G  and  F  are  now  taken  to  be  functions 
of  r  only.  Even  this  brings  out  the  salient  features  of  the  soliton  solution.  By  minimising 
the  energy,  we  find  that  the  soliton  solutions  are  again  toroidal  in  nature  and  closely 
parallel  the  corresponding  toroidal  solutions  in  the  Skyrme  model.  We  give  the  contour 
plot  of  the  energy  density  e(p,z)  for  B  =  3  in  figure  4.  The  quark-meson  model  gives 
larger  values  for  the  binding  energies.  It  is  gratifying  to  note  that  the  toroidal  states  are 
independent  of  the  particular  chiral-invariant  model. 

In  the  Skyrme  model,  the  strange  baryons  are  treated  as  excitations  of  a  bound  state  of 
a  kaon  and  the  hedgehog  soliton  in  the  Callan-Klebanov  approach  [31].  In  CQM,  an 
equivalent  of  this  method  using  a  random  phase  approximation  has  been  used  with  very 
limited  success  [35].  It  would  be  natural  to  think  of  a  strange  baryon  as  corresponding  to 
a  soliton  solution  with  nontrivial  strange  quark  component  and  kaonic  fields,  with  the  s 
quark  providing  the  strangeness  quantum  number  just  as  the  isospin  of  the  baryon  comes 
from  the  u/d  quarks  in  this  model.  We  do  indeed  find  soliton  solution  with  non-zero 
strangeness  and  5=1  [36].  Unfortunately,  these  exist  only  for  those  values  of  the 
coupling  constant  g  which  are  far  greater  man  the  phenomenologically  acceptable 
number  and  hence  do  not  correspond  to  the  known  strange  baryons. 
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Nonlinear  cnirai  moaeis 
4.  Regularity  and  chaos  in  the  Skyrme  model 

In  the  following,  we  study  the  dynamical  stability  of  the  skyrmion.  As  discussed  earlier 
our  approach  is  different  from  the  linear  stability  analysis:  we  explore  the  phase  space 
around  the  soliton  solution  directly,  by  solving  the  time-dependent  equations  numerically 
[26].  The  full  nonlinearity  of  the  theory  comes  into  play  in  the  time-evolution. 

The  spherically  symmetric  hedgehog  ansatz  in  equation  (10)  is  now  made  time- 
dependent: 

t/(r,  /)  =  cos  9(r,  /)  +  it  •  rsin  9(r,  t).  (42) 

It  can  be  seen  that  if  U  is  of  the  hedgehog  form  initially,  it  will  remain  so  at  later  times 
also,  as  the  "spherical  symmetry"  is  maintained.  In  other  words,  we  are  considering  the 
restricted  class  of  "spherically  symmetric"  fluctuations.  Corresponding  to  this  ansatz,  the 
Lagrangian  (integrated  over  space)  is  given  by 


L'  =  47r 


1   sin29/  A2     __.,     sm2t/  i      '«_/•_,.       _.      ,.  i  ,._, 

+  — =• — =—    29   -29^ =—     +-"(cos9-l)^,  (43) 

2ez     rl      \  rz      I         4  I 

where  overdot  denotes  differentiation  with  respect  to  t.  The  Hamiltonian  is  given  by 
If 


/(  F2  ( 
r2  drl  -*  I  9 


(44) 


8  now  satisfies  the  following  equation  of  motion: 


sin20sin2e  .    _      rt  ,Atf. 

=  0,  (45) 

J(f  H" 

where  x  and  (3  have  been  defined  earlier. 

We  consider  the  system  in  a  finite  volume.  This  is  perfectly  justified,  as  the  skyrmion 
has  a  finite  size.  We  discretize  the  spatial  variable  x  and  replace  the  derivatives  by 
corresponding  difference  expressions: 


(46) 
(47) 


6x 
and 
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the  number  of  lattice  points.  In  our  analysis,  we  have  taken  the  length  of  the  system  to  be 
8  units(the  size  of  the  skyrmion  in  dimensionless  unit  x  w  2)  and  the  number  of  lattice 
points  to  be  64. 

We  write  6(x,.f)  in  the  following  form 

e(jc,r)  =  eo(*)  +  £e(jt,f),  (48) 

where  9o  represents  the  static  solution  and  the  fluctuation  8©  (x,t)  is  a  combination  of 
various  harmonic  modes  of  the  form 

(49) 

Here,  L  is  the  length  of  the  system  (8  in  our  case)  and  N  is  the  number  of  lattice  points  (64 
for  us).  a,j(t)  is  the  amplitude  of  theyth  mode  which  is  got  by  inverting  the  series  given 
in  (49): 


aj(t)  =         <56(jc,  t)  sin  djc.  (50) 

LJQ  \£/ 

This  way,  the  problem  is  converted  into  one  corresponding  to  a  system  of  coupled 
anharmonic  oscillators  and  one  can  study  the  time  evolution  of  the  amplitudes  of  the 
modes.  This  is  essentially  the  Fermi-Pasta-Ulam  approach  for  coupled  systems  [21]. 

As  U  has  to  be  well-defined  at  r  =  0  and  U  -*  1  as  r  —  >  oo,  we  have  the  boundary 
conditions 

6Q(Qtt)  =  6Q(L,t)  =  Q  (51) 

which  is  automatically  satisfied  by  the  harmonic  expansion  for  £6  in  (49).  Moreover, 
we  take 

e(x,0)=0.  (52) 

Also,  we  excite  only  the  JV/2th  or  32nd  mode  initially,  following  standard  practice. 
Thus, 

«/(0)=0       j^32.  (53) 

A  =  032(0)  is  our  perturbation  parameter. 

We  now  integrate  out  the  equation  of  motion  in  (45)  with  the  aforementioned  initial 
conditions  using  a  standard  Runge-Kutta  routine  with  a  time  step  0.001  and  5  x  105 
iterations,  i.e.,  real  time  t  =  500.  The  total  energy  given  by  H'  in  equation  (44)  was 
constant  to  an  accuracy  of  0.2%  for  this  time  step.  We  have  studied  the  time  evolution  of 
the  system  for  various  values  of  the  perturbation  parameter  A. 

We  have  plotted  the  fluctuation  8Q(x,  t}  at  t  =  0  and  500  for  A  =  0.1  in  figure  5.  Apart 
from  large  fluctuations  near  x  =  0,  \6Q(x,  t)\  w  0.1  in  this  case.  This  is  understandable  as 
the  skyrmion  dynamics  is"  dominated  by  the  small  x-region  and  the  deviation  from  the 
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Figure  5.    5&  (x,  t)  at  r  =  0  (dotted  line)  and  t  =  500  (continuous  line)  for  A  =  0.1, 
in  the  Skyrme  model. 
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Figure  6.    6Q(x,  t]  at  t  ~  0  and  t  -  500  for  A  =  2. 
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Figure  7.    Amplitudes  Oj(t)  of  the  modes  corresponding  to  j  =10,28,32  and  40 
plotted  against  time  for  A  =  0.1  in  the  Skyrme  model. 
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Figure  8.    Amplitudes  o/(f)  for  A  =  1 
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skyrmion  is  small  when  A  =  0.1.  Contrast  this  with  the  behaviour  of  8Q(x,  t)  for  A  —  2  in 
figure  6.  Now  SQ(x:  t]  has  violent  fluctuations  for  substantially  longer  intervals  in  x.  For 
A  =  2,  the  initial  fluctuation  is  of  the  same  order  of  magnitude  as  the  static  solution  itself 
and  after  a  long  time,  9  would  have  no  resemblance  to  the  skyrmion! 

The  behaviour  of  the  amplitudes  of  various  harmonic  modes  is  an  important  criterion 
for  regularity  or  chaos.  We  have  plotted  the  amplitudes  a.j(t)  for  j  =  10,  28,  32  and  40  as 
functions  of  time  for  A  =  0.1  and  2  in  figures  7  and  8  respectively.  For  A  =0.1,  the 
amplitude  of  the  (initial)  32nd  mode  has  an  extremely  regular  periodic  behaviour  (note 
that  it  does  not  remain  constant!).  However  the  amplitudes  of  the  other  modes  increase 
gradually  on  the  average.  The  energy  transfer  to  other  modes  is  expected  to  be  substantial 
after  a  longer  interval  of  time.  In  contrast  to  this,  for  A  =  2,  the  amplitude  of  the  initial 
mode  decreases  gradually  on  the  average,  though  there  is  a  periodicity  in  the  variation. 
After  t  =  200,  other  modes  are  excited  substantially.  It  is  to  be  expected  that  after  a 
longer  interval  of  time,  all  the  modes  would  be  of  same  magnitude  leading  to 
'thermalization'  and  'spatio-temporal  chaos'. 

Finally,  we  have  plotted  the  root  mean  square  radius  of  the  soliton,  normalized  to  its 
initial  value,  as  a  function  of  time  for  A  =  0.1  and  2  in  figures  9  and  10.  For  A  =  0.1  the 
radius  of  the  soliton  has  a  periodic  behaviour,  whereas  for  A  =  2,  the  radius  increases 
gradually  on  the  average. 

We  have  increased  the  spatial  resolution  by  a  factor  of  2  by  taking  the  number  of  lattice 
point,  N,  to  be  128  and  checked  that  there  is  an  insignificant  change  in  the  numerical 
results. 


1.30 


0.60 


500 


Figure  9.    Variation  of  the  root  mean  square  radius  (r2)1'2  (  normalized  to  unity  at 
t  =  0),  with  time  for  A  =  0.1  in  the  Skyrme  model. 
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Figure  10.    Variation  of  (r2)172  for  A  =  2. 


Hence,  when  the  strength  of  the  initial  fluctuations  is  small,  the  evolution  of  the  system 
is  somewhat  regular  atleast  for  t  w  500;  however,  there  are  already  hints  that  the 
fluctuations  would  become  random  after  a  longer  interval  of  time.  If  the  strength  of  the 
perturbation  is  increased,  energy  transfer  to  other  modes  (different  from  the  initial  one)  is 
substantial  and  the  fluctuations  are  now  chaotic.  This  is  a  clear  indication  that  the  phase 
space  near  the  Skyrme  soliton  exhibits  spatio-temporal  chaos  and  the  system  undergoes 
"thermalization"  after  a  sufficiently  long  time.  The  system  is  non-integrable.  There  is  no 
evidence  of  any  phase  transition  in  the  system. 

It  might  be  thought  that  the  above  behaviour  is  characteristic  of  any  non-linear  system. 
As  the  Skyrme  model  is  a  highly  non-linear  system  with  an  interaction  term  which  has 
four  space-time  derivatives  of  the  field,  this  could  have  been  anticipated.  However,  it 
should  be  remembered  that  the  skyrmion  is  a  topological  object.  The  topological 
conservation  law  imposes  its  own  constraint  on  the  dynamics.  As  the  fluctuations  satisfy 
the  boundary  condition  60(0,0  =  60(oo,0  =  0  at  all  times,  the  baryon  number  is  not 
affected  by  the  dynamics.  Our  work  has  revealed  that  within  the  constraint  of  the 
topological  conservation  law,  the  soliton  exhibits  a  chaotic  behaviour  after  a  sufficiently 
long  time,  for  large  enough  perturbations. 


5.  Stability  of  the  B  =  1  soliton  of  the  chiral  quark-meson  model 

In  this  section,  we  investigate  the  dynamical  behaviour  of  the  chiral  quark-meson  model 
along  the  same  lines  as  in  §4  [26], 
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Figure  11.     8F\  (x,  t)a.tt  =  0  (dotted  line)  and  t  =  500  (continuous  line)  for  A  =  2  in 
the  chiral  quark-meson  model.  The  two  curves  merge  after  some  value  of  x. 


In  the  time-dependent  case,  for  the  B  =  1  soliton,  we  take: 
U  =  cos  O(r,  i)  —  h  -r  sin  6(r,  t) 


exp(—iet) 


for  a  consistent  set  of  equations. 
The  equations  of  motion  following  from  LCQM  in  (23)  would  now  be: 


(54) 


(55) 


=  -F'2 


2F? 


G2  = 


(-e  +  cos  6)G2  -  sin  6F2 

=  -G'2  -  (e  +  cos  9)F2  ~  sin  9G2 

2F2 


-e  +  cos0)Gi+sin0Fi 


F2  =  Gi  +  (e  +  cos  6)Fi  +  sin  6Gi 
e     9"     26'     sin20 

\J         \J  I  ^ 

x          x2 


G2F2)}  -  /52sin6  =  0, 


(56) 
(57) 
(58) 
(59) 


(60) 


where  the  scaled  variables  x,  Gz(;c)  and  F,-(JC)  are  defined  in  (33)  and  prime  denotes 
differentiation  with  respect  to  jc.  As  in  the  case  of  the  Skyrme  model,  we  take  the  length 
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Figure  12.     <5G2(jc,  r)  at  t  =  0  and  t  =  500  for  A  =  2. 

of  the  system  to  be  8  and  the  number  of  lattice  points  to  be  64.  The  fields  at  any  time  can 
be  written  as 


where 


N-l 


(61) 


(62) 


Here  *  stands  for  the  fields  Gh  Fls  G2,  F2  or  Q  and  *,0  is  the  static  solution  'for  the 
corresponding  field.  (G2,o  =  F2fl  =  0).  af(t)  represents  the  amplitude  of  the  jth  mode  of 
the  fluctuation  in  field  ^.  At  any  time  af(t)  can  be  found  by  inverting  the  series  in 
(62),  i.e., 


(63) 


We  take 


*(*,0)  =  0  (64) 

as  in  the  previous  section.  We  excite  the  32nd  mode  initially  for  8,  GI  and  FI,  G2  and  F2: 

4(0)=^.  (65) 

^  =  0),  (66) 
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Figure  13.     8Q  (x,  t)  at  t  =  0  (dotted  line)  and  t  =  500  (continuous  line)  for  A  =  2  in 
the  chiral  quark-meson  model. 


(67) 

=  1-1),  (68) 

where  A  is  the  perturbation  parameter.  In  eqs.  (66)-(68),  we  have  the  factor  Gi^O) 
and  Fifl(x=  1.1)  to  ensure  that  the  perturbation  has  the  natural  scale  for  GI  ,  FI,  GI 
and  F2. 

We  now  integrate  the  5  coupled  equations  (56)  to  (60)  using  the  standard  Runge-Kutta 
routine  with  a  time  step  0.001  and  5  x  105  iterations,  i.e.,  real  time  t  =  500.  The  time 
evolution  of  the  system  is  numerically  determined  for  various  values  of  the  amplitudes. 

We  have  plotted  the  fluctuations  8F\ ,  8G2  and  6©  (x,  t)  for  t  =  0  and  500  for  A  =  2  in 
figures  11-13.  We  observe  that  there  is  very  little  variation  in  these  functions  except  at 
the  origin  even  though  the  amplitude  is  large.  This  is  strikingly  different  from  the  time 
evolution  of  the  fluctuation  around  the  soliton  in  the  Skyrme  model,  for  the  same  value  of 
the  amplitude  of  the  32nd  mode. 

We  have  plotted  the  amplitudes  of  the  modes  j  —  10,28,  32  and  40  corresponding  to  the 
functions  GI,  F\  and  0  in  figures  14-16.  Again  there  is  very  little  variation  in  the 
amplitude  of  the  initial  32nd  mode  even  after  t  =  200.  The  amplitudes  of  all  the  other 
modes  remain  very  small  even  after  a  long  time,  for  all  the  functions. 

Hence,  for  reasonably  large  fluctuations,  the  system  exhibits  a  very  regular  behaviour. 
It  is  thus  clear,  that  the  soliton  of  the  chiral  quark-meson  model  is  much  more  stable  than 
the  Skyrmion  for  comparable  strengths  of  fluctuations. 
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Figure  14.    Amplitudes  a.j(t)  of  the  function  G\  corresponding  to;  =  10, 28, 32  and 
40  plotted  against  time  for  A  =  0.1  in  CQM. 
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Figure  15.    Amplitudes  aj(t)  of  the  function  F\  for  A  =  2. 
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Figure  16.    Amplitudes  a/(r)  of  the  function  9  for  A  =  2,  in  CQM. 
6.  Discussion 

In  this  article,  we  have  described  the  'soliton  solutions'  of  two  nonlinear  chiral  models, 
namely,  the  Skyrme  model  and  the  chiral  quark  meson  model  and  their  identification  with 
the  baryons,  the  fundamental  constitutents  of  matter.  It  is  fascinating  that  a  fully  bosonic 
theory  like  the  Skyrme  model  has  topological  soliton  solutions  which  are  identified  with 
fermions.  This  is  because  of  the  presence  of  the  topological  Wess-Zumino  term  and  the 
collective  quantization  procedure,  discussion  of  which  is  beyond  the  scope  of  this  review. 
The  interactions  between  the  solitons  of  the  two  models  has  been  studied  and  it  has  been 
shown  that  the  gross  features  of  the  internucleon  potential  are  reproduced.  The  scattering 
properties  of  these  solitons  in  3  +  1  dimensions  is  yet  to  be  investigated.  However,  there 
are  some  very  interesting  results  on  soliton  scattering  in  the  Skyrme  model  in  2  +  1 
dimensions.  Peyrard,  Piette  and  Zakrzewski  have  shown  that  in  scattering  processes 
initiated  at  low  relative  velocities,  the  Skyrmions  bounce  back  while  at  velocities  larger 
than  some  critical  value,  they  scatter  at  right  angles  [37].  Moreover,  the  Skyrmions 
preserve  their  shape  after  collision.  This  seems  to  be  a  characteristic  feature  of  solitons  of 
any  S2  model  in  2  +  1  dimension.  There  are  indications  that  in  all  head-on  collisions  of  N 
indistingushable  soliton  like  objects  in  2  + 1  dimensions,  the  scattering  angle  is  n/N 
[38].  It  would  be  of  great  interest  to  study  the  scattering  of  Skyrmions  and  related 
topological  objects  in  3  +  1  dimensions  to  find  whether  they  retain  their  shape  and  size 
and  whether  the  scattering  angles  are  still  discretized. 

We  have  considered  another  aspect  of  the  soliton  solutions  in  these  models,  namely 
stability  with  respect  to  fluctuations.  It  is  known  that  non-abelian  gauge  theories  with  and 
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without  the  Higgs  fields  exhibit  spatio-temporal  chaos.  We  have  estabilished  that  the 
Skyrme  model,  an  effective  Lagrangian  theory  based  on  QCD,  also  shares  this  feature,  by 
direct  numerical  computations.  It  is  interesting  to  note  that  topological  stability  and 
spatio-temporal  chaos  coexist  in  the  model.  In  contrast  to  this,  the  chiral  quark-meson 
model  which  is  also  QCD  based  is  very  stable  for  comparable  strengths  of  fluctuations 
around  the  soliton  solutions  and  comparable  time  scales.  The  region  of  phase  space 
around  the  soliton  solution  of  the  model  exhibits  more  regularity,  in  contrast  to  the  phase 
space  around  the  Skyrme  soliton.  The  reason  perhaps  lies  in  the  fact  that  the  couplings  in 
the  Skyrme  model  are  more  nonlinear  due  to  the  four-derivative  terms.  In  the  chiral  quark 
meson  model,  the  four-derivative  term  is  absent  and  instead  of  that  we  have  the  Yukawa 
coupling  term.  It  would  be  interesting  to  study  the  dynamical  behaviour  of  other  field 
theories  with  fermion  fields  to  find  whether  they  are  also  very  stable. 

The  large  fluctuations  around  the  chiral  soliton  solutions  that  we  have  discussed  would 
be  of  importance  when  we  consider  interactions  among  them.  Chaotic  field  configura- 
tions have  been  known  to  play  a  role  in  multiparticle  hadron  production  processes;  indeed 
a  chaoticity  parameter  is  introduced  to  explain  the  average  multiplicity [39].  Our  work  is  a 
preliminary  exercise  in  demonstrating  concretely  how  spatio-temporal  chaos  arises  in 
field-theoretic  models  of  strong  interactions.  It  remains  to  be  seen  how  specific  physical 
processes  are  influenced  by  it. 
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Abstract.  We  review  a  variety  of  control  methods  which  are  capable  of  enhancing  the  chaoticity 
and  mixing  properties  of  chaotic  flows  and  also  methods  which  work  towards  promoting  the 
coherence  properties  of  such  flows.  We  discuss  a  parameter  control  method  which  can  enhance  the 
chaoticity  and  the  rate  of  mixing  of  dissipative  as  well  as  conservative  flows  and  outline  methods 
which  promote  global  mixing  by  the  addition  of  noise  and  by  preventing  island  formation.  As  the 
inverse  side  of  this  problem,  we  summarize  methods  which  can  create  coherent  structures  in  chaotic 
dynamical  flows.  We  also  discuss  the  utility  of  these  methods  from  the  point  of  view  of  applications 
as  well  as  for  understanding  phenomena  which  occur  in  natural  systems. 

Keywords.    Lagrangian  chaos;  extended  systems;  control. 
PACS  No.    5.45 


1.  Introduction 

The  control  of  chaos  has  been  a  topic  of  much  recent  interest  [1],  The  control  of  chaos  has 
found  interesting  applications  in  contexts  ranging  from  lasers,  mechanical  systems,  and 
electronic  circuits,  to  chemical  reactions  and  biological  systems.  Some  of  this  effort  has 
been  directed  towards  the  imposing  of  periodic  orbits  on  chaotic  systems,  other  directions 
of  effort  have  been  towards  maintaining  chaos  in  contexts  where  the  destruction  of 
chaotic  behaviour  is  undesirable.  The  enhancement  of  chaos  via  control  methods  can  also 
find  useful  applications.  We  discuss  control  methods  which  work  towards  enhancing 
chaos  in  chaotic  flows  in  this  review.  We  also  discuss  methods  which  work  towards 
creating  coherence  in  chaotic  flows. 

The  enhancement  of  chaos  can  be  desirable  and  useful  in  a  variety  of  situations.  In  the 
case  of  biological  systems,  there  are  several  instances  of  situations  where  maintaining  or 
enhancing  chaos  is  desirable.  It  has  been  suggested  that  the  pathological  destruction  of 
chaotic  behaviour  may  be  responsible  for  heart  failure  [2],  and  some  types  of  brain 
seizures  [3].  Techniques  which  are  capable  of  enhancing  and  maintaining  chaos  could  be 
useful  in  such  contexts  [4]. 

An  important  example  of  a  situation  where  enhancing  chaos  is  useful,  is  the  process  of 
mixing  [5,  6].  The  mixing  properties  of  a  flow  are  important  in  various  engineering  and 
industrial  problems  like  combustion  processes  [7],  viscous  liquids  [8],  chemical  reactions 
[9,  10],  heat  transfer  processes  [11].  Often  the  performance  of  a  particular  process,  its 
yield,  the  rates  and  even  the  product  depends  on  the  mixing  properties.  For  example,  a 
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well  mixed  fuel-air  mixture  can  give  rise  to  efficient  combustion  and  a  chemical  reaction 
in  which  the  reactants  are  better  mixed  enhances  the  rate  of  reaction  and  the  yield. 
Therefore  it  is  essential  to  look  at  the  underlying  dynamical  properties  which  give  rise  to 
efficient,  or  poor  mixing.  Mixing  is  a  consequence  of  the  stretching  and  folding  of  chaotic 
flows.  A  system  which  has  exponential  stretching,  as  in  a  chaotic  flow,  can  mix 
efficiently.  Many  mixing  processes  like  fluid  flows,  combustion  processes,  chemical 
reactions,  heat  transfer  processes,  etc  can  be  modelled  by  chaotic  flows  [6].  An 
enhancement  of  the  chaoticity  of  such  systems  can  lead  to  an  enhancement  of  the  rate  of 
mixing,  an  outcome  which  has  desirable  consequences  in  many  of  the  situations 
described  above. 

Coherent  structures,  on  the  other  hand  are  the  flip  side  of  the  coin.  These  are  regions  in 
the  flow  which  remain  unmixed  and  may  or  may  not  permit  the  exchange  of  trajectories 
with  globally  mixing  surrounding.  The  presence  of  a  coherent  structure  in  otherwise 
strongly  mixing  surroundings  is  a  puzzling  but  ubiquitous  feature  of  natural  systems. 
Examples  of  coherent  structures  can  be  found  in  a  variety  of  natural  contexts,  ranging 
from  planetary  [12]  and  geophysical  [13]  systems,  to  chemical  mixers  [14],  coupled 
oscillator  arrays  [15]  and  turbulent  flows  [16].  Again,  low  dimensional  chaotic  flows  have 
proved  to  be  capable  of  encapsulating  much  of  the  generic  behaviour  seen  in  such 
systems  [17-19]. 

It  is  therefore  interesting  to  inquire  into  the  reasons  behind  the  formation  of  coherent 
regions  created  hi  such  chaotic  flows.  Control  methods  capable  of  the  creation  of 
coherent  structures  in  chaotic  flows  can  provide  insights  into  these  reasons.  These 
structures  could  also  find  useful  applications.  A  very  interesting  application  of  such 
structures  has  been  proposed  recently.  Such  structures  could  be  utilised  to  control  the  rate, 
products  and  yield  of  chemical  reactions  by  using  them  as  controlled  release  capsules 
[20],  The  idea  is  to  release  one  or  more  of  the  reactants  in  a  chemical  reaction  at  a 
specific  time  or  to  control  the  concentration  of  such  reactants.  These  reactants  are  placed 
on  coherent  structures  and  the  flow  is  mixed.  The  coherent  structure  prevents  exchange  of 
these  reactants  with  the  surrounding  environment.  Then  at  a  desired  time  the  structure  can 
be  destroyed  by  application  of  noise  and  the  reactants  released  into  the  reaction.  Such  a 
capsule  can  be  used  in  a  continuous  flow  reactor  and  permits  the  control  of  various 
reaction  rates  and  products. 

The  existence  of  mixing  and  coherent  regions  in  flows  can  have  relevance  for  a  variety 
of  other  phenomena.  The  flow  region,  as  visualized  by  injecting  a  passive  scalar  such  as 
a  dye,  acts  as  a  basic  template  for  various  interesting  physical  processes  such  as 
aggregation,  coagulation  or  breakup  [21,  22].  The  fabric  of  the  underlying  chaotic 
dynamics  provides  a  template  on  which  various  processes  like  aggregation,  coagulation 
and  drop  breakup  occurs.  By  exploiting  the  properties  and  symmetries  of  this  fabric,  the 
coherence  and  mixing  properties  of  mixing  flows,  one  can  understand  and  manipulate 
these  processes  better.  Here  again,  one  can  show  that  the  shape  and  sizes  of  various 
aggregates  and  rates  of  various  processes  depend  on  mixing  properties  and  presence  of 
coherent  structures. 

It  is  clear  that  many  of  the  situations  which  constitute  our  motivation  correspond  to 
spatially  extended  systems  such  as  fluid  flows.  It  is  therefore  relevant  to  discuss  the  sense 
in  which  the  analysis  of  the  dynamics  of  low-dimensional  systems  in  phase  space  is 


are  relevant  to  mixing  and  coherence  in  real  space  in  the  same  sense  in  which  the 
Lagrangian  description  of  a  two-dimensional  fluid  defined  in  physical  space  by  the  fluid 
velocity  field  obtainable  from  a  stream-function  can  be  mapped  on  to  a  Hamiltonian  flow 
in  two  dimensional  phase  space  [23,5].  The  evolution  of  the  trajectories  of  the  system 
under  the  Lagrangian  description,  corresponds  to  the  evolution  of  tracer  particles  in  the 
fluid.  The  system  is  effectively  many-dimensional  as  many  initial  conditions  evolve  at  the 
same  time.  Such  a  picture  can  be  extended  even  to  turbulent,  spatially  extended  problems, 
such  as  turbulent  jets  [19].  Control  methods  for  such  systems  must  be  capable  of 
constraining  or  mixing  many  system  trajectories.  It  will  be  seen  that  the  control  methods 
under  discussion  succeed  in  achieving  this  objective. 

We  review  a  variety  of  control  methods  which  address  some  of  the  issues  raised  above. 
We  discuss  methods  for  the  enhancement  of  chaos,  mixing  and  mixing  rates  in  §  2.  We 
discuss  methods  which  work  towards  enhancing  mixing  via  enhancing  stretching  rates, 
via  the  addition  of  noise  and  by  preventing  the  formation  of  islands  in  time  periodic 
Hamiltonian  flows.  Section  3  discusses  methods  for  the  formation  of  coherent  structures 
in  chaotic  flows  and  maps.  The  methods  discussed  include  that  of  introducing  a  fold  and 
that  of  using  localized  parameter  perturbations  in  the  vicinity  of  low  order  periodic 
points.  Section  4  discusses  phenomena  involving  active  tracers  viz.  aggregation  and 
breakup.  A  brief  discussion  (§  5)  concludes  this  review. 


2.  Enhancing  chaos,  mixing  and  the  rate  of  mixing 

Enhancing  mixing  and  the  rate  of  mixing  in  a  process  is  a  topic  of  much  recent  research, 
for  the  reasons  of  obvious  practical  advantage  listed  above.  Recently  some  novel  methods 
to  achieve  this  goal  have  been  proposed  [22,  24,  25].  These  processes  use  different  ways 
to  increase  mixing,  and  the  rate  of  mixing.  While  one  of  the  proposed  methods  works  for 
time  periodic  flows  by  exploiting  the  underlying  symmetries  of  the  flow  to  prevent  the 
island  formation  which  constitutes  a  barrier  to  efficient  mixing  [22],  another  method  [25] 
considers  the  effect  of  noise  on  such  flows  and  its  role  in  enhancing  mixing.  Recently,  a 
new  method  has  been  proposed  [24],  which  enhances  the  rate  of  mixing  of  chaotic  flows 
by  enhancing  their  chaoticity.  This  method  enhances  the  chaoticity  as  measured  by  the 
average  Lyapunov  exponent  by  optimally  perturbing  the  system  parameters  in  regions  of 
low  local  Lyapunov  exponent.  This  increase  in  chaoticity  translates  into  an  enhancement 
of  the  rate  of  mixing,  and  can  also  be  useful  in  other  contexts  where  the  enhancement  and 
maintenance  of  chaos  is  important  [4].  We  summarize  this  method  below. 

2.1  Enhancing  chaos  and  the  rate  of  mixing  via  control  of  stretching  rates 

This  method  has  the  two-fold  objective  of  enhancing  chaos  and  thereby  enhancing  the 
rate  of  mixing  of  the  system.  An  important  parameter,  which  characterizes  the  degree  of 
chaos  in  a  chaotic  flow  is  the  Lyapunov  exponent,  which  gives  the  average  rate  of 
stretching.  However,  the  rate  of  stretching  is  not  uniform  over  a  chaotic  attractor  in  the 
case  of  dissipative  flows  or  over  the  phase  space  of  a  conservative  flow.  Thus  the  local 
Lyapunov  exponent  (LLE),  a  measure  of  the  local  rate  of  stretching,  is  different  in 
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different  regions  of  the  phase  space.  The  nonuniform  nature  of  the  spatial  distribution 
of  the  LLEs  is  exploited  to  construct  a  mechanism  that  can  enhance  chaos  and  hence 
the  rate  of  chaotic  mixing.  The  average  rate  of  stretching  is  enhanced  by  introducing 
a  small  parameter  perturbation  which  enhances  the  LLE  whenever  the  system  trajectory 
visits  a  region  where  the  LLEs  take  values  much  smaller  than  their  average  value. 
This  procedure  works  quite  efficiently  as  small  perturbations  in  parameter  made  for 
small  times  compared  to  the  total  time  of  evolution  can  lead  to  substantial  enhancement 
of  the  Lyapunov  exponent  and  to  the  rate  of  mixing.  The  method  is  applicable  to 
both  dissipative  and  conservative  examples  and  discusses  mixing  in  phase  space. 
However,  here  as  in  all  other  contexts  considered  in  this  review,  the  conservative 
examples  are  relevant  to  mixing  in  real  space  in  the  following  sense.  Consider  a  two- 
dimensional  fluid  with  a  fluid  velocity  field  (M(jt,y,/),v(jt,y,  ?)).  For  an  incompressible 
fluid,  we  have 

ux  +  vy  =  Q.  (1) 

The  fluid  velocity  field  can  be  obtained  from  an  exact  differential  dijj  using  the  equations 


where  ijj  =  i/j(x,y,t)  is  known  as  the  stream-function. 

The  Lagrangian  description  in  two  dimensions  involves  following  fluid  particle 
trajectories  (e.g.  those  of  tracer  particles)  obtained  by  solving  the  equations 

x  =  u(x:y,t],     y  =  v(x,y,t}.  (3) 

Using  equation  2  above,  we  have 


dy       '        ~  dx       ' 

Thus  the  stream-function  behaves  like  the  Hamiltonian  of  a  Hamiltonian  flow  in  two- 
dimensional  phase  space.  The  evolution  of  tracer  particle  trajectories  in  the  two- 
dimensional  fluid  has  been  mapped  on  to  the  Hamiltonian  dynamics  in  two-dimensional 
phase  space.  Thus,  statements  about  the  mixing  or  coherence  of  the  phase-space 
trajectories  map  on  to  the  behaviour  of  the  tracer  particle  trajectories. 

Consider  an  autonomous  nonlinear  dynamical  system,  of  dimension  n,  evolving  via  the 
equations 

x  =  F(x,AO,  (5) 

where  the  set  of  parameters  \L  takes  values  such  that  the  trajectory  shows  chaotic 
behaviour.  Let  w(x,  t)  be  the  tangent  vector  to  the  trajectory  at  the  point  x  and  time  t.  The 
evolution  of  w  is  given  by 

w  =  (w  •  V)F.  (6) 

The  Lyapunov  exponent  of  the  system  is  defined  by 
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where  x(0)  is  the  value  of  x  at  t  =  0  and  1 1 w|  |  is  the  norm  of  w.  We  now  define  the  local 
Lyapunov  exponent  A(x)  as 

\f  ^      «m    l  i    IKxfr  +  AfJ.i  +  AQH 

A(x)  =  hm  -T—  In n — 7— rr — rri •  (o) 

v  '     Af-,o  Ar  ||w(x(r),f)|| 

The  quantity  A(x)  represents  the  local  rate  of  stretching  at  the  point  x.  This  is,  in  general, 
not  uniform  over  the  attractor.  Clearly,  the  Lyapunov  exponent  A  (eq.  (7))  is  the  average 
value  of  the  LLEs  for  a  long  orbit . 

The  control  mechanism  is  activated  in  the  regions  where  the  local  Lyapunov  exponent 
falls  below  its  average  value,  that  is  if 

A(x)<(A-7O  (9) 

where  a\  is  the  standard  deviation  of  the  distribution  of  local  Lyapunov  exponents  and  7  is 
some  chosen  factor.  The  control  is  activated  so  as  to  change  the  parameter  /u  of  the  system 
to  (j,  +  sdfj,.  Here  s  takes  values  -1  or  -hi  and  d/^  is  small.  The  control  remains  active 
as  long  as  the  above  condition  is  satisfied,  after  which  it  is  reset  to  the  previous  value. 
The  sign  of  s  is  decided  in  the  following  manner.  Let  WT  be  a  row  vector  and  MT  given 
by  MT  —  VF.  The  equation  for  the  norm  of  W  can  be  written  as 

\\W\\2  =  WT(MT+M)W.  (10) 

The  rate  of  change  in  the  norm  due  to  the  change  in  the  parameter  should  be  positive  in 
order  to  increase  the  local  rate  of  stretching  i.e.  s  should  be  chosen  such  that  the  quantity 

A||W||2  =  ||W(/,  +  dM)||2  -  j|^)||2  ~  WT(M/  +  M»)Wd»  (11) 

where  M^  =  dM/dy,  should  be  positive. 

The  above  equation  is  obtained  from  an  expansion  to  linear  order,  however  the  effect  of 
the  change  in  parameter  is  nonlinear  and  often  changes  in  the  Lyapunov  exponent  are 
much  greater  than  those  predicted  by  a  linear  equation  like  (11). 

The  above  control  procedure  gives  rise  to  a  substantial  enhancement  of  the 
chaoticity  of  the  system,  as  measured  by  the  average  Lyapunov  exponent.  This  can 
be  demonstrated  by  plotting  the  Lyapunov  exponent  for  both  controlled  and  uncontrolled 
flows.  This  is  demonstrated  in  figure  1  for  the  Lorenz  system  [26]  at  parameter 
values  a  =  10.0,  r  =  30.0  and  b  —  8/3.  The  parameter  r  is  chosen  as  the  control 
parameter  and  dr  =  1.0,  7  =  0.5.  The  Lyapunov  exponent  is  enhanced  from  0.95  to 
1.44.  Figure  1  shows  the  behaviour  of  A  with  r  in  both  controlled  and  uncontrolled 
systems.  As  can  be  clearly  seen,  there  is  a  substantial  enhancement  in  A  over  a  range  of 
parameter  values. 

It  can  also  be  demonstrated  that  this  change  in  the  Lyapunov  exponent  actually  gives 
rise  to  an  enhancement  of  the  rate  of  mixing.  The  attractor  is  covered  with  a  grid  of  boxes 
and  a  large  number  of  initial  conditions  are  chosen  in  a  particular  box.  These  initial 
conditions  are  then  allowed  to  evolve  without  the  control.  The  same  set  of  initial 
conditions  is  allowed  to  evolve  separately  with  the  control  procedure  operative.  The 
initial  conditions  in  one  box  spread  over  a  number  of  boxes  in  course  of  evolution.  A 
comparison  of  the  number  of  occupied  boxes,  i.e.  the  number  of  boxes  which  have  at  least 
one  of  the  evolving  initial  conditions  as  a  function  of  time  for  the  controlled  and 
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Figure  1.  The  Lyapunov  exponent  for  Lorenz  system  at  parameter  values 
ff  =  10.0,  r  =  30.0  and  b  =  8/3.  The  parameter  r  is  chosen  as  the  control  parameter. 
The  figure  shows  the  behaviour  of  A  with  r  in  both  controlled  and  uncontrolled  Lorenz 

system. 


uncontrolled  systems  gives  an  idea  of  the  relative  rates  of  mixing  of  the  two  systems. 
Figure  2  plots  this  number  for  both  controlled  and  uncontrolled  cases  for  Lorenz  system. 
It  is  clear  that  the  controlled  system  mixes  at  a  faster  rate  than  the  uncontrolled  one,  and 
that  the  control  procedure  has  successfully  manipulated  the  entire  ensemble  of 
trajectories  in  a  manner  in  which  the  rate  of  mixing  is  enhanced. 

The  procedure  used  above  to  enhance  chaos  and  the  mixing  rate  can  be  easily  modified 
to  apply  to  the  case  of  discrete  maps.  For  maps,  the  evolution  equation  (eq.  (5))  can  be 
written  as 


=f(Xr,jLt), 


(12) 


where  xr  are  the  dynamical  variables  at  time  t.  The  evolution  of  the  tangent  vector  w  is 
given  by 


wr+i  =  (w,  •  V)  f . 


(13) 


The  control  procedure  is  the  same  as  above.  The  parameter  fj,  is  changed  to  n  +  sd/j,  when 
condition  (9)  is  satisfied.  To  decide  the  sign  s  we  write  (13)  in  matrix  form  as 


Wt+l=MWt, 
where  MT  =  Vf .  The  equation  for  the  norm  of  W  is 


236 


(14) 


(15) 
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Figure  2.    The  number  of  occupied  boxes  as  a  function  of  time  for  the  controlled  and 
uncontrolled  Lorenz  systems. 


Thus  the  rate  of  change  in  the  norm  of  W  due  to  change  in  the  parameter  is  given  by 


=  Wf  (MrMM  +  M 


(16) 
(17) 


where  the  last  step  is  obtained  by  expanding  to  lowest  order  in  d/j  and  AfM  =  dM/dfi.  For 
control  to  enhance  chaos  and  rate  of  mixing,  the  sign  s  for  the  parameter  change  dfj,  must 
be  such  that  A||W,+i||2  is  positive. 

The  Lorenz  system  discussed  above  is  an  example  of  a  dissipative  flow.  However,  the 
procedure  is  completely  general  and  can  be  applied  to  conservative  systems  as  well, 
provided  the  change  in  parameter  is  made  in  a  way  in  which  the  conservative  nature  of 
the  flow  is  preserved.  We  demonstrate  the  increase  in  chaos  and  rate  of  mixing  in  the 
standard  map  [23]. 


Yt+i  =  Yt 


(18) 


where  the  variable  X  and  Y  lie  between  [0, 1].  The  increase  in  the  rate  of  spread  of  initial 
conditions  in  phase  space  of  the  controlled  standard  map  as  compared  to  the  uncontrolled 
one  is  demonstrated  in  figure  3.  The  parameter  values  are  K  =  1.5,  dK  =  0.2, 7  =  0.25.  It 
can  be  easily  seen  that  the  controlled  map  retains  its  area-preserving  property.  We  iterate 
10,000  initial  conditions  initially  in  a  small  box.  Figures  3(a)  and  3(b)  show  the 
uncontrolled  and  controlled  systems  respectively  after  10  iterations.  Figures  3(c)  and  3(d) 
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Figure  3.    (a)  and  (b)  show  the  uncontrolled  and  controlled  systems  respectively 
after  10  iterations,  (c)  and  (d)  show  the  corresponding  figures  after  30  iterations. 


show  the  corresponding  figures  after  30  iterations.  It  is  clear  that  the  controlled  system 
has  spread  out  more  than  the  uncontrolled  situations.  Thus  the  rate  of  mixing  is  enhanced. 
The  Lyapunov  exponent  for  the  orbit  with  the  initial  condition  x  =  0.2,  y  =  0.3  increases 
from  A  =  0.286  to  A  =  0.384.  The  invariant  KAM  tori  in  the  phase  space  of  the  standard 
map  function  as  barriers  to  global  mixing.  If  the  effect  of  control  on  asymptotic  global 
mixing  is  observed,  the  controlled  and  uncontrolled  attractors  look  very  similar  in  both 
cases  except  that  the  area  of  empty  regions  in  the  uncontrolled  situation  is  larger  than  the 
area  of  empty  regions  in  the  controlled  situation. 

The  control  procedure  works  quite  efficiently  as  it  produces  a  substantial  enhancement 
of  the  Lyapunov  exponent  for  quite  small  changes  in  the  parameters.  This  is  due  to  the 
fact  that  the  control  procedure  works  by  switching  between  three  types  of  chaotic  flows, 
those  characteristic  of  parameter  values  /^,  ^  +  d/j,  and  (j,  —  d/i.  This  switching  introduces 
an  extra  time  dependence  in  the  problem  and  is  the  origin  of  the  efficiency  of  the 
procedure. 
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Thus,  this  mechanism  is  successful  in  enhancing  the  rate  of  mixing  of  the  system  and 
also  in  enhancing  the  chaoficity  of  the  system.  It  can  therefore  be  useful  in  contexts  where 
the  maintaining  and  enhancement  of  chaos  is  important.  Other  methods  of  enhancing 
mixing  have  also  been  proposed  in  various  specific  contexts.  A  very  simple  way  of 
enhancing  the  mixing  in  a  flow  is  by  the  addition  of  noise. 

2.2  Noise  and  mixing 

The  Lagrangian  trajectory  of  particle  is  given  by  (3).  However  in  realistic  situations  there 
is  a  stochastic  correction  applied  to  this  equation.  This  correction  may  be  due  to  the 
Brownian  motion  of  the  constituent  particles  or  due  to  extrinsic  noise. 

The  equation  of  motion  for  a  diffusive  particle  is  given  by  the  generalized  Langevin 
equation  where 

dx 

^=u(x,  *)  +  £(/)  (19) 

and  u(x,  t]  is  the  velocity  at  point  x  at  time  t.  The  probability  density  function  of  the 
stochastic  term,  (,(t)  is  a  gaussian  and  the  variance  of  this  distribution  is  related  to  the 
molecular  diffusivity  by  o2  =  2DA*,  where  Af  is  the  time  interval  between 
displacements.  To  apply  this  kind  of  term  to  discrete  maps,  the  particles  are  stochastically 
displaced  after  each  iteration  of  the  map.  The  addition  of  such  a  noise  term  enhances 
mixing.  This  has  been  explicitly  demonstrated  [25]  in  the  case  of  the  blinking  vortex 
map  [18]. 

Another  method  which  enhances  global  mixing  has  been  proposed  for  time  periodic 
flows. 


2.3  Symmetries  and  island  formation 

Time  periodic  flows  can  be  modelled  by  a  Hamiltonian  system  with  the  streamfunction 
acting  as  the  Hamiltonian  [23].  When  a  KAM  curve  for  such  a  system  is  perturbed,  it 
breaks  into  elliptical  and  hyperbolic  points.  Islands  are  formed  around  elliptic  points, 
which  are  regions  of  regular  behaviour  separated  by  an  invariant  curve  from  the 
chaotic  component.  These  islands  affect  the  transport  and  mixing  in  two  ways: 
firstly  they  do  not  exchange  trajectories  with  the  chaotic  counterpart  and  hence  act 
as  trapping  regions,  and  secondly,  due  to  their  'stickiness',  trajectories  which  come 
close  to  these  islands  spend  a  larger  fraction  of  time  in  the  vicinity.  Both  these  properties 
are  a  hindrance  to  efficient  mixing.  Therefore,  to  increase  the  mixing,  one  should 
control  the  flow  in  such  a  way  so  as  to  prevent  the  formation  of  these  regions. 

It  has  also  been  observed  that  in  many  cases,  the  placement  of  these  periodic  points  in 
the  phase  space  is  symmetric  [27].  It  is  possible  to  use  these  symmetries  to  locate  these 
periodic  points,  and  to  devise  a  protocol  which  would  prevent  the  formation  of  these 
islands  [22]. 

An  interesting  demonstration  of  this  method  can  be  made  in  the  case  of  the  cavity 
flow,  where  the  system  consists  of  fluid  in  a  cavity  whose  two  walls  are  moving 
transversally  (say  the  top  and  bottom  walls),  whereas  the  other  two  walls  are  held 
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rigid.  The  motion  is  a  composition  of  two  independent  operations,  that  due  to  the  top  wall 

T,  [this  means  that  a  set  of  particles  x  is  mapped  to  a  set  Trx  after  time  t  under  the  action 

of  the  top  wall  alone]  and  that  due  to  the  bottom  wall  Bt.  The  system  is  completely  ,v, 

characterized  by  a  sequence  of  top  and  bottom  wall  motions.  If  we  have  a  periodic  £$ 

operation  of  T,  B,  each  operating  for  equal  duration  of  time  then  the  system  has  x-axis  as 

the  symmetry  curve. 

To  enhance  mixing  one  tries  to  use  the  symmetries  in  such  a  way  that  the  islands  are 
never  formed  .  The  flow  BTBTBT...  mentioned  above  has  x  symmetry  and  the  islands  lie 
on  the  x-axis  or  in  pairs  symmetrically  distributed  on  both  ides  of  x-axis.  One  way  to 
prevent  the  formation  is  to  follow  the  flow  Q0  =  BT  by  an  identical  flow  which  is 
rotated  by  180°.  That  is,  now  the  flow  is  [R(BT)R]BT  =  Qj  =  TBBT.  This  protocol 
switches  the  position  of  the  islands  such  that  they  never  get  a  chance  to  form.  Other 
protocols  which  prevent  island  formation  are  also  possible,  e.g.  a  switch  in  the  flow  after 
two  periods  will  also  give  similar  results.  These  protocols  have  been  tested 
experimentally  for  the  cavity  flow  [22]  and  indeed  increase  the  global  mixing  by 
removing  the  islands.  A\ 

Thus,  by  exploiting  the  underlying  symmetries  of  the  flow,  one  can  remove  the  islands. 
Since  the  islands  hamper  mixing  by  both  trapping  as  well  as  by  slowing  down  the 
trajectories,  this  removal  enhances  mixing.  However,  as  long  as  the  sequence  of  protocol 
remains  finite,  there  will  always  be  islands  present  in  the  flow  which  will  not  be  removed 
by  this  sequence.  Secondly,  these  protocols  can  be  applied  only  in  case  of  time  periodic, 
incompressible,  nondissipative  flows.  The  applicability  of  such  protocols  in  case  of  flows 
that  are  dissipative  or  non-periodic  remains  unclear. 

f 

3.  Coherent  structures 

As  mentioned  earlier,  the  existence  of  localized,  coherent  structures  in  flows  that  are 

otherwise  chaotic  or  turbulent  is  a  commonly  observed  feature  of  natural  systems  [14, 13].  _*. 

Again,  the  reasons  for  the  formation  of  coherent  structures  in  otherwise  irregular  flows  "    j 

can  be  investigated  using  the  well  known  fact  that  spatially  extensive  systems  can  be  I 

modelled  by  low  dimensional  chaotic  flows  [17].  Recent  work  suggests  that  control 

methods  which  can  create  coherent  structures  in  chaotic  flows  and  maps  may  provide 

useful  insights  into  the  ubiquitous  existence  of  these  objects  in  natural  systems.  We 

outline  two  such  methods. 

The  first  method  creates  the  coherent  structure  by  the  introduction  of  a  fold  in  the 
vicinity  of  the  periodic  points  of  the  system  [28].  The  second  method  uses  localized 
parameter  perturbations  in  the  neighbourhood  of  the  fixed  points  to  create  the  coherent 
structures  [29]. 

3.1  Folding 

Chaotic  flows  are  typically  flows  which  stretch  and  fold.  Such  flows  are  usually  well  »^i 

mixing.  Thus  there  is  an  ultimate  relation  between  stretching  and  folding  and  the  mixing  I 

properties  [30].  The  prototype  for  describing  the  stretching  and  folding  of  flows  is  a  j 

horseshoe  construction.  These  horseshoes  are  formed  in  chaotic  regions  of  the  flow.  A  | 
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Figure  4.  (a)  The  tent  map  in  the  unit  interval  [0, 1].  (b)  shows  the  tent  map  with  an 
additional  fold  introduced.  This  additional  fold  is  responsible  for  trapping  of 
trajectories. 

horseshoe  is  termed  complete  if  the  fold  is  located  outside  the  initial  square  and 
incomplete  if  the  fold  is  located  inside.  Complete  horseshoes  generate  efficient  mixing 
where  as  incomplete  horseshoes  do  not.  Shinbrot  and  Ottino  [28]  show  that  the  proximity 
of  the  fold  of  an  underlying  horseshoe  map  to  a  periodic  point  characterizing  the  order  of 
the  horseshoe  can  produce  an  incomplete  horseshoe  and  generate  an  invariant  region  or  a 
coherent  structure. 
This  is  easiest  to  illustrate  in  the  case  of  the  tent  map  (figure  4(a))  given  by 


Xn+l=2Xn 
=  2(1  - 


(20) 


Every  small  interval  between  0  and  1  is  stretched  by  a  factor  of  2  after  every  iteration 
of  the  map  (20).  It  is  therefore  impossible  for  any  such  interval  to  contain  a  localized 
structure.  The  only  exception  to  this  occurs  near  a  fold  where  the  interval  is  doubled  over, 
thus  conceivably  counteracting  the  stretching.  To  construct  an  invariant  interval  7  by 
using  folds,  a  fold  must  be  placed  sufficiently  near  a  fixed  point  that  points  within  7 
remain  within  7  after  successive  iterations  of  the  map. 

In  the  case  of  the  tent  map,  it  is  easy  to  achieve  this  by  changing  the  phase,  a,  as 
defined  in  the  new  map 

Xn+i  =  2(a  -  Xn)     0  <  Xn  <  a 

=  2(Xn  -a]     a<Xn<±+a 

=  2(1  +  en  —  Xn]    5  +  ot  <Xn  <l.  (21) 

This  map  (figure  4(b))  now  contains  both  a  fixed  point  at  Xn+i  =Xn  =  (2a/3)  and  a 
fold  at  Xn  =  a  in  the  interval  7  =  (0, 2a)  and  points  initially  in  the  interval  7  remain 
trapped  forever  inside  the  interval  7  despite  repeated  iterations  of  the  map  (21).  The  effect 
of  the  fold  is  to  create  a  trapping  region  in  the  map.  In  the  case  of  the  tent  map  with  the 
fold  no  new  coherent  structure  is  created  despite  the  trapping  of  trajectories. 
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Figure  5.  This  figure  shows  the  phase  space  plot  of  a  standard  map  in  presence  of 
an  additional  fold.  The  map  parameter  K  =  5.1  and  the  fold  is  introduced  in  the  region 
0.3  <  x  <  0.7  as  can  be  clearly  seen  this  results  in  a  coherent  structure  being  formed 
in  the  neighbourhood  of  the  fixed  point  at  (0.5, 0.0).  The  values  of  the  constants  in  the 
cubic  polynomial  are  a0  =  0.0069,  a\  =  -0.713  and  a2  =  1.0023,  a3  =  0.321. 

The  effect  of  the  fold  in  creating  a  coherent  structure  can  be  easily  demonstrated  in  the 
context  of  the  standard  map  (18).  The  variable  X  is  restricted  to  [0,  1]  and  the  variable  Y 
to  [-0.5,0.5].  The  map  shows  chaotic  behaviour  for  parameter  values  exceeding  0.975 
and  has  a  hyperbolic  fixed  point  at  (0.5, 0).  A  fold  is  introduced  in  the  region  R  around 
the  fixed  point  in  the  following  manner.  We  replace  (K/(27c}}  sm(2TrXn)  by  a  functiom 
F(Xn)  when  (Xn,Yn]  lies  hi  R.  To  produce  a  fold  F(X]  is  chosen  so  as  to  contain  at 
least  one  extremum  in  R  with  F(X}  =  (K/(2ir))  sin(27o:n)  at  the  bounds  of  R.  A  suitable 
choice  is  the  modified  cubic  polynomial  F(X)  =  O.Q  +  a\(X  — •£)+  a-i(X  —  ^)2+ 
ai(X  —  }jf  where  OQ,  a\  and  ai  are  chosen  so  as  to  match  F(x)  with  (K/(2Tr)} 
sin(27rxn)  at  X  =  0.3  and  Y  =  -0.2.  It  is  clear  that  the  effect  of  the  fold  is  to  constrain  a 
finite  fraction  of  the  ensemble  of  trajectories  to  an  invariant  neighbourhood  of  the  fixed 
point  in  spite  of  the  presence  of  stretching.  The  resulting  coherent  structure  is  shown  in 
figure  5. 

The  same  idea  can  be  applied  to  experimentally  realizable  systems  and  has  been 
implemented  by  Shinbrot  and  Ottino  for  the  numerical  realization  of  the  journal  bearing 
flow  where  a  fluid  constrained  between  two  asymmetrically  placed  cylinders  is 
chaotically  advected  by  driving  the  inner  and  outer  cylinders  alternately  [23,  6].  Control 
is  implemented  by  controlling  the  clockwise  and  anticlocjkwise  rotation  of  the  outer  and 
inner  cylinder.  The  tracer  particles  of  the  fluid  are  chaotically  advected  for  a  repeating 
protocol  which  drives  the  inner  cylinder  clockwise  through  1080°,  outer  anticlockwise 
through  240°  and  outer  through  1080°.  A  fold  leading  to  a  coherent  structure  in  this  case 
can  be  provided  by  a  sequence  which  drives  the  outer  cylinder  clockwise  through  32°, 
followed  by  rotations  of  the  inner  cylinder  through  1080°,  the  outer  anticlockwise 
through  170°  and  the  inner  through  1080°  [28]. 
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The  robustness  of  these  structures  has  been  studied  by  adding  a  noise  term.  The 
structure  is  stable  for  small  noise  strengths  but  is  destroyed  for  large  noise.  An  interesting 
application  of  these  coherent  structures  is  to  use  them  as  controlled  release  capsules 
where  one  or  more  reactants  of  a  chemical  reaction  can  be  confined  in  a  coherent 
structure  while  the  remaining  flow  is  mixed.  At  a  desired  moment  the  coherent  structure 
can  be  destroyed  by  the  application  of  a  perturbation  thus  releasing  the  reactant  in  the 
chemical  reaction.  This  can  be  used  to  control  various  reaction  rates  and  products  [20]. 

The  fold,  however  varies  from  case  to  case  and  does  not  have  a  generic  form.  Also  the 
placement  of  such  a  fold  could  be  difficult  in  various  realistic  situation  and  time 
consuming.  A  new  method  to  produce  these  structures  via  parameter  perturbations  has 
been  proposed  in  [29].  This  method  is  natural  and  generic  and  is  also  easier  to  implement 
than  the  method  of  the  fold. 

3.2  Localized  parameter  perturbation 

We  now  summarize  a  new  method  by  which  coherent  structures  can  be  created  in  two- 
dimensional  area-preserving  maps  [29].  Such  maps  can  serve  as  representations  of 
realistic  fluid  flows  [18,  31].  Our  method  consists  of  the  application  of  parameter 
perturbations  in  the  local  neighbourhoods  of  the  periodic  points  of  the  system.  The 
perturbation  is  applied  in  a  fashion  in  which  the  area-preserving  nature  of  the  map  is 
retained. 

It  is  easy  to  see  that  this  control  mechanism  is  a  natural  mechanism  in  the  context  of 
realistic  flows  where  parameter  perturbations  in  localized  neighbourhoods  can  arise  in  a 
variety  of  situations  in  nature  and  in  experiment.  Examples  of  these  are  stirring  protocols 
in  experiment  and  convective  flows  in  environments  of  non-uniform  temperatures.  Noise 
can  also  be  a  source  of  such  perturbations.  The  localized  variation  of  parameters  such  as 
density  are  believed  to  be  responsible  for  the  formation  of  coherent  vortices  in  planetary 
and  oceanic  contexts  [13, 32].  Experiments  designed  to  simulate  Jovian  and  other 
planetary  surfaces  have  successfully  created  vortices  by  incorporating  localized 
perturbations  [33]. 

The  simplest  example  where  this  method  can  be  demonstrated  is  the  two-dimensional 
area-preserving  standard  map  given  by  (18).  As  said  before,  the  map  shows  chaotic 
behaviour  for  parameter  values  exceeding  0.975  and  has  a  hyperbolic  fixed  point  at 
(0.5,0).  The  system  is  controlled  in  the  following  manner.  The  parameter  K  in  (18)  is 
replaced  by  the  parameter  k  where  k  =  K  —  e  if  \X  —  Xf\  <8,\Y  —Yf\  <  8  and  k  =  K 
for  values  of  x,  y  which  lie  outside  this  square.  Here  (X/,  Yf]  is  a  period- 1  point  of  the 
uncontrolled  map  (18)  and  e  is  a  control  parameter.  It  is  clear  that  the  dynamics  of  the 
controlled  map  is  area-preserving  as  the  Jacobian  J  remains  unity. 

Figure  6(a)  shows  the  phase-space  plot  of  the  uncontrolled  map  (18)  for  K  —  6.0  with 
200  initial  conditions  chosen  randomly  over  the  unit  square,  iterated  500  times.  It  is  clear 
that  the  map  is  completely  chaotic  and  the  distribution  of  points  is  uniform  over  the  phase 
space.  Compare  this  with  the  phase-space  plot  (figure  6(b))  of  the  controlled  map 
described  above  where  the  fixed  point  (*/,  v/)  is  chosen  to  be  the  hyperbolic  fixed  point 
(0.5,0.0),  and  8  =  0.1,  e  =  2.0  iterated  the  same  number  of  times  for  the  same  initial 
conditions.  It  is  clear  that  a  prominent  new  coherent  structure  now  appears  in  the 
neighbourhood  of  the  fixed  point  which  was  absent  in  the  original  standard  map.  Thus 
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Figure  6.  (a)  The  phase-space  plot  for  the  uncontrolled  standard  map.  (b)  The 
formation  of  the  coherent  structure.  The  values  of  K,  e,  8  are  6.0,2.0  and  0.1 
respectively,  with  (xf,yf)  =  (0.5,0.0). 

perturbing  the  map  parameter  in  the  vicinity  of  the  fixed  point  constrains  a  part  of  the 
ensemble  of  trajectories  to  give  new  coherent  structures.  The  method  can  also  be 
successfully  demonstrated  in  case  of  maps  representing  realistic  fluid  situations,  e.g.  in 
the  Blinking  vortex  map  [18]. 

If  the  local  rates  of  stretching  i.e.  local  rates  of  divergence  of  trajectories  in  different 
regions  of  phase  space  are  calculated,  the  controlled  system  shows  decreased  rates  of 
divergence  (as  compared  with  the  uncontrolled  system)  in  the  regions  where  coherent 
structures  are  formed  [291.  The  decreased  divergence  of  the  trajectories  has  the  effort  of 


Coherence  in  chaotic  maps  and  flows 

consequence.  The  reduction  in  local  mixing  tends  to  favour  coherence.  It  is  interesting  to 
compare  this  method  with  the  control  procedure  of  §2.1.  The  control  there  works  by 
tailoring  perturbation  towards  increasing  the  mixing  tendency  by  increasing  the  local 
Lyapunov  exponents  in  regions  of  low  local  Lyapunov  exponent.  This  increases  the 
diverging  tendency  of  trajectories  and  hence  the  mixing.  Conversely,  the  present  method 
works  by  decreasing  the  divergence  rate  of  trajectories  in  the  neighbourhood  of  low-order 
periodic  points,  another  way  of  locating  localized  neighbourhoods  where  the  divergence 
rates  are  of  significance. 

A  comparison  of  the  present  method  with  the  introduction  of  fold  discussed  in  previous 
section  is  also  interesting.  In  contrast  to  the  introduction  of  a  fold  in  the  vicinity  of  the 
periodic  point  the  present  method  introduces  a  parameter  perturbation  which  modifies  the 
distribution  of  the  stretching  rates.  Since  the  perturbation  retains  the  area-preserving 
nature  of  the  systems,  the  distribution  of  contraction  rates  (the  distribution  of  the  smaller 
Lyapunov  exponents  of  the  system)  correspondingly  compensates  for  the  alteration.  This 
in  effect  modifies  the  folding  of  the  system  without  the  explicit  introduction  of  an 
additional  fold.  In  both  cases,  the  control  of  the  flow  in  the  neighborhood  of  low  order 
periodic  points  produces  coherence,  but  the  nature  of  the  periodic  points  (whether 
hyperbolic  or  elliptic)  does  not  appear  to  be  of  significance. 

Thus  we  see  that  coherent  structures  can  be  created  in  simple  chaotic  flows  using 
some  simple  control  techniques.  These  structures  can  be  creatively  used  for  various 
applications  and  the  control  mechanism  described  can  provide  a  useful  insight  into  the 
ubiquitous  presence  of  these  structures  in  various  situations  as  well  as  the  mechanism 
responsible  for  their  creation.  The  structures  can  be  characterized  by  using  some  local 
quantifiers.  Such  an  approach  may  also  shed  light  on  the  way  these  structures  affect 
various  transport  quantities  in  chaotic  flows. 

The  flow  region  thus  has  a  variety  of  objects,  ranging  from  fast  mixing  chaotic  regions 
to  islands  and  coherent  structures.  This  backdrop  provides  an  underlying  template  for 
various  interesting  processes  like  aggregation,  coagulation  and  breakup  etc.  The  presence 
of  efficient  mixing  or  coherent  structures  modifies  and  greatly  influences  these  processes. 
Therefore,  by  controlling  various  mixing  properties  or  by  creating  coherent  structures, 
one  may  be  able  to  greatly  influence  these  processes  to  one's  advantage. 

4.  Mixing,  aggregation  and  breakup 

There  are  a  lot  of  interesting  phenomena  which  are  associated  with  mixing  in  fluid  flows. 
These  phenomena  deal  with  situations  where  the  particles  in  fluid  flow  are  no  longer 
passive  but  interact  with  each  other.  They  collide,  stick  together,  aggregate.  A  fluid 
element  elongates,  stretches  and  breaks  up.  In  other  words,  in  a  background  of  chaotic 
flows  revealed  by  passive  tracers,  a  number  of  phenomena  involving  active  tracers  can 
take  place. 

One  of  the  most  common  phenomena  which  occur  in  the  case  of  active  tracers  is  the 
break  up  of  droplets.  The  crucial  factors  in  this  context  are  the  velocity  field  of  the  fluid, 
the  viscosity  of  the  droplet  and  surface  tension  effects  [34].  The  breakup  in  case  of 
chaotic  flows  depends  on  the  placement  of  the  droplet.  If  the  droplet  is  placed  in  an  island 
then  it  undergoes  a  periodic  sequence  of  stretching  and  contraction  and  there  is  no  net 
stretching.  If  the  viscosity  is  strong  enough  to  overcome  surface  tension  effects  then  the 
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droplets  placed  in  the  chaotic  region  become  elongated  into  long  filaments.  This  filament 
is  then  stretched,  compressed  and  folded  in  a  complicated  way.-  The  breakup  of  these 
filaments  can  occur  by  various  mechanisms  such  as  capillary  wave  instabilities, 
endpinching,  necking  etc.  However  there  are  some  modes  of  breakup  which  are  peculiar 
to  the  nonlinear  chaotic  flows.  The  prime  example  is  the  breakup  in  folds,  which  is 
associated  with  change  in  sign  of  stretching  rate  [6].  In  most  of  these  modes  of  breakup, 
the  breakup  is  directly  dependent  on  the  mixing  properties  of  the  flow.  The  repetitive 
nature  of  the  mixing  involving  a  number  of  stretching  and  folding  generates  an 
equilibrium  drop  size  distribution  which  is  self  similiar.  The  crucial  factor  in  the  breakup 
is  the  stretching  rate.  Clearly,  the  control  methods  discussed  above  can  be  used  to 
redistribute  the  stretching  rates  and  to  modify  the  rate  of  mixing  so  that  the  break  up  of 
droplets  is  accelerated  or  slowed  down  depending  on  the  application. 

The  second  interesting  process  involving  active  tracers  is  the  process  of  aggregation. 
This  is  the  reverse  process  of  breakup.  The  simplest  scenario  involving  aggregation  or 
coagulation  is  that  of  particles  being  advected  by  a  flow  and  interacting  with  each  other 
over  short  range  interactions.  For  example,  if  two  particles  come  together  within  a 
capture  diameter  d,  they  stick  to  each  other  and  move  together  subsequently.  Thus  the 
size  of  the  particle  clusters  grows  till  it  becomes  too  big  when  mixing  takes  over  and 
breaks  it  up  thus  limiting  further  growth  of  these  clusters.  Hence  once  again,  the  size  to 
which  such  a  cluster  would  grow  is  dictated  by  localised  mixing  properties.  Coherent 
structures,  with  their  trapped  and  localized  trajectories  could  act  as  preferred  sites  where 
large  structures  could  grow.  Thus  understanding  and  controlling  these  localized  properties 
is  once  again  the  key  to  create  tailor  made  aggregates  of  desired  size  in  various  flows.  The 
case  of  the  growing  cluster  is  also  reminiscent  of  those  formed  by  diffusion  limited 
aggregation.  The  aggregates  can  be  isotropic  or  highly  anisotropic  depending  on  the 
presence  of  islands  and  other  coherent  structures.  Control  methods  could  be  used  to 
promote  desired  cluster  configurations. 

All  the  interactions  described  here  are  'one-way'.  The  way  the  particles  interact  is 
influenced  by  the  flow  and  various  local  mixing  properties.  However,  the  presence  of  such 
interaction  may  also  change  the  nature  of  flow  and  hence  influence  the  whole  process.  For 
example,  the  addition  of  small  amounts  of  polymer  to  a  Newtonian  fluid  decreases  the 
amount  of  chaos  in  the  flow  [35].  The  underlying  presence  of  chaos,  however  still 
persists.  These  phenomena  are  little  investigated  up  to  now  and  much  remains  to  be 
understood.  Again,  control  methods  may  provide  insights  into  these  problems. 


5.  Conclusion 

We  have  summarized  some  recently  proposed  applications  of  control  methods  to  problems 
of  mixing  and  coherence  in  chaotic  dynamical  systems.  This  is  an  important  problem 
both  for  its  own  intrinsic  interest  and  also  from  the  point  of  view  of  applications.  These 
methods  provide  insights  into  the  origin  of  mixing  and  unmixing  behaviour  in  natural 
systems.  They  could  be  useful  for  improving  the  rate  of  mixing  of  flows,  enhancing 
global  mixing,  and  also  for  creating  coherent  regions  in  flows.  The  results  could  find 
applicability  to  fluid  flows,  chemical  reactions,  oscillator  arrays  and  other  extended 
systems  and  are  relevant  for  promoting  efficient  engineering  processes  and  tailoring 

Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
246  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


catalytic  reactions.  One  of  the  methods  discussed  is  also  useful  for  enhancing  the 
chaoticity  of  dynamical  systems  and  could  be  useful  in  biomedical  applications  where  the 
destruction  or  reduction  of  chaos  could  have  pathological  consequences.  These  methods 
could  also  be  used  for  tailoring  aggregation  and  break-up  processes  for  suspended 
particles  in  fluid  flows.  We  hope  to  explore  some  of  these  directions  in  future. 
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Abstract.  In  this  paper  we  consider  the  Bonhoeffer-van  der  Pol  (BVP)  equation  which  describes 
propagation  of  nerve  pulses  in  a  neural  membrane,  and  characterize  the  chaotic  attractor  at  various 
bifurcations,  and  the  probability  distribution  associated  with  weak  and  strong  chaos.  We  illustrate 
control  of  chaos  in  the  BVP  equation  by  the  Ott-Grebogi-Yorke  method  as  well  as  through  a 
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1.  Introduction 

During  the  last  decade  or  so  remarkable  progress  has  been  made  in  exploring  the 
complexity  of  nonlinear  systems,  where  the  presence  of  chaos  and  other  related 
phenomena  have  been  extensively  investigated  [1,  2].  Particularly,  routes  to  chaos, 
characterization  of  periodic  and  chaotic  motions,  multifractal,  fractal  basin  boundary 
structures,  etc.  have  been  well  established.  In  recent  years  there  have  been  increasing 
interests  in  the  study  of 

(i)  statistical  dynamics  of  local  expansion  rates  [3-8], 

(ii)  characterization  of  chaotic  attractors  at  bifurcations  [9-14], 

(iii)  characterization  of  weak  and  strong  chaos  [15, 16], 

(iv)  controlling  of  chaos  [2,  17-39]  and 

(v)  synchronization  of  chaotic  trajectories  [2,  40-45]. 


1.1  Local  expansion  rates  and  characterization  of  chaotic  attractors  at  bifurcations 

Recently,  the  features  associated  with  the  fluctuations  of  local  expansion  rates  of  various 
attractors  and  statistical  dynamics  of  local  Lyapunov  exponents  have  been  studied  [3-14], 
Chaotic  attractors  have  various  local  structures  depending  on  their  routes  of  onset  and 
evolution.  For  instance,  chaotic  attractors  are  generally  non-hyperbolic  and  have  local 
homoclinic  tangency  structures.  Bifurcations  of  chaos  change  the  structures  of  chaotic 


and  the  variance  crn(q)  of  fluctuations  of  AB(*i)  around  An(#)  are  useful  for  describing 
the  coherent  large  fluctuations  at  the  bifurcations  of  chaos. 

1.2  Weak  and  strong  chaos 

In  the  literature,  chaotic  motion  with  small  positive  Lyapunov  exponent  is  called  weak 
chaos  and  chaotic  motion  with  sufficiently  large  positive  Lyapunov  exponent  is  termed  as 
strong  chaos.  Recently,  Chen  and  Chow  [15]  have  shown  that  weak  and  strong  chaos  can 
be  characterized  using  probability  distribution  of  £-step  difference  of  a  variable  of  the 
given  dynamical  system. 

1.3  Controlling  of  chaos 

Even  though  the  presence  of  chaotic  behaviour  is  generic  and  robust  there  are  practical 
situations  where  one  wishes  to  avoid  or  control  chaos  so  as  to  improve  the  performance  of 
a  dynamical  system.  The  ability  to  control  chaos  that  is  to  convert  chaotic  oscillations 
into  a  desired  regular  ones  with  a  periodic  time  dependence,  would  be  beneficial  in 
working  with  a  particular  system.  Recently,  more  interest  has  been  focussed  on 
controlling  of  chaos  and  various  methodologies  that  control  chaos  [17-26,  33]. 

1.4  Synchronization  of  chaotic  trajectories 

An  approach  which  is  closely  related  to  controlling  of  chaos  is  that  of  obtaining  the 
phase  space  trajectory  of  a  chaotic  system  synchronized  with  another  chaotic  trajectory 
[2,  4CM-5]  which  has  practical  applications  hi  secure  communication.  Here,  synchroniza- 
tion is  obtained  by  setting  some  of  the  variables  of  the  synchronizing  system  to  the  values 
of  the  desired  trajectory.  The  remaining  variables  asymptotically  synchronize  with  the 
desired  values  provided  the  subsystem  Lyapunov  exponents  for  these  variables  are 
negative. 

Thus,  there  are  many  interesting  topics  in  chaos  theory  which  have  received  much 
attention  during  the  last  few  years  and  many  fascinating  results  have  been  found.  We  have 
carried  out  investigations  in  these  directions  with  reference  to  certain  discrete  and 
continuous  dynamical  systems  [8,  12-14,  16,  27-33].  The  goal  of  the  present  paper 
is  to  study  the  salient  features  associated  with  some  of  the  above  aspects  of  chaos 
with  reference  to  the  Bonhoeffer-van  der  Pol  (BVP)  equation.  The  BVP  oscillator  is 
an  interesting  dynamical  system  of  considerable  physical  and  biological  significance 
[46]. 

The  plan  of  the  paper  is  as  follows.  In  §  2  we  characterize  the  chaotic  attractor  at 
various  bifurcations,  using  the  variance  an(q)  of  fluctuations  of  coarse-grained  local 
expansion  rates  of  nearby  orbits.  Next,  the  probability  distribution  P(AVt),  k  -  1, 2, . . . 
of  AV*  =  Vm+k  -  Vm  where  V\,  Vi, . . .  are  the  values  of  the  variable  V  in  the  Poincare 
map  of  the  BVP  system  is  investigated  in  §  3.  A  stationary  P(AV*)  distribution  is  shown 
to  occur  for  strong  chaos  while  nonstationary  P( AVjt)  is  observed  for  weak  chaos.  Then, 
in  §4  we  consider  the  problem  of  controlling  of  chaos.  We  present  a  brief  summary  of 
previous  results  on  controlling  of  chaos  in  the  BVP  system.  We  show  the  stabilization  of 
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unstable  periodic  orbit  (UPO)  of  BVP  oscillator  by  the  method  of  Ott-Grebogi-Yorke 
[17],  and  the  suppression  of  chaos  by  an  additional  instantaneous  burst.  Finally,  §5 
contains  summary  and  conclusions. 

2.  Characterization  of  chaotic  attractors  at  bifurcations  in  the  BVP  oscillator 

In  this  section  we  study  the  characterization  of  chaotic  attractors  at  critical  bifurcations 
with  reference  to  the  BVP  equation  [28,  47,  48]  which  describes  the  propagation  of  an 
electrical  impulse  or  voltage  pulse  along  the  membranes  of  nerve  cell: 


(la) 
=  c(V  +  a-bR). 


The  variable  V  can  be  seen  as  the  electric  potential  across  the  cell  membrane,  while  I?  is  a 
so-called  recovery  variable  in  which  dynamic  aspects  of  the  state  of  the  membrane  are 
aggregated,  a,  b,  c  are  membrane  radius,  specific  resistivity  of  the  fluid  and  temperature 
factor  respectively  and  AQ  +  A\  cost  is  the  membrane  current. 

For  a  chaotic  orbit  {Xn},  n  =  1,  2,  .  .  .  generated  by  a  two  dimensional  map,  we  define 
the  coarse-grained  expansion  rates  [9-1  1]  as 

1  (2) 

where  X(Xm)  is  the  local  expansion  rate  of  nearby  orbits  at  Xm  along  the  unstable 
manifold.  The  values  of  An(Xi)  for  different  Xi's  are  randomly  distributed  between 
minimum  and  maximum  values.  To  describe  the  fluctuations  of  the  local  expansion  rates 
we  consider  the  partition  function  [9-1  1] 

Zn(q)  =  (exp[-n(q  -  1)  A,  (Xj)]),  (-00  <  q  <  oo)  (3) 

where  (...)  denotes  the  long  time  average.  The  scaling  exponents  can  be  defined  in 
terms  of  the  partition  function  as 

(4) 

'  (5) 

(6) 

For  the  BVP  oscillator  A(Xm)  along  the  unstable  manifold  at  Xm  is  calculated  in  the 
Poincare  map.  The  dynamical  structure  functions  An(q)  and  crn(q)  are  calculated  for  a 
range  of  values  of  the  control  parameter  A\  about  its  critical  values  A\c  at  which 
bifurcations  such  as  sudden  widening  of  a  chaotic  attractor,  band  merging  and 
intermittency  occur. 

The  parameters  in  (1)  are  fixed  at  a  =  0.7,  b  =  0.8,  c  =  0.1,  AQ  =  0.  A  band  merging 
crisis  is  found  to  occur  at  A\  =  A\m  PS  1.28653:  as  AI  is  decreased  to  A\m  three  bands  of  a 
chaotic  attractor  merge  to  form  a  single  band  chaotic  attractor.  Figure  la  shows  an  (q)  at 
AI  =  1.287  far  from  A\m,  where  we  have  used  n  =  21.  The  <Jn(q]  has  a  peak  at 
<7  =  la  ~  1-65.  <Tn(q)  just  before  the  band  merging  is  shown  in  figure  lb.  The  an(q)  has 
two  peaks  at  q  =  qa  PS  2.4  and  q  —  q@  w  0.4.  Two  peaks  are  observed  for  AI  values  just 
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Figure  1.    on(q)  versus  q  for  A\  =  1.287  (a)  far  from  band  merging  chaos  and  for 
AI  =  1.286  (b)  just  before  band  merging  chaos. 
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Figure  2.    u,,^)  plots  for  (a)  AI  =  1.0922  in  the  intermittency  region  and  (b) 
AI  =  1.096  far  after  intermittency. 


after  the  band  merging  also.  For  example,  for  AI  =  1.285  peaks  at  qa  —  1.85  and 
qp  =  -0.075  are  observed.  Further,  for  q=l,  An(l)  =  A00  and  An(oo)  =  Amin  and 
AM(-oo)  =  Amax.  Therefore,  an(q]  with  q  >  1  and  q  <  1  can  explicitly  describe  negative 
and  positive  large  fluctuations  of  An(Xi)  respectively. 

In  the  BVP  equation,  type  I  intermittent  chaos  is  observed  for  AI  6  (1.092, 1.094).  For 
AI  values  just  below  1.092  a  period-4jT(T  =  2-ri)  attractor  is  found.  In  the  intermittency 
region  the  chaotic  attractor  has  two  types  of  local  structures  which  produce  the  laminar 
and  turbulent  bursts  respectively.  Two  types  of  local  structures  have  been  captured  by  the 
dynamical  structure  function  crn(q}.  Figure  2a  shows  crn(q)  versus  q  in  the  intermittency 
region.  The  AI  value  used  is  1.0922  and  n  —  10.  The  crn(q}  has  two  peaks  at 
q  =  qa  KS  1.75  and  q  =  qp .  w  0.5.  Figure  2b  shows  the  result  for  AI  =  1.096  far  after 
intermittency  at  which  fully  developed  chaotic  motion  is  observed.  an(q]  has  only  one 
peak  at  q  =  qa  w  0.6. 

A  crisis,  or  sudden  expansion  in  the  size  of  the  attractor  is  observed  at 
A  =AW  «  0.747486.  For  AI  =0.73,  far  from  the  crisis,  crn(q)  has  only  one  peak  at 
q  =  qa  PH  1.55.  Just  before  the  crisis,  corresponding  to  AI  =  0.747482  the  variance  crn(q) 
has  two  peaks  at  q  =  qa  «  2  and  qp  w  0. 
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From  the  above  analysis  we  note  that  ga  peak  is  observed  for  all  chaotic  attractors. 
However,  additional  peaks  in  crn(q)  versus  q  plot  occur  just  before  and  after  the  critical 
bifurcations.  Thus  it  turns  out  that  crn(q}  is  useful  for  characterizing  the  chaotic  attractors 
at  bifurcations. 


3.  Characteristics  of  probability  distribution  in  chaotic  attractors  of  BVP  oscillator 

In  this  section  we  illustrate  the  characterization  of  weak  and  strong  chaos  in  the  BVP 
oscillator  using  the  probability  distribution  of  the  fc-step  difference  quantity  of  the 
variable  V.  Let  { Vn},  n  =  1 , 2, . . . ,  Af  be  the  values  of  the  variable  V  of  (1)  in  the  Poincare 
map.  We  define  the  fc-step  difference  quantity  AVk  as  AVJt  =  Vm+k  —  Vm  where 
m  =  1,2, ...  ,N',N'  <  N.  Then  we  calculate  the  probability  distribution  of  AV*. 

Figure  3  shows  the  variation  of  maximal  Lyapunov  exponent  (A)  as  a  function  of  AI  .  In 
this  plot  there  are  many  regions  labelled  by  |,  in  which  A  is  positive  but  very  small, 
indicating  weak  chaos.  Distributions  are  calculated  for  various  values  of  AI.  In  numerical 
calculations  we  neglect  the  first  5000  Poincare  data  as  transient  and  use  the  next  104 
points.  For  AI  =  1.0919  the  maximal  Lyapunov  exponent  is  » 0.0004  and  the 
corresponding  chaotic  motion  is  weak.  For  this  AI  value,  P  is  found  to  change 
continuously  with  k.  That  is,  the  distribution  is  nonstationary.  This  is  further  verified  by 
the  chi-square,  C2  test  quantity,  defined  as  [15] 

n 

/""2/1.    -\   __  \       fn    S-i/(R'  -\  -  S'}  (7) 

1=1 

where  jR,  and  Sf  are  the  probabilities  of  the  z'th  interval  for  P(AVk+j)  and  P(AV)t) 
respectively.  In  (7)  intervals  with  /?,  =  S, •  =  0  are  excluded.  If  two  probability 
distributions  differ  greatly,  one  gets  a  large  C2.  For  two  similar  distributions  C2  will 
be  small.  The  numerically  calculated  C2  is  plotted  in  figure  4.  The  analysis  is  carried  out 
for  k  values  up  to  5000.  The  nondecreasing  C2  implies  the  nonstationary  characteristic  of 
P.  Thus,  a  complete  probability  distribution  is  impossible  for  weak  chaos.  A  physical 
mechanism  of  nonstationary  P  can  be  a  recurrence  of  memory  loss  and  recovery  of  initial 
conditions  [49]. 

Stationary  probability  distribution  is  found  for  strong  chaos.  For  AI  =  1.26,  A  is 
«  0.056.  The  calculated  C2  is  plotted  in  figure  5.  For  k  >  15  C2  is  almost  zero.  That  is, 
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Figure  3.    Estimated  maximal  Lyapunov  exponent  versus  AI. 
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Figure  4.    C2(k,  1)  versus  k  for  the  BVP  chaotic  attractor  with  AI  =  1.0919  (weak 
chaos). 
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Figure  5.    Same  as  figure  (4)  now  for  AI  =  1.26  (strong  chaos). 


the  distribution  has  evolved  into  a  stationary  state.  This  suggests  that  the  variable  Vn  can 
be  described  as  if  it  were  generated  by  a  random  number  generator  with  a  certain 
probability  distribution.  A  possible  mechanism  for  a  stationary  distribution  is  complete 
loss  of  memory  to  the  initial  conditions.  For  two  band  and  four  band  chaotic  attractors, 
C2(fe,  1)  values  are  found  to  be  large  (not  shown).  However,  for  two  band  and  four  band 
chaotic  attractors  C2(fc,2)  and  C2(fc,4)  respectively  decay  to  zero  as  k  increases. 
Interestingly,  for  two  (four)  band  attractor  simple  switching  between  two  (four)  different 
classes  of  P  is  found. 


4.  Controlling  of  chaos  in  the  BVP  equation 

In  this  section  we  consider  the  problem  of  controlling  of  chaos  in  the  BVP  equation.  First, 
we  briefly  review  some  previous  results  of  controlling  of  chaos  in  the  BVP  equation. 


4.1  Previous  results 

In  a  series  of  papers  [27-33]  we  have  studied  the  applicability  and  efficacy  of  a  few 
control  algorithms  in  the  BVP  equation. 
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Chaotic  dynamics 

Stabilization  of  UPO  embedded  in  a  chaotic  attractor  has  been  studied  [28, 29, 31]  by 
the  control  methods  of  Singer  et  al  [21],  Pyragas  [22]  and  Chen  and  Dong  [23].  A 
detailed  comparative  study  of  these  methods  has  been  made  [31].  In  all  the  three  control 
schemes  (i)  stabilization  of  desired  UPO  is  achieved  for  certain  range  of  values  of 
amplitude  (e)  of  the  perturbation,  (ii)  before  reaching  the  desired  target  the  system 
exhibits  transient  evolution  and  (iii)  the  recovery  time  varies  with  the  control  strength  e. 

Adaptive  control  algorithm  [18, 19]  with  linear  as  well  as  nonlinear  control  functions  is 
able  to  convert  the  chaotic  attractors  of  the  BVP  system  to  a  period-nT  limit  cycle 
[28, 29],  Stable  control  is  observed  for  certain  range  of  values  of  stiffness  (e)  of  the 
control  function.  Further,  the  recovery  time  is  found  to  be  inversely  proportional  to  e. 

We  have  investigated  analytically  [30]  and  numerically  [28]  the  effect  of  periodic 
parametric  perturbation  and  addition  of  second  periodic  force  on  chaotic  dynamics.  The 
period  of  the  controlled  motion  is  found  to  depend  on  amplitude  and  frequency  of  the 
added  force.  We  have  shown  that  most  of  the  regular  regime  can  be  identified  by  the 
Melnikov  method.  Interestingly,  suppression  of  chaos  is  found  in  the  parametric  regimes 
where  the  Melnikov  function  does  not  change  sign. 


4.2  Ott-Grebogi-Yorke  method 

In  the  OGY  technique  [17],  stabilization  of  an  UPO  in  a  suitably  defined  Poincare  map 
can  be  achieved  by  slight  adjustment  of  a  control  parameter.  Suppose  we  wish  to  stabilize 
an  unstable  period- 1  orbit  contained  in  the  chaotic  attractor  of  BVP  oscillator.  We  choose 
AO  as  the  control  parameter  and  denote  Xp  (Ao  =  0)  as  the  unstable  period- 1  orbit  to  be 
stabilized.  As  AO  in  (1)  is  varied  slightly  from  A0  =  0  to  some  AO  ^  0  then  X/r(0)  will 
shift  to  XF(AQ).  We  define  a  vector  g  as  g  =  dXp(Ao}/dAQ  evaluated  at  AQ  ^  0.  Near 
XF(Q]  and  for  small  values  of  AQ  one  may  write  $Xn+\  «  M6Xn  where  M  is  a  2  x  2  matrix 
and  8Xn  =  Xn  —  XF(AQ}.  When  the  nth  iteration  Xn  is  close  to  Xp  the  value  of  A0  is 
changed  such  that  Xn+\  falls  on  the  stable  manifold  of  Xp.  That  is,  we  choose  the  control 
<$AOM  so  that/H  •  6Xn+\  =  0  which  yields  the  control  formula  [17] 


6AQn  = 


(A« -!)/„•*' 


(8) 
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Figure  6.     (a)  The  R-component  of  the  period- 1  orbit  in  the  Poincare  map  versus  n. 
The  control  was  switched  on  at  n  =  30,  then  switched  off  at  n  =  200  and  then 


where  Xu  is  the  unstable  eigenvalue  of  the  matrix  M  and/«  is  the  unstable  contravariant 
basis  vector  [17]. 

In  the  numerical  simulation  we  found  all  pairs  of  iterates  falling  within  the  circles  of 
radius  0.01  about  the  point  XF(0).  To  this  data  we  have  fitted  the  approximate  linear 
map  and  then  determined  the  form  of  M  and  its  eigenvalues  and/«.  The  approximate 
location  of  the  period-1  unstable  orbit  is  calculated  as  XF  =  (Vp,Rp)  »  (-0.9560, 
-0.4685).  We  have  obtained  Xu  w  1.46,  A,  =  -0.68,  £  «  (0.105,0.97)  and  /„  = 
(1  _j_  \ly2  (AH  -  Xs}~l(l,Xs}.  To  stabilize  the  orbit  XF  we  have  chosen  the  maximum 
allowed  perturbation  as  |<5A0max|  =  0.01.  Figure  6a  illustrates  the  control  of  period-1 
orbit.  The  period-1  orbit  gets  stabilized  as  long  as  the  control  is  included.  Figure  6b 
shows  the  perturbation  6Ao  versus  n. 

4.3  Suppression  of  chaos  by  periodic  instantaneous  burst 

The  BVP  equation  with  the  addition  of  periodic  instantaneous  burst  is  written  as  [33] 
V=V-V3/3-R+A{cost,  (9a) 

(9b) 


where 


(9c) 


n=i 


The  force  g(i)  is  nonzero  and  equal  to  unity  only  at  times  t  =  nrT,  n=  1,2,  —  For 
simplicity,  T  is  fixed  as  2ir,  period  of  the  driving  force.  For  fixed  value  of  T  the  times  at 
which  g(t]  become  nonzero  and  the  period  of  the  force  are  characterized  by  r.  Equation 
(9)  is  integrated  using  fourth  order  Runge-Kutta  method  with  time  step  2vr/100. 

Figure  7  shows  the  bifurcation  diagram  as  a  function  of  a  for  r  =  0.5.  Chaotic 
behaviour  persists  for  a  <  0.0045.  Regular  motion  is  recovered  in  the  parameter  interval 
a  >  0.0045.  An  interesting  point  here  is  the  suppression  of  chaos  by  inverse  period 
doubling  phenomenon.  Period  87,  4T,  2T  and  T  motions  are  found  in  the  intervals 
(0.0051-0.0057),  (0.006-0.0096),  (0.0102-0.0309)  and  (a  >  0.0312)  respectively.  Next, 
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Figure  7.    Bifurcation  diagram  of  the  BVP  system  (9).  The  parameters  are  fixed  at 
a  -  0.7,  b  =  0.8,  c  =  0.1,  A\  =  0.74,  T  =  2ir  and  r  =  0.5. 
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a  is  kept  at  0.006286  and  r  is  increased  from  small  value  in  steps  of  0.01  up  to  1.  Regular 
behavior  is  observed  for  0.01  <  r  <  0.69. 

Further,  the  effect  of  phase  difference  <p  between  the  periodic  external  force  AI  cost 
and  g(t)  is  also  studied.  For  various  values  of  r  and  a  the  dynamics  of  the  BVP  system  is 
studied  by  varying  (j>.  Interestingly,  the  behaviour  of  the  system  whether  it  is  chaotic  or 
periodic  in  the  absence  of  0  is  unaltered  when  <j>  is  included.  The  only  effect  observed  is 
an  infinitesimal  shift  in  the  values  of  the  state  variables  in  the  Poincare  map.  A  more 
detailed  investigation  of  suppression  of  chaos  by  instantaneous  burst  will  be  presented 
elsewhere  [33]. 

5.  Summary  and  conclusions 

In  this  paper  we  have  studied  various  aspects  of  chaotic  dynamics  in  the  BVP  oscillator. 
As  shown  for  the  BVP  oscillator  the  chaotic  attractors  just  near  the  critical  bifurcations 
such  as  sudden  widening,  band  merging  and  intermittency  produce  large  fluctuations  of 
the  coarse-grained  local  expansion  rates  A,  and  consequently,  additional  peaks  are  found 
to  occur  in  the  an(q]  versus  q  plot.  On  the  other  hand,  a  stationary  probability  distribution 
is  found  for  chaotic  attractors  with  large  positive  Lyapunov  exponent,  while  for  attractors 
with  sufficiently  small  positive  Lyapunov  exponent  a  nonstationary  probability 
distribution  is  observed.  Thus,  ffn(q)  and  P(Axk)  are  useful  to  characterize  various 
chaotic  attractors. 

We  have  discussed  the  problem  of  controlling  of  chaos  in  the  BVP  equation.  If  the  aim 
is  to  set  the  system  motion  to.  a  desired  predetermined  regular  orbit,  the  adaptive  control 
algorithm  and  feedback  methods  have  distinct  advantages.  If  one  wishes  to  convert 
chaotic  motion  to  some  regular  motion  then  periodic  perturbations  can  be  used. 
Surprisingly,  chaotic  motion  can  be  suppressed  by  periodic  kicks,  which  make  the  motion 
periodic. 
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Abstract.  We  review  the  subject  of  control  of  chaotic  systems  paying  special  attention  to 
exponential  control.  We  also  discuss  the  application  of  synchronization  of  chaotic  systems  to 
security  in  communications. 
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1.  Introduction 

This  started  off  as  a  general  review  of  the  subject  of  synchronization  and  control  of 
chaotic  systems.  When  it  became  clear  however  that  this  special  issue  on  chaos  would 
contain  a  number  of  such  contributions,  we  decided  to  give  it  a  more  personal  touch  by 
also  discussing  at  some  length  some  of  our  own  work  in  this  field  -  the  exponential 
control  of  systems  exhibiting  chaotic  behaviour. 

2.  Control 

The  subject  of  controlling  chaotic  systems  has  become  popular  since  the  work  of  Ott  et  al 
[1],  hereafter  referred  to  as  OGY.  OGY  were  the  first  to  realize  that  the  presence  of  chaos 
instead  of  being  a  nuisance  can,  in  fact,  be  of  great  advantage.  This  is  because  a  chaotic 
attractor  typically  has  embedded  within  it  an  infinite  number  of  unstable  periodic  orbits. 
Any  one  of  a  number  of  such  orbits  can  be  stabilized  by  small  time-dependent  parameter 
perturbations  of  an  available  system  parameter  and  a  choice  can  be  made  among  these 
orbits  to  achieve  a  desired  system  performance.  This  is  to  be  contrasted  with  the  situation 
in  which  the  attractor  is  not  chaotic  but  periodic,  in  which  case  small  parameter 
perturbations  can  only  produce  small  changes  in  the  system  orbit. 

To  appreciate  the  implications  of  what  has  been  said  above,  consider  a  system  which 
may  be  required  to  be  used  for  different  purposes  and  be  expected  to  operate  under 
different  conditions  at  different  times.  If  the  system  is  chaotic,  this  type  of  multiple-use 
situation  might  be  accommodated  without  altering  the  gross  system  configuration.  In 
particular,  depending  on  the  use  desired,  the  system  behaviour  can  be  changed  by 
switching  the  temporal  programming  of  the  small  parameter  perturbations  so  as  to 
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purposely  building  chaotic  dynamics  into  the  system  can  be  a  way  of  accommodating  the 
desired  flexibility. 

OGY  imagined  a  situation  in  which  the  dynamical  equations  describing  the  system  are 
not  known,  but  the  experimental  time-series  of  some  scalar  variable  can  be  measured  and 
using  delay  coordinates  [2]  a  surface  of  section  can  be  constructed.  The  piercings  of  the 
experimental  surface  of  section  are  then  studied  and  a  particular  unstable  periodic  orbit 
which  gives  the  best  system  performance  is  selected  (using  embedding  techniques 
discussed  by  Gunaratne  et  al  [3]).  The  system  trajectory  is  observed  till  it  falls  within  a 
strip  of  predefined  width  of  the  selected  unstable  orbit,  at  which  juncture  a  suitable 
parameter  p  of  the  system  is  adjusted  so  that  the  system  trajectory  at  the  next  piercing  of 
the  surface  of  section  falls  on  the  stable  manifold  of  the  selected  unstable  orbit.  When  this 
occurs,  the  parameter  perturbation  can  be  set  to  zero  and  the  orbit  for  subsequent  time 
will  approach  the  desired  fixed  point  at  a  geometrical  rate. 

OGY  demonstrated  the  success  of  their  analysis  numerically  on  the  Henon  map.  The 
method  has  subsequently  been  successfully  applied  to  a  variety  of  systems  including 
magnetic  ribbons  [4],  spin-wave  systems  [5],  chemical  systems  [6],  electrical  diodes  [7], 
lasers  [8]  and  cardiac  systems  [9]. 

Some  important  features  of  the  OGY  technique  are: 

1.  no  model  for  the  dynamics  of  the  system  under  consideration  is  required, 

2.  the  required  changes  in  the  parameter  can  be  quite  small, 

3.  control  can  be  achieved  even  with  imprecise  measurements  of  the  system  eigenvalues 
and  eigenvectors,  and 

4.  the  method  is  applicable  to  any  system  whose  dynamics  can  be  characterized  by  a 
nonlinear  map. 

Shinbrot  et  al  [10,  11]  adapted  the  OGY  technique  to  situations  in  which  the  system 
can  be  modelled  by  a  set  of  equations  although  these  need  not  be  exact.  Their  technique 
has  been  shown  to  be  effective  even  in  the  presence  of  small-amplitude  noise  or  small 
errors  in  modelling. 

Dressier  and  Nitsche  [13]  showed  that  modifications  to  the  original  OGY  procedure 
were  necessary  when  the  attractor  is  reconstructed  from  a  time  series  using  delay 
coordinates  because  the  experimental  surface  of  section  map  depends  not  only  on  the 
current  value  of  the  control  parameter  but  also  on  its  preceding  value.  Rollins  et  al  [14] 
presented  a  recursive  proportional-feedback  algorithm,  which  is  an  adaptation  of  the 
method  of  Dressier  and  Nitsche  for  the  control  of  highly  dissipative  systems. 

Lai  et  al  [15]  extended  the  OGY  stabilization  of  dissipative  chaos  to  chaotic 
Hamiltonian  systems.  Lai  and  Grebogi  [16]  used  this  modification  to  synchronize  two 
signals  using  perturbations  proportional  to  the  instantaneous  difference  between  them. 
Alvarez-Ramirez  [17]  presented  a  nonlinear  feedback  control  in  which  the  feedback  is 
based  on  linearizing  the  input-output  dynamics  of  the  system  and  control  leads  to  large 
regions  of  stability. 

Newell  et  al  [18]  demonstrated  an  experimental  proportional-feedback  algorithm  for 
the  synchronization  of  chaotic  time  signals  generated  by  a  pair  of  independent  diode 
resonator  circuits. 

Bielawski  et  al  [19]  stabilized  unstable  periodic  orbits  by  applying  continuous 
feedback  on  a  control  parameter,  the  feedback  signal  being  proportional  to  the  difference 
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between  two  values  of  a  dynamical  variable  separated  by  a  time  interval  equal  to  the 
unstable  orbit  periodicity.  This  method  is  expected  to  be  efficient  for  systems  near  a 
bifurcation  point  and  for  highly  dissipative  systems. 

Pyragas  [20]  proposed  two  methods  for  the  control  of  unstable  periodic  orbits  with 
small  time-continuous  perturbations.  Feedback  combined  with  a  periodic  external  force 
of  a  special  form  is  used  in  the  first  method  (external  force  control);  while  the  second 
method  relies  on  self  controlling  delayed  feedback  (delayed  feedback  control).  Qu  et  al 
[21]  analyzed  Pyragas'  external  force  control  and  found  that  continuous  control  by 
feedback  of  a  single  variable  is  not  always  successful.  They  also  showed,  contrary  to 
Pyragas'  claim,  that  under  certain  conditions  positive  feedback  can  control  a  system 
where  negative  feedback  fails  to  do  so.  Pyragas  [22]  subsequently  extended  his  control  to 
stabilize  aperiodic  orbits,  while  Kittle  et  al  [23]  verified  the  effectiveness  of  such  control 
experimentally. 

Matias  and  Giiemez  [24,  25]  presented  a  new  method  for  the  stabilization  of  1-D 
iterated  maps  which  exhibit  chaos,  by  acting  on  the  system  variables  rather  than  on  some 
available  parameter.  The  method  is  implemented  by  applying  pulses  that  change  the 
system  variable  proportionally.  The  control  algorithm  consists  of  the  application  every 
An  iterations  of  a  feedback  to  the  variable  x  having  the  form 

xn  =  *„(!+  7), 

where  7  represents  the  strength  of  the  feedback  and  can  be  positive  as  well  as  negative. 
The  method  does  not  require  that  the  particular  dynamical  law  be  known.  However,  the 
ability  to  control  chaos  depends  critically  on  the  values  of  7  and  An.  The  latter  is  closely 
related  to  the  periodicity  of  the  observed  cycle,  this  being  a  multiple  of  An.  They 
illustrated  their  technique  by  an  application  to  the  logistic  and  exponential  maps. 
Subsequently  they  applied  their  method  to  the  isothermal  autocatalator  model  [26]  and 
the  Lorenz  and  Rossler  systems. 

Singer  et  al  [27]  demonstrated  experimentally  and  theoretically  that  low-energy, 
feedback  control  signals  can  be  utilized  efficiently  to  suppress  (laminarize)  chaotic  flow 
in  a  thermal  convection  loop.  Chen  and  Chou  [28]  showed  that  the  continuous  feedback 
approach  is  highly  effective  for  controlling  chaotic  systems.  Given  the  system's  model 
dynamic  equations,  the  control  is  designed  for  a  well-specified  system  performance.  The 
approach  was  applied  to  the  Lorenz  system  and  was  able  to  drive  the  system  to  any  steady 
state.  The  control  was  found  to  tolerate  both  measurement  noise  and  modeling 
uncertainty  as  long  as  these  were  bounded. 

Huberman  and  Lumer  [12]  introduced  a  simple  adaptive  control  mechanism  to  control 
nonlinear  systems  in  stable  orbits.  A  system  which  is  perturbed  away  from  a  stable  fixed 
point  (say  X*)  is  brought  back  to  the  fixed  point  by  introducing  changes  in  the  parameter 
(u)  such  that 


Sinha  et  al  [29]  extended  this  method  to  multi-parameter  and  higher  dimensional 
nonlinear  systems.  They  also  suggested  a  method  by  which  periodic  motion  such  as  limit 
cycles  can  be  adaptively  controlled  and  demonstrated  the  robustness  of  the  procedure  in 
the  presence  of  (additive)  background  noise. 
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John  and  Amritkar  [30]  suggested  a  method  for  controlling  chaotic  systems  which  is 
able  to  synchronize  the  system  trajectory  to  a  desired  unstable  orbit.  Their  method  uses 
adaptive  control  [12]  and  introduces  time-dependent  changes  in  the  system  parameters 
depending  on  two  factors  :  (1)  the  difference  between  the  system  output  variables  {«/} 
and  the  corresponding  variables  of  the  desired  orbit  {y/},  and  (2)  the  difference  between 
the  values  of  the  parameters  (//,-)  which  are  controlled  and  their  values  (/x*)  for  the 
desired  orbit.  They  demonstrated  the  effectiveness  of  their  method  using  the  Lorenz  and 
Rossler  systems.  They  also  showed  that  their  method  could  be  used  for  communication 
purposes. 

There  are  other  interesting  and  simple  methods  [31-38]  for  controlling  of  chaos.  For 
example,  entrainment  control  [31],  suppression  of  chaos  by  weak  periodic  parametric 
perturbation  [32]  and  addition  of  second  periodic  force  [33]  are  also  possible. 


3.  Synchronization 

Although  we  have  mentioned  synchronization  en  passant,  the  possibility  of  synchroniza- 
tion in  chaotic  systems  is  not  obvious.  Two  identical  autonomous  chaotic  systems  started 
at  nearly  identical  initial  points  in  phase  space  have  trajectories  which  quickly  become 
uncorrelated,  even  though  each  maps  out  the  same  attractor  in  phase  space.  Pecora  and 
Carroll  [39]  were  the  first  to  describe  the  linking  of  two  chaotic  systems  with  a  common 
signal  such  that  the  two  systems  synchronized,  i.e.  the  trajectories  of  the  two  systems 
converged  and  remained  in  step.  Their  results  were  obtained  for  flows  (differential 
equations),  but  can  be  applied  to  iterated  maps  with  only  small  modifications.  They 
considered  an  autonomous  n-dimensional  dynamical  system 

u=f(u), 

divided  the  system,  arbitrarily,  into  two  subsystems  [u=  (v,w)].  They  created  a  new 
system  u'  identical  to  the  original  u  system  and  substituted  the  set  of  variables  v  for  the 
corresponding  v'  in  the  new  system.  The  behaviour  of  the  new  system  depends  on  the 
Lyapunov  exponents  of  the  w  subsystem — referred  to  as  the  sub-Lyapunov  exponents 
(SLEs).  They  suggested  the  following  theorem:  the  subsystems  w  and  w'  will  synchronize 
only  if  the  SLEs  are  all  negative. 

This  theorem  is  necessary,  but  not  sufficient,  for  synchronization.  It  says  nothing  about 
the  set  of  "initial  conditions"  for  which  w'  will  synchronize  with  w.  They  demonstrated 
synchronization  in  the  Lorenz  and  the  Rossler  systems  as  well  as  in  electronic  circuits  and 
their  numerical  models  [40].  Incidentally,  Amritkar  and  Gupte  [41]  have  proposed  a 
qualitative  measure  of  correlation  between  coevolving  dynamical  systems  which  is  very 
useful  in  characterizing  synchronization  in  chaotic  systems. 

Periodically  driven  nonlinear  systems  can  exhibit  multiple-period  behaviour  (period-2, 
period-3,  etc.).  Several  such  systems,  when  driven  by  the  same  drive,  can  be  on  identical 
attractors  but  remain  out  of  phase  with  each  other.  This  means  that  the  basins  of  attraction 
for  multiple-period  systems  can  be  divided  into  domains  of  attraction,  one  for  each  phase 
of  the  motion.  A  period-n  attractor  will  have  n  domains  of  attraction  in  its  basin.  This  out- 
of-phase  situation  is  stable — small  perturbations  will  not  succeed  in  getting  the  systems 
in  phase.  Pecora  and  Carroll  [42]  showed  that  by  using  an  almost  periodic  driving  signal 
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(generated  from  various  chaotic  systems)  one  can  simultaneously  (1)  keep  the  motion  of 
the  systems  nearly  the  same  as  the  periodic  driving  signal,  (2)  keep  the  basins  of 
attraction  nearly  the  same,  and  (3)  eliminate  the  n  domains  of  attraction.  In  other  words, 
there  will  be  only  one  domain  for  the  basin.  This  means  that  any  number  of  such  driven 
systems  will  always  be  in  phase. 

Kittle  et  al  [23]  pointed  out  that  the  stabilization  of  a  desired  aperiodic  orbit  can  be 
interpreted  in  terms  of  Pecora  and  Carroll's  method  of  synchronization.  To  stabilize  a 
desired  orbit  we  need  to  know  a  prerecorded  signal  y*  (t),  which  is  sufficient  to  control  the 
output  of  a  given  system  to  a  desired  orbit  for  a  given  controlling  method.  The 
prerecorded  signal  y*  (t)  can  be  replaced  by  the  output  signal  of  an  additional,  identical 
chaotic  system.  The  latter  can  be  seen  as  the  driving  system,  and  the  original  can  be 
interpreted  as  the  response  system.  The  problem  of  stabilizing  a  prerecorded  desired 
aperiodic  orbit  is  thus  equivalent  to  the  problem  of  synchronizing  two  identical  chaotic 
systems. 

Kapitaniak  [43]  applied  the  continuous  controlling  method  developed  by  Pyragas  [20] 
to  achieve  synchronization  of  two  chaotic  systems.  Murali  and  Lakshmanan  [44]  also 
used  Pyragas'  method  to  synchronize  two  identical  nonlinear  oscillators  and  to  transmit 
signals  in  a  secure  way. 

Ding  and  Ott  [45]  pointed  out  that  exact  synchronism  may  also  occur  for  a  large  class 
of  systems  that  are  not  replicas  of  the  original  system  as  suggested  by  Pecora  and  Carroll. 
They  were  able  to  affect  the  following  improvements  over  Pecora  and  Carroll's 
technique: 

1.  achieving  synchronization  when  a  replica  subsystem  does  not  synchronize  (i.e.  A  >  0 
for  the  replica  subsystem), 

2.  enabling  faster  convergence  to  the  synchronized  state, 

3.  eliminating  or  reducing  the  size  of  spurious  subsystem  basins  of  attraction  in  which 
the  subsystem  does  not  synchronize,  and 

4.  improving  the  performance  of  signal  recovery  techniques  for  situations  where  a 
chaotic  time  series  is  used  to  mask  a  small  information  bearing  signal  [44,  46]. 

Gupte  and  Amritkar  [47]  demonstrated  that  it  is  possible  to  stabilize  unstable  periodic 
orbits  of  chaotic  attractors  by  using  a  suitable  drive  variable,  the  choice  of  drive  variable 
being  dictated  by  the  values  of  the  SLEs  of  the  response  system.  They  found  that  for  the 
Lorenz  and  Rossler  systems,  the  SLEs  of  some  of  the  unstable  fixed  points  appeared  to 
govern  the  locking  to  chaotic  orbits  and  to  periodic  orbits  as  well.  On  the  other  hand,  the 
SLEs  of  the  unstable  period-six  orbit  appeared  to  govern  the  properties  of  the  chaotic 
orbit  for  the  Duffing  oscillator.  They  also  noted  that  the  actual  chaotic  trajectories  for 
these  systems  wound  around  the  unstable  fixed  points  or  the  periodic  orbits  for  which  the 
agreement  between  the  SLEs  was  observed.  They  conjectured  that  the  SLEs  of  a  given 
chaotic  orbit  retained  the  memory  of  the  unstable  periods  that  were  responsible  for  its 
origin. 

4.  Exponential  control 

We  [48,  49]  have  succeeded  in  implementing  a  somewhat  novel  method  of  controlling 
chaos.  We  impose  a  multiplicative  functional  feedback  control  on  a  system  parameter. 
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The  control  is  of  the  exponential  form  whose  argument  is  proportional  to  the  feedback 
response  of  the  system,  i.e.  the  difference  between  the  desired  value  and  the  actual  value 
of  one  suitably  chosen  variable  of  the  system.  Pecora  and  Carroll's  method  uses  a  suitably 
chosen  variable  of  the  system  as  a  drive  for  controlling  chaos.  The  OGY  method  controls 
chaos  by  perturbing  the  system  parameter.  Our  method  is  a  combination  of  variable  and 
parametric  control  as  it  achieves  control  by  modulating  the  parameter  through  the 
variable.  The  control  can  stabilize  unstable  fixed  points,  unstable  limit  cycles  as  well  as 
chaotic  trajectories.  It  is  effective  both  for  maps  and  flows. 
Consider  a  general  N-dimensional  dynamical  system 

?  =  ?(?;M;0  (1) 

where  3?  =  (Xi,X2,  .....  ,XW)  are  variables  and  fj,  =  (^1,^2,  ••••^K)  are  controlling 
parameters  whose  values  determine  the  nature  of  the  dynamics.  The  stabilization  of  a 
desired  unstable  attractor  or  a  chaotic  trajectory  is  possible  by  multiplying  a  suitably 
chosen  controlling  parameter  say  p,r  in  (1)  by  an  exponential  feedback  control  involving 
only  one  suitably  chosen  variable  say  X/  such  that  the  form  of  control  is 

exp[e(X,-Xf)]  (2) 

where  X/  is  the  actual  value  of  one  of  the  variables  of  the  system  after  applying  the 
control,  Xf  is  the  desired  value  of  that  variable  and  e  is  the  "stiffness"  of  the  control 
which  can  take  both  positive  and  negative  values. 
The  dynamics  of  the  modulated  system  in  the  presence  of  control  is  given  by 

X  =y(x";/zi,//2,...,Mrexp[e(X/-Xf)],..,^;0.  (3) 


The  form  of  the  control  is  such  that  it  becomes  passive  once  the  desired  goal  Xs   is 
achieved.  If  fluctuations  drive  the  system  off  the  desired  goal,  the  control  reactivates. 

The  control  works  for  those  combinations  of  controlling  parameters  and  variables  of 
the  system  for  which  the  real  part  of  the  largest  Lyapunov  exponent  of  the  modulated 
system,  the  system  after  the  control  is  applied  represented  by  (3),  is  negative.  The 
feedback  function  in  the  expression  of  the  control  involves  only  one  selected  variable  X/ 
to  convert  the  desired  repellor  -  a  fixed  point,  a  limit  cycle  or  a  chaotic  orbit,  into  an 
attractor,  implying  that  knowledge  of  only  one  variable  on  the  desired  unstable  orbit  is 
sufficient  to  settle  the  system  on  to  that  orbit.  This  makes  exponential  control  of  particular 
significance  for  systems  with  several  degrees  of  freedom.  As  an  example,  one  unstable 
fixed  point  of  the  Lorenz  system  [50]  can  be  stabilized  using  exponential  control  with 
feedback  depending  on  any  one  of  the  X,  Y  or  Z  variables,  whereas  an  unstable  limit 
cycle  or  a  chaotic  trajectory  of  the  system  can  be  converted  into  an  attractor  only  if  we 
use  feedback  depending  on  the  Z  variable.  For  stabilizing  most  repellors,  it  is  sufficient  to 
apply  exponential  control  to  one  selected  parameter  of  the  Lorenz  system.  But  for 
stabilizing  one  of  the  two  fixed  points  of  the  Henon  map  [51],  exponential  control  has  to 
be  applied  to  both  the  parameters  a  and  b.  Some  of  these  features  may  also  be  true  of 
other  forms  of  control.  Although  in  all  the  cases  we  have  studied,  control  involving  only 
one  variable  on  the  desired  orbit  is  sufficient  to  make  all  the  Lyapunov  exponents  of  the 
system  negative,  this  may  not  be  true  for  all  systems.  But  the  fact  remains  that  the  control 
uses  only  a  subset  of  the  variables  and  the  parameters  for  controlling  chaos.  We  have 
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successfully  tested  our  control  for  stabilizing  different  types  of  orbits  of  the  logistic  map, 
the  Henon  map  and  the  Lorenz  system. 

We  can  extend  our  control  to  convert  unstable  fixed  points  of  higher  period  say 

(X*1 ,  X*2, . . . ,  X*k)  to  stable  fixed  points  for  given  values  of  the  controlling  parameters. 
What  is  required  is  a  feedback  that  encodes  as  much  information  about  a  periodic  orbit  as 
is  necessary  for  its  unique  characterization.  But  there  is  a  problem  here  of  practicality. 
Since  in  such  cases  the  form  of  the  control  given  by  (2)  requires  the  convergence  of  the 
chosen  variable  to  one  of  the  k  values  of  that  variable,  the  controlling  technique 
diminishes  in  utility  with  increase  in  period.  For  higher  period  orbits  of  a  discrete 
dynamical  system  a  more  effective  control  is  one  which  employs  a  logical  OR  structure  in 
the  feedback  function.  So  in  order  to  stabilize  unstable  fixed  points  of  period  fc,  i.e. 

{X*1 },  one  of  the  parameters  p,r  in  (1)  is  multiplied  by 

/     k 
exp    e 

V  <'=1 

where  X/(«)  is  the  value  of  the  chosen  variable  X;  at  time  n  of  the  modulated  map  after 
applying  control  and  (X*l,X*2,  ....,Xi*k]  is  the  set  of  k  values  of  Xi  on  the  desired 
unstable  period-/:  orbit.  We  have  successfully  implemented  such  control  in  the  case  of  the 
logistic  map  for  stabilizing  a  period-2  orbit  [48]. 

A  quantity  of  obvious  interest  in  the  context  of  controlling  chaos  is  the  time  required 
for  the  system  to  settle  on  to  the  desired  orbit.  This  of  course  depends  upon  the  stiffness 
of  the  control  i.e.  e.  For  a  given  e  we  study  the  time  r  required  for  the  system  to  approach 
within  a  distance  u  of  the  desired  orbit  starting  from  some  initial  point.  If  UJQ  is  the  initial 
distance  from  the  desired  orbit,  then  it  is  clear  that  the  length  of  the  transient  r  and  uj  are 
related  by  ui  =  UQ  exp(Ar),  where  A  is  the  real  part  of  the  largest  Lyapunov  exponent.  The 
slope  of  the  plot  of  r  against  In(cj/cu0)  is  nothing  but  I/ A.  This  of  course  has  to  be 
negative  for  convergence.  These  values  of  the  Lyapunov  exponents  are  found  to  be  in 
good  agreement  with  those  obtained  numerically  [52]  and,  wherever  possible, 
analytically.  We  also  study  the  dependence  on  e  of  the  transient  time  r  required  for 
settling  on  to  different  types  of  orbits  for  different  systems  characterized  by  ln(o;/a;o)  as 
well  as  on  the  accuracy  u  with  which  it  does  so.  We  find  that  r  is  initially  a  decreasing 
function  of  e.  But  there  exists  an  optimum  stiffness  of  control  beyond  which  increasing  e 
can  cause  r  to  increase.  This  dependence  of  r  on  e  is  found  to  be  the  same  as  that  of  A  on 
e  for  any  orbit. 

In  addition  to  this,  in  the  case  of  discrete  maps  we  have  also  tried  to  stabilize  orbits 
which  are  not  the  natural  fixed  points  (stable  or  unstable)  of  the  original  system.  We  have 
found  that  exponential  control  succeeds  in  creating  new  stable  attractors  which  are  not  the 
natural  attractors  of  the  unmodulated  system.  However  we  cannot  stabilize  any  arbitrary 
orbit.  The  functional  form  of  the  map  and  the  control  criterion  decide  which  orbits  can  be 
forced  on  to  the  system. 

There  is  one  drawback  of  the  control  -  in  any  system,  if  an  unstable  fixed  point  is 
represented  by  a  null  vector  then  it  cannot  be  stabilized  using  exponential  control.  The 
reason  is  easy  to  understand.  The  Jacobian  of  the  modulated  system  given  by  (3) 
evaluated  at  such  a  point  in  the  presence  of  the  control  is  the  same  as  of  the  unmodulated 
system  (1).  So  the  eigenvalues  of  the  system  remain  unchanged  in  the  presence  of  the 
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Figure  1.  Plot  of  the  transient  time  r  vs  ln(w/u;o)  for  the  Lorenz  attractor  with 
a  =  10,  b  —  8/3  and  r  =  60  while  stabilizing  the  unstable  fixed  point 
(\/b(r  —  1),  \/b(r—  l),r  —  1).  Values  of  r  for  Pecora  and  Carroll's  method  with 
X-drive  (triangles)  and  7-drive  (squares)  are  compared  with  those  obtained  by  using 
exponential  control  involving  the  parameter  r  and  the  variable  X  (circles)  and  also  r 
and  y  (stars). 


control.  Hence  such  a  fixed  point  remains  unstable  even  in  the  presence  of  control.  Such 
points  may,  however,  get  stabilized  in  the  process  of  stabilizing  other  fixed  points, 
resulting  in  the  coexistence  of  more  than  one  attractor. 

We  have  compared  the  transient  times  and  the  basins  of  attraction  for  exponential 
control  with  other  forms  of  control  which  share  with  it  the  following  features: 

1.  the  control  is  continuous, 

2.  it  depends  only  on  the  variable(s)  of  the  desired  orbit, 

3.  is  independent  of  the  previous  values  of  the  controlling  parameters  fj,,  if  the  values  of 
the  parameters  are  changing  in  the  presence  of  control,  and 

4.  does  not  increase  the  dimensionality  of  the  original  system. 

To  compare  our  control  with  Pecora  and  Carroll's,  both  for  X-  and  7-drives  in  the 
Lorenz  system,  we  consider  the  control  where  the  parameter  r  is  multiplied  by  the 
exponential  feedback  function  involving  the  X  and  Y  variables  keeping  the  same  set  of 
parameter  values  in  both  cases.  We  find  that  the  value  of  A  with  this  form  of  the  control  is 
-4.536  •  •  •  for  e  =  1.7.  We  also  find  that  A  for  the  control  involving  the  parameter  r  and 
the  variable  Y  is  -7.593  •  •  •  for  e  =  -0.05  for  the  same  set  of  parameter  values.  The 
corresponding  values  of  A  for  Pecora  and  Carroll's  method  are  - 1 .83  for  the  X-diive.  and 
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Control  of  chaos 

—2.85  for  the  F-drive  [39].  Our  values  of  A  are  less  than  both  these  values  and  hence 
exponential  control  can  be  considered  to  take  hold  more  rapidly.  To  verify  this  we  find  the 
transient  time  r,  required  to  stabilize  the  second  unstable  fixed  point  of  the  Lorenz  system 
for  a  given  set  of  parameter  values,  for  different  values  of  u>  using  Pecora  and  Carroll's 
method  and  for  exponential  control  involving  the  parameter  r  and  the  variables  X  and  Y 
for  £  =  —1.7  and  e  =  —0.05  respectively.  We  plot  the  variation  of  r  with  ln(w/u>o)  in 
figure  1,  which  confirms  that  our  control  stabilizes  the  unstable  fixed  point  faster  than 
Pecora  and  Carroll's.  This  is  not  altogether  surprising  in  view  of  the  fact  that  our  control 
has  an  exponential  form. 

We  have  also  compared  our  control  with  those  suggested  by  Singer  et  al  [27],  and  Chen 
and  Chou  [28].  For  the  first  method,  we  find  that  the  basin  of  attraction,  while  stabilizing 
nonzero  fixed  points  of  the  Lorenz  system,  is  smaller  than  the  one  obtained  using 
exponential  control.  Also,  the  transient  time  is  of  the  order  ~  105  which  is  larger  than  that 
with  exponential  control  (~  103).  The  method  of  Chen  et  al  [28]  shows  the  same  transient 
time  for  stabilizing  the  unstable  fixed  points  of  the  Lorenz  map  but  the  basin  of  attraction 
is  smaller  than  that  obtained  with  exponential  control.  These  comparisons  should  not  be 
taken  to  imply  that  exponential  control  will  always  be  the  better  choice.  They  only  show 
that  the  performance  of  exponential  control  is  different  from  other  controls  and  may  be 
more  efficient  for  some  systems. 

With  exponential  control  it  is  also  possible  to  control  spatiotemporal  chaos.  A 
spatiotemporal  system  can  be  described  by  a  coupled  map  lattice  (CML)  model  [53-57]. 
Feedback  pinnings  [58]  are  used  to  control  chaos  in  the  system  by  stabilizing  a  certain 
unstable  reference  state.  To  control  such  a  system  F,  we  use  pinnings  defined  by 

L/l 

(5) 


where  I  is  the  distance  between  two  neighbouring  pinnings,  Xn(i)  is  the  reference  state  to 
be  stabilized  and  6(j]  =  1  for  j  =  1  and  8(j)  =  0  otherwise.  We  have  succeeded  in 
stabilizing  various  unstable  spatiotemporal  patterns  like  an  unstable  homogeneous 
stationary  state,  an  unstable  space-period-two  pattern  and  a  time-period-two  with  a  space- 
period-four  running  wave. 


5.  Application  of  synchronized  chaos  to  communication 

Cuomo  and  Oppenheim  [46]  used  an  analog  circuit  implementation  of  the  chaotic  Lorenz 
system  to  illustrate  the  use  of  synchronized  chaotic  systems  in  communications.  The  set 
of  equations  representing  Lorenz  based  chaotic  circuits  are 

u  =  cr(v  —  u] 

v  —  ru  —  v  —  20uw 

w  —  5uv  —  bw.  (6) 
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Figure  2.    The  error  i.e.  a'(t)  -  a(t),  in  regeneration  of  the  information  signal  at  the 
receiver. 


drive  signal.  The  basic  idea  was  to  modulate  a  transmitter  coefficient  b  with  the 
information-bearing  wave  form,  m(t),  which  was  taken  to  be  a  square  wave.  The  square 
wave  changed  the  value  of  b  to  a  new  value  bn  with  the  one  bit  and  no  change  was 
produced  with  the  zero  bit.  A  chaotic  drive  signal  u(t]  was  transmitted.  At  the  receiver, 
the  modulation  produced  a  synchronization  error  between  the  received  drive  signal  and 
the  receiver's  regenerated  drive  signal  ur  with  an  error  signal  amplitude  that  depended  on 
the  modulation. 

We  have  tried  to  use  exponential  control  for  the  modulation/detection  process  in 
communications.  The  idea  is  to  take  the  transmitter  and  receiver  systems  to  be 
represented  by  the  same  set  of  equations  with  the  same  set  of  values  for  the  controlling 
parameters.  As  demonstration  we  choose  the  Lorenz  system  to  represent  the  transmitter 
and  the  receiver,  with  the  parameter  values  a  =  16.0,  b  —  4.0,  and  r  =  40.0.  The  largest 
Lyapunov  exponent,  for  these  values  of  the  parameters  is  positive,  implying  that  the 
system  is  in  a  chaotic  regime.  We  chose  an  arbitrary  chaotic  trajectory  starting  with  the 
initial  point,  say,  (10.0,0.0,30.0).  The  information  signal  a(r)  is  taken  to  be  a  square 
wave  defined  by 


a(i)  =  10        0  <  t  <  5Af 
=  0          5A?  <  t  <  lOAf 


(7) 


where  Af  is  the  integration  step  taken  to  be  0.001.  We  superimpose  the  information  signal 
a(t)  on  one  of  the  variables,  say,  X(t)  of  the  drive  system  such  that  the  resultant  signal 
s(t)  =  a(t)  +X(t).  Then  s(t)  and  another  variable,  say,  Z(t)  of  the  drive  system  are 


transmitted.  At  the  receiver,  we  start  with  an  arbitrary  initial  state,  say,  (X°r,  7°,Z°).  We 
apply  b  —  Z  control  such  that  the  parameter  b  gets  multiplied  by  exp(e(Zr  —  Z)),  where  Zr 
is  the  Z  coordinate  of  a  point  on  a  chaotic  trajectory  of  the  receiver,  evolving  in  the 
presence  of  exponential  control,  so  that  its  dynamics  can  be  represented  by 

Xr  =  a(Yr-Xr] 

ir  —     X.rZjr  H-  rX.r      ir 

Zr  =  XrYr  -  b  exp(e(Zf  -  Z))Zr.  (8) 

We  choose  the  value  of  e  to  be  0.034  for  which  the  transmitted  chaotic  trajectory 
becomes  stable  in  the  presence  of  exponential  control.  Consequently  nearby  trajectories 
converge  to  the  desired  trajectory.  Thus  after  transients  have  died  out,  the  information 
signal  can  be  regenerated  by  subtracting  the  Xr(t]  w  X(t)  signal  from  the  transmitted 
signal  s(t),  i.e,  a(t)  w  a'(i]  =  s(t)  —Xr(t).  We  find  that  the  error  in  the  regeneration  of 
the  information  signal  a(t]  for  the  chosen  values  of  the  constants,  lies  between  0.1%  and 
1.0%  of  the  amplitude  of  the  information  signal.  This  can  be  seen  in  figure  2. 

The  advantage  of  using  this  approach  for  the  modulation/detection  of  a  signal  is  that 
there  is  no  restriction  on  the  frequency  or  the  amplitude  of  the  information  signal.  The 
quality  of  the  regenerated  signal  is  quite  good.  The  only  drawback  is  that  we  have  and 
transmit  two  signals,  viz.  s(t]  and  Z(t),  from  the  transmitter. 
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Effect  of  noise  on  coupled  chaotic  systems 


MANOJIT  ROY  and  R  E  AMRITKAR 

Department  of  Physics,  University  of  Pune,  Pune  411  007,  India 

Abstract.  The  effect  of  noise  in  inducing  order  on  various  chaotically  evolving  systems  is 
reviewed,  with  special  emphasis  on  systems  consisting  of  coupled  chaotic  elements.  In  many 
situations  it  is  observed  that  the  uncoupled  elements  when  driven  by  identical  noise,  show 
synchronization  phenomena  where  chaotic  trajectories  exponentially  converge  towards  a  single 
noisy  trajectory,  independent  of  the  initial  conditions.  In  a  random  neural  network,  with  infinite 
range  coupling,  chaos  is  suppressed  due  to  noise  and  the  system  evolves  towards  a  fixed  point. 
Spatiotemporal  stochastic  resonance  phenomenon  has  been  observed  in  a  square  array  of  coupled 
threshold  devices  where  a  temporal  characteristic  of  the  system  resonates  at  a  given  noise  strength. 
In  a  chaotically  evolving  coupled  map  lattice  with  the  logistic  map  as  local  dynamics  and  driven  by 
identical  noise  at  each  site,  we  report  that  the  number  of  structures  (a  structure  is  a  group  of 
neighbouring  lattice  sites  for  values  of  the  variable  follow  which  the  certain  predefined  pattern) 
follows  a  power-law  decay  with  the  length  of  the  structure.  An  interesting  phenomenon,  which  we 
call  stochastic  coherence,  is  also  reported  in  which  the  abundance  and  lifetimes  of  these  structures 
show  characteristic  peaks  at  some  intermediate  noise  strength. 

Keywords.    Chaos;  noise;  coupled  map  lattice. 
PACSNos    05.45;  47.52 


1.  Introduction 

Chaos  in  natural  and  human-made  systems  is  a  well  established  fact  by  now.  Systems  in 
diverse  disciplines  such  as  population  biology  [1],  physiology  [2],  hydrodynamics  [3], 
chemical  reactions  [4],  plasma  [5],  lasers  [6],  electronics  [7],  computing  networks  [8], 
economic  theory  [9],  social  theory  etc.  have  been  observed  to  exhibit  rich  and 
unpredictable  behaviour  of  chaos.  This  behaviour  has  been  identified  with  the  inherent 
nonlinear  nature  of  the  systems  rather  than  external  influences. 

On  the  other  hand,  the  very  same  nonlinearity  has  been  seen  to  give  rise  to  ordering 
phenomena  [10],  e.g.,  regular  formation  of  cloud  patterns,  a  variety  of  patterns  in 
hydrodynamic  flow,  oscillatory  patterns  in  chemical  reactions,  in  the  behaviour  of  lasers, 
pulse  propagation  in  Gunn  diodes  etc.  These  patterns  may  be  spatial,  temporal,  or 
spatiotemporal  in  nature  and  their  understanding  is  of  very  special  interest.  One 
particularly  interesting  ordering  phenomenon  is  the  generic  existence  of  different 
structures  in  a  turbulent  fluid  [11-18].  They  originate  and  degenerate  randomly  in  space 
and  time.  These  structures  appear  in  spite  of  the  fact  that  the  fluid  is  undergoing  a 
turbulent  evolution  and  no  clear  understanding  of  this  phenomenon  is  as  yet  achieved. 

Noise  has  been  known  to  play  a  detrimental  role  in  many  experimental  situations.  This 
motivated  researchers  to  develop  better  techniques  and  methods  to  minimize,  if  not 
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totally  remove,  the  effect  of  noise  and  enhance  the  signal-to-noise  ratio  and  hence  system 
performance.  A  slight  amount  of  noisy  perturbation  is  known  to  destroy  delicate  patterns 
in  spatially  extended  systems. 

In  the  light  of  these  facts,  interest  grew  when  evidence  to  the  contrary  started  appearing 
as  regards  the  effect  of  noise  on  ordering  phenomena.  Firstly,  it  has  been  observed  that 
addition  of  noise  of  a  given  strength  to  certain  nonlinear  systems  increases  the  system's 
response  at  a  particular  time  scale,  thereby  improving  the  signal-to-noise  ratio;  these 
findings  opened  up  an  entirely  new  field  of  research,  known  as  stochastic  resonance  [19- 
27].  Secondly,  noise  has  been  seen  to  influence  spatial  and  spatiotemporal  behaviour  of 
some  nonlinear  systems  in  a  quite  counterintuitive  manner.  It  is  observed  that  noise 
can  trigger,  select  and  sustain  patterns  in  optical  systems,  fluid  dynamical  systems,  etc. 
[28-32]. 

Considerable  progress  has  been  made  to  establish  a  connection  between  these  pattern 
forming  systems  (such  as  fluids)  and  the  theory  of  nonlinear  dynamical  systems.  A 
natural  next  step  is  to  see  how  noise  influences  the  spatiotemporal  evolution  of  these 
dynamical  systems,  with  the  hope  that  this  kind  of  study  may  shed  some  light  on  the 
abovementioned  noise-induced  features  in  physical  systems.  This  article  presents  a 
review  of  the  work  that  has  been  carried  out  on  the  effect  of  noise  in  inducing  order  in  an 
otherwise  chaotically  evolving  system,  with  particular  emphasis  on  systems  consisting  of 
coupled  chaotically  evolving  elements. 

We  have  organized  the  article  as  follows:  Section  2  discusses  noise  induced  ordering 
phenomena  observed  in  low-dimensional  uncoupled  systems.  In  §  3,  we  review  the  work 
on  effect  of  noise  on  two  spatially  extended  systems,  a  neural  network  and  array  of  firing 
elements.  Section  4  introduces  the  coupled  map  lattice  (CML),  one  of  the  most  popular 
models  of  nonlinear  dynamical  systems  with  spatial  extension.  In  this  same  section  we 
discuss  a  recent  work  that  we  have  carried  out  on  the  effect  of  noise  on  CML.  Section  5 
summarizes  and  concludes  the  article. 


2.  Effect  of  noise  on  uncoupled  systems 

In  this  section  we  investigate  some  examples  of  noise-induced  ordering  in  uncoupled 
systems. 

2.1  Synchronization 

In  some  systems  different  trajectories  get  synchronized  asymptotically  to  a  single  noisy 
trajectory  independent  of  the  initial  conditions  when  driven  by  an  identical  sequence  of 
noise  above  a  certain  strength.  Here,  synchronization  should  be  understood  as  the 
exponential  convergence  of  the  average  distance  between  any  two  phase  space  points. 
This  means  that  synchronization  is  essentially  a  nonchaotic  phenomenon  associated  with 
a  negative  Lyapunov  exponent  [33],  although  the  asymptotic  trajectory  can  be  very 
random.  (The  Lyapunov  exponent  A  characterizes  the  rate  at  which  the  distance  between 
neighbouring  trajectories  changes.  If  eo  is  the  initial  separation  between  two  trajectories, 
the  separation  after  time  t  can  be  written  as  et  ~  eoexp(Af).  A  positive  A  implies 
exponential  divergence  of  nearby  trajectories  and  hence  chaos.  For  multidimensional 
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systems  our  reference  to  'Lyapunov  exponent'  will  always  mean  the  largest  of  the 
Lyapunov  characteristic  exponents.) 

In  the  continuum-time  limit  such  systems  can  be  written  as  a  set  of  uncoupled 
Langevin  equations 

xt(t)=F(xt)+ri(t),  (1) 

where  */  is  a  dynamical  variable,  the  index  i  corresponds  to  different  trajectories,  i.e., 
different  initial  conditions,  the  (nonlinear)  function  F(x)  governs  the  dynamics  of  the 
system,  and  rj(t)  is  a  delta-correlated  ({^(0??(f/)}  =  (?72(0)^(f  ~  0)  n°isy  driving  force 
imparted  at  a  regular  interval  that  is  small  compared  to  all  relevant  macroscopic  time 
scales  of  the  system,  and  is  the  same  for  all  i.  Equations  (1)  have  the  synchronization 
solution  x(t)  =  xi(t).  Therefore,  the  question  essentially  is  whether  this  solution  is  stable 
or  not  for  a  given  range  of  system  parameters.  In  other  words,  if  an  appropriately 
constructed  A  <  0,  there  is  an  exponential  convergence  of  trajectories  and  hence 
synchronization,  and  if  A  >  0,  the  evolution  is  chaotic. 
Let  us  now  take  up  individual  cases. 

2.1.1  Particle  in  a  Newtonian  potential:  Fahy  and  Hamann  (FH)  [34]  have  studied  a 
Newtonian  particle  moving  in  a  smooth  multiminima  potential  V(x)  without  friction, 
subjected  to  the  condition  that  at  regular  time  intervals  r,  it  is  stopped  and  all  its  velocity 
components  are  reset  to  random  values  chosen  from  a  Gaussian  distribution.  They  obser- 
ved that  when  an  ensemble  of  such  particles  with  different  initial  conditions  is  driven  by  an 
identical  sequence  of  random  forces,  their  trajectories  asymptotically  get  synchronized  to 
a  single  noisy  trajectory  provided  that  r  is  less  than  certain  critical  value  rc.  On  the  basis 
of  this  observation  they  concluded  that  for  r  <  rc  the  final  trajectory  of  the  particles  is 
independent  of  the  initial  conditions  to  any  required  level  of  accuracy  (the  accuracy 
aspect  will  have  importance  later  in  our  discussion);  it  depends  only  on  the  choice  of 
velocities.  Thus  the  trajectory,  albeit  noisy,  is  not  chaotic  for  r  <  rc  because  of  exponen- 
tial convergence,  and  an  appropriately  defined  Lyapunov  exponent  A  is  negative.  They 
have  shown  that  for  any  one-dimensional  potential  V(x]  confining  the  particles  to  a  finite 
region  and  for  sufficiently  small  r,  the  average  rate  of  contraction  7  of  the  distance 
between  two  particles  initially  close  together  is  given  by  7  =  r@((dV/dx)2}/2m  +  C^r2), 
where  (3  —  \jk^T,  and  m  is  the  mass  of  the  particle  (angular  brackets  denote  an  average 
with  respect  to  the  Boltzmann  distribution).  The  above  mentioned  synchronization 
phenomenon  is  a  stronger  observation  than  the  well-known  statistical  phenomenon  in 
which  the  asymptotic  distribution  of  Brownian  trajectories  is  found  to  be  proportional 
to  the  Boltzmann  factor  exp[—/3V(jc)],  independent  of  their  initial  distribution.  FH 
have  also  conjectured  that  this  synchronization  feature  may  be  generic  to  all  bounded 
systems. 

2.1.2  Noisy  chaotic  systems:  Maritan  and  Banavar  (MB)  [35]  have  considered  the  effect 
of  noise  on  the  following  two  chaotic  systems.  First,  they  have  taken  a  noisy  logistic  map 
with  the  evolution  law 

x,+i  =4xt(l  -xj+rit,  (2) 
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where  t]t  is  a  uniform-deviate  random  number  chosen  from  an  interval  [—  W,  +W]  with 
the  constraint  that  0  <  xt+i  <  1.  Equation  (2)  may  be  considered  as  an  example  of  the 
discrete-time  version  of  eq.  (1)  with  At  =  1.  They  found  a  critical  Wc(~  0.5)  such  that 
when  a  pair  of  randomly  chosen  initial  conditions  is  driven  by  an  identical  sequence  of  77, 
with  W  >  Wc,  their  asymptotic  trajectories  become  synchronized  (within  a  given 
accuracy)  to  a  single  random  trajectory  independent  of  the  initial  conditions.  They  have 
computed  the  mean  squared  separation  J2  between  the  two  identically  driven  trajectories 
and  observed  that  d2  falls  off  for  W  >  Wc.  They  offered  an  intuitive  explanation  saying 
that  the  convergence  can  occur  for  logistic  map  due  to  the  contraction  of  d  whenever  the 
sum  of  the  pair  of  numbers  comes  close  to  unity,  because  the  distance  evolves  as 


Since  peaks  in  the  invariant  density  for  the  logistic  map  are  close  to  0  and  1,  W  >  \  can 
bring  both  x(\]  and  x(2)  near  \, 

The  second  system  considered  by  MB  is  the  Lorenz  system  described  by 

d*        .        .      dv  dz  . 

—  =  a(x-y),     —  =-xz  +  rx-y,     —  =  xy-bz,  (3) 

with  a  =  10,  b  =  f  ,  and  r  =  28.  The  ^-equation  is  then  evolved  in  difference  form  as 
y(t  +  A?)  =  y(t)  +  [-x(t)z(t)  +  rx(t)  -  y(t)]&t  +  77,  WtVAt,  (4) 


where  77,  is  again  a  delta-correlated  uniform-deviate  random  number  between  [0,  1]  (it  has 
a  nonzero  mean  unlike  in  the  logistic  case)  and  W;  is  the  amplitude.  Note  that  here 
addition  of  noise  is  unrestricted  unlike  in  the  logistic  case  where  the  boundedness  of  the 
phase  space  constrains  the  noise  to  depend  on  the  state  of  the  system.  For  the  Lorenz 
system  also  a  threshold  value  («  |)  for  Wj  was  found  beyond  which  the  synchronization 
phenomenon  was  observed  for  an  identically  driven  system  with  different  initial 
conditions.  It  is  reported  that  synchronization  does  not  occur  if  r?f  with  the  same 
amplitude  has  zero  mean. 

MB  have  maintained  that  although  the  systems  considered  above  exhibit  synchroniza- 
tion, their  strange  attractors  are  not  replaced  by  topologically  simple  structures  like  a 
fixed  point  or  limit  cycle. 

Pikovsky  made  a  cautionary  observation  [36]  that  for  all  bounded  systems  there  is 
always  a  nonzero  probability  that  any  two  phase  space  points  will  come  close  to  within 
some  e  >  0  with  or  without  noise.  In  other  words  after  a  sufficiently  long  time  the  two 
systems  will  be  synchronized  because  of  the  finite  precision  (of  the  computer)  and 
fmiteness  of  phase  space.  This  type  of  spurious  synchronization  will  occur  even  for 
systems  with  positive  A,  which  is  to  be  distinguished  from  the  physical  synchronization 
which  is  characterized  by  negative  A.  The  former  is  unstable  against  small  perturbations 
because  of  positive  A  and  so  will  not  be  observed  in  real  experiments,  whereas  the  latter  is 
quite  stable.  Pikovsky  found  A  for  the  noisy  logistic  system  of  MB  to  be  positive  and 
concluded  that  this  type  of  synchronization  is  a  numerical  artifact. 

MB  subsequently  noted  [37]  that  whether  or  not  a  physical  synchronization  occurs, 
identical  noise  with  sufficient  strength  drastically  enhances  the  probability  of  a  'close 
encounter'  between  any  two  ensemble  points. 
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There  has  been  an  attempt  [38]  to  interpret  synchronization  in  terms  of  the  inherent 
structural  instability  of  the  undriven  system.  These  authors  maintain  that  the  strange 
attractor  of  the  undriven  chaotic  system  gets  replaced  by  a  stable  fixed  point  under 
'*  parametric  perturbations,  resulting  in  synchronization  of  any  two  phase  space  points  and 

a  negative  A.  They  showed  this  for  all  the  three  systems  mentioned  above  by  treating 
noise  as  a  perturbation  in  the  parameter  and  also  considered  new  examples.  It  was 
remarked  that  this  phenomenon  is  not  generic  and  holds  for  only  those  systems  in  which 
this  type  of  structural  instability  occurs. 

The  picture  that  is  emerging  out  of  these  findings  is  that  while  an  understanding  of  the 
synchronization  phenomenon  may  still  remain  incomplete,  there  is  no  doubt  about  noise 
playing  a  crucial  role  in  drastically  increasing  the  chance  of  a  close  encounter  of  any  two 
phase  space  points  and  thereby  bringing  down  the  Lyapunov  exponents  of  the  otherwise 
chaotic  system. 

2.2  Violation  of  the  law  of  large  numbers 

An  interesting  observation  was  reported  [39]  regarding  the  chaotic  evolution  of  an 
ensemble  of  uncoupled  maps  driven  by  a  parametric  noise.  The  system  consists  ofN  local 
maps 

where  the  nonlinearity  parameter  at(i]  is  subjected  to  fluctuations  in  both  space  and  time. 
The  quantity  of  interest  is  the  mean  field  ht,  defined  as 

1 


For  uncoupled  variables  xt(i)  fluctuating  almost  independently,  ht  for  large  N  is  expected 
to  obey  the  law  of  large  numbers,  and  hence  the  mean-square  deviation  (MSD) 
(=  (hj)  —  (/*r)2)  should  vary  as  1/N,  and  converge  to  a  fixed-point  value  as  N  — >  oo.  It 
was  observed  that  with  the  logistic  map  in  the  chaotic  regime,  the  MSD  falls  off  as  ~  l/N 
if  at(i)  fluctuates  in  space.  On  the  other  hand  if  at(i)  =  at  is  independent  of  i  then  the 
MSD  saturates  with  N  beyond  a  critical  N  =  Nc,  whose  value  depends  on  the  strength  of 
the  noise.  Similar  behaviour  was  observed  using  other  maps  such  as  the  circle  and  tent 
maps.  The  author  claims  that  an  ensemble  of  uncoupled  chaotic  maps  with  spatially 
uniform  parametric  fluctuations  violates  the  law  of  large  numbers,  irrespective  of  the 
details  of  the  map. 


3.  Effect  of  noise  on  spatially  extended  systems 

We  shall  now  consider  systems  which  have  a  spatial  extension.  Because  of  increased 
complexity  owing  to  the  largely  enhanced  phase  space  dimensions  (one  mostly  talks  of 
infinite  dimensional  phase  spaces)  these  systems  show  very  rich  dynamical  features. 
Spatially  extended  systems  are  modelled  by  a  set  of  variables  (whose  evolution  may  be 
governed  by  discrete-time  nonlinear  maps  like  the  logistic  map,  or  continuous-time 
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ordinary  differential  equations  such  as  the  oscillator  equation)  physically  coupled  to  each 
other  in  euclidean  space. 

Let  us  consider  a  few  cases  of  such  systems  and  the  influence  of  noise  on  their 
evolution. 


3.1  Noise  and  suppression  of  chaos  in  a  neural  network 

Molgedey,  Schuchhardt  and  Schuster  (MSS)  have  investigated  the  effect  of  noise  on 
discrete-time  evolution  of  a  random  neural  network  with  infinite-range  interactions  [40]. 
The  model  consists  of  N  analog  neurons  {*,(/)},  i  =  1,  •  •  •  ,  N,  with  -1  <  x(i)  <  1, 
evolving  according  to  the  law 


where  the  function  F(h)  has  the  following  properties:  1.  it  is  odd  [F(—h)  =  —F(ti)];  2.  it 
approaches  ±1  as  h  —  >  ±00;  3.  it  increases  near  h  =  0  as  [dF/dh]h=Q  =  g  ,  where  g  >  0  is 
the  gain  parameter.  The  coupling  parameters  £y  are  delta-correlated  Gaussian  random 
variables  with  zero  mean,  ht(i)  denotes  the  internal  field  of  the  neuron  and  r)t(i)  is  the 
external  white  noise  with  zero  mean  and  variance  {^(1)^(7)  }  =  cP-8ij8tr  . 

MSS  used  a  dynamical  functional  approach  to  reduce  the  dynamics  of  the  entire  system 
to  an  equation  for  an  effective  single  neuron  in  the  thermodynamical  limit: 

*H-i=F(/0,  (8) 

with 

(hthr)  =  o28tT  +  (F(Ar_i)F(A^i))  (9) 

Equation  (8)  with  eq.  (9)  yields  the  same  averaged  dynamical  properties  as  for  eq.  (6). 
One  now  defines  the  activity  of  the  network  as  Kt  =  (h*)  which  for  small  K  reduces  to 

*,  =  a2  +£*#,_!  +0(K2_1).  (10) 


In  the  absence  of  noise  there  is  a  trivial  fixed  point  K*  =  0  for  eq.  (10)  and  is  stable 
for  g  <  1.  In  the  presence  of  noise  it  has  only  one  stable  fixed  point  in  the  range 
[0,1  +  a2]. 

To  study  the  chaotic  behaviour  of  the  network  a  replica  of  the  system  with 
infinitesimally  separated  initial  conditions  and  with  the  same  noise  rft(i]  =rf(f)  is 
constructed.  The  Lyapunov  exponent  for  the  system  is  defined  as 

•       i  If   I       _y2     N2v 

X=  lim  lim  —  log2Ur+T     ^'.  (11) 

™>x}^2r     g2    ((X]-Xtf}  (i  } 

Assuming  equilibrium  (((h}}2)  =  {(/J?)2}),  they  [40]  have  obtained  for  the  noiseless  case 
A  <  0  for  g  <  I  and  the  system  goes  to  the  trivial  fixed  point  K*  =  0.  For  g  >  1,  A  >  0 
and  the  system  shows  chaos. 
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Figure  1.  Plot  of  phase  diagram  (gc  versus  cr)  obtained  by  Molgedey  et  al  [40]  for  an 
infinite  range  neural  network  with  external  white  noise.  For  a  given  gain  g  chaos  is 
suppressed  for  sufficiently  strong  noise  a. 

They  have  numerically  studied  the  effect  of  noise  on  the  chaotic  properties  of  the 
system  using  the  following  form  for  F(h}: 


F(h}  = 


for/K-1/g, 
for-l/g</i 
for  +  l/g  <  h. 


For  g  less  than  some  critical  value  gc,  the  activity  K*  increases  with  noise  as  the  system 
settles  onto  the  stable  fixed  point  K*  =  1  +  a2,  whereas  for  g  >  gc  chaos  sets  in.  It  was 
found  that  for  small  noise  asymptotically  gc  =  1  -  o2  In  o2  and  for  large  noise 
gc  =  \/n/2  a.  Figure  1  is  the  plot  of  gc  versus  a  (the  phase  diagram),  which  clearly 
shows  the  chaotic  and  regular  regimes. 

MSS  concluded  that  for  higher  dimensional  systems  of  this  type  where  chaos  occurs 
essentially  due  to  the  randomized  interactions  amongst  constituent  nonchaotic  elements, 
noise  acts  to  impair  information  flow  between  these  elements  and  thereby  suppresses 
chaos. 

3.2  Spatiotemporal  stochastic  resonance  in  excitable  media 

Let  us  now  consider  the  observation  of  the  stochastic  resonance  phenomenon  in  a 
spatially  extended  pattern  forming  system,  as  reported  by  Jung  and  Mayer-Kress  (JMK) 
[41].  Stochastic  resonance,  as  the  name  suggests,  is  a  phenomenon  in  which  at  a  given 
noise-strength  certain  temporal  patterns  of  the  system  get  enhanced  drastically  owing  to 
an  increase  in  the  system's  sensitivity  [19,  20,  25].  This  behaviour  has  a  characteristic 
bell-shaped  curve  when  the  output  power  (of  the  Fourier  spectrum)  at  the  corresponding 
frequency  is  plotted  against  noise-strength,  with  the  peak  at  the  'resonating'  noise  value. 
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two-dimensional  equidistant  square  array  (with  lattice  constant  a)  of  N  x  N  noisy 
threshold  devices  d(i,  j),i,j=  1 , . . . ,  A7"  with  the  following  properties:  if  the  input  xt(ij) 
to  the  devices  is  below  some  threshold  b,  output  yt (i,  y)  =  0;  if  xt(i,  j}  >  b  the  device 
generates  a  spiky  output  of  the  form  y/(i,y)  =  /o6(x(i,  j))  8(xt(i,  j}  —  b}  and  then  goes 
into  temporary  hibernation  for  a  refractory  period  A?r.  The  dynamics  can  be  described  by 
a  linear  Langevin  equation  for  inputs  x,(i,  j}  as 

where  7  is  a  leakage  constant  which  accounts  for  the  thermal  leakage  due  to  interaction 
with  the  surroundings,  77,  (f,  j)  is  a  delta-correlated  noise  and  a  =  (xt(ij}2}.  Equation  (12) 
on  integration  over  At  gives  rise  to  the  discrete  time  dynamics 


i,  J)  =  xt(i, 


(13) 


where  Gt(i,j)  is  a  Gaussian  random  number  with  variance  <JG  =  cr[l  —  e  2TAf]. 

Threshold  devices  are  pulse-coupled,  i.e.,  when  an  element  d(k,l)  fires,  it 
communicates  with  its  surrounding  elements  d(i,j)  at  a  distance  r,yt^  by  contributing 
an  amount  K  exp(-/3r?.  w/a2)  to  their  inputs  xt(i,  j),  where  K  —>•  K/b  is  in  dimensionless 
unit  and  j3  is  a  dimensionless  quantity  describing  the  spatial  coupling  range.  For  large 
coupling  (K  >  exp  (/?)),  these  firing  elements  give  rise  to  excitatory  waves  spreading 
through  the  array,  such  as  spiral  waves,  target  waves  (single,  nonrepetitive  wavefronts), 
etc.  The  selection  of  the  wave  form  depends  on  the  geometry  of  initial  conditions.  There 
is  a  threshold  Kc  such  that  for  K  <  Kc  excitatory  waves  do  not  necessarily  start. 

However,  in  the  presence  of  noise  the  spreading  of  excitatory  waves,  such  as  spiral 
waves,  is  observed  in  subthreshold  regime  (K  <  Kc\  Any  such  noise-sustained  wave 
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Figure  2.  Plot  of  the  graph  of  the  time  averaged  number  of  excess  events  IJL  with 
noise  variance  cr,  for  a  typical  lattice  size  200  x  200  and  with  a  given  set  of  parameter 
values  [41]. 
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disappears  soon  after  the  noise  is  turned  off.  With  increasing  noise  the  spiral  wave 
evolves  with  a  larger  curvature,  i.e.,  with  an  enhanced  coherence  in  the  array.  If  the  noise 
is  increased  further,  the  spiral  breaks  up  and  there  is  a  loss  of  coherence. 

In  order  to  show  that  the  phenomenon  observed  above  is  essentially  spatiotemporal 
stochastic  resonance,  JMK  have  taken  a  solitary  initial  wave  and  defined  a  quantity  fj,,  the 
time  averaged  number  of  excess  events,  as  the  difference  between  the  number  of  firing 
elements  along  the  driving  wavefront  and  the  average  number  of  firing  events  along  a  row 
of  the  array,  not  affected  by  the  driving.  The  plot  of  /u  versus  a,  the  variance  of  the  noise, 
is  shown  in  figure  2.  The  plot  exhibits  the  characteristic  bell-shaped  curve  mentioned 
earlier.  Synchronization  of  the  spatiotemporal  firing  pattern  to  an  external  driving, 
thereby  giving  rise  to  the  resonating  peak,  is  a  simple  generalization  of  the  stochastic 
resonance  to  this  extended  system. 

We  shall  now  move  on  to  the  final  part  of  the  paper,  namely,  the  effect  of  noise  on 
coupled  map  lattices  (CML).  As  the  model  itself  has  a  plethora  of  interesting  dynamical 
features,  we  begin  with  an  introduction  to  CML. 


4.  Coupled  map  lattices 

Coupled  map  lattices  (CML)  are  spatially  extended  dynamical  systems  with  discrete 
space  (lattice),  discrete  time  and  continuous  state  evolution.  General  evolutionary 
dynamics  of  a  CML  may  be  expressed  as 

/CfoC/),  *,(/)),  (14) 

where  F  and  G  are  nonlinear  maps  and  the  state  variables  xf  vary  continuously  in  the 
phase  space  of  the  map,  and  i  is  the  space  index.  CMLs  are  in  general  constructed  with 
the  symmetry  properties  of  spatial  translational  and  rotational  invariance,  and  so  the 
coupling  term  ey  is  taken  to  be  a  uniform  scalar  ranging  from  the  nearest-neighbour 
coupling  to  global  coupling. 

CMLs  have  been  used  to  model  many  physical  phenomena.  These  include  pattern 
formation,  chemical  waves,  excitable  media,  nucleation,  crystal  growth,  charge  density 
waves,  population  dynamics,  fluid  dynamics,  etc.  [42-55].  Chemical  systems  exhibit  very 
rich  spatiotemporal  structures  and  patterns.  The  standard  description  of  these  systems  is 
in  terms  of  the  reaction-diffusion  equations  that  incorporate  the  combined  effects  of  local 
nonlinear  reaction  dynamics  and  global  diffusion  of  chemical  species  due  to  the 
concentration  gradients  in  the  system.  Many  of  the  averaged  features  and  even  some 
detailed  ones  of  chemical  patterns  can  be  captured  by  a  simple  CML  description  of  the 
dynamics.  Another  area  of  interest  is  the  phase  ordering  dynamics.  Phase  separation 
occurs  due  to  competition  between  different  states  in  the  system,  such  as  domain 
growth  in  ferromagnetic  and  chemical  systems.  Some  of  the  salient  features  of  phase 
ordering  dynamics  are  also  observed  in  the  CML  model.  CML  may  also  be  applied  in 
pattern  dynamics  with  an  excitable  state  and  a  relaxation  from  that  state.  Such  processes 
are  observed  in  reaction-diffusion  in  excitable  media  and  also  in  biological  problems 
such  as  heart  rhythm  and  electrical  activities  in  neural  tissues.  In  fluid  dynamics, 
simulations  using  CML  can  show  formation  of  convective  patterns,  vortices,  sinks, 


behaviour.  One  of  the  major  interests  in  studying  CML  is  in  the  context  of  understanding 
spatial  and  temporal  structure  formation,  specially  in  fluid  dynamics.  These  structures 
appear  in  spite  of  the  fact  that  the  underlying  evolution  of  the  system  is  spatiotemporally 
chaotic. 

One  of  the  most  extensively  studied  types  of  one  dimensional  CML  is  that  with 
nearest-neighbour  coupling  and  having  the  following  form: 


i)  =  F(*f  (i))  +     [G(*,(f  -  1))  +  G(*(i  +  1))  -  2G(*(i))J.  (15) 

We  shall  concentrate  on  the  form  G(x)  =  F(x),  which  reduces  eq.  (15)  to 

xt+l(i]  =  (l-  e}F(xt(i))  +  ~  [F(xt(i  -  1))  +  F(xt(i  +  1))].  (16) 

This  particular  model  is  known  as  the  future  diffusive  CML  since  the  diffusively  coupled 
entities  are  one  timestep  evolved  values.  Studies  of  this  model  alongwith  the  local 
dynamics  as  the  logistic  map  F(jc)  =  fjac(\  -  x]  have  revealed  that  they  can  exhibit  a  wide 
range  of  spatiotemporal  complexity.  It  has  been  observed  that  the  temporal  period  doub- 
ling behaviour  of  the  map  can  induce  spatial  domain  structures  separated  by  kinks  and 
antikinks.  A  pattern  selection  regime  is  observed  where  patterns  of  certain  characteristic 
lengths  are  selected.  Phenomena  such  as  spatiotemporal  quasiperiodicity,  soliton 
turbulence,  spatiotemporal  intermittency,  wavelength  doubling  bifurcations,  synchroniza- 
tion etc.  for  this  and  other  maps  (like  the  circle  map  0,+1  =  Bt  +  ft  -  (K/2n)  sin(27T0,), 
fi-angular  frequency,  £-nonlinearity  parameter)  have  been  identified  [56-64]. 

We  now  discuss  the  effect  of  noise  on  the  evolution  of  a  CML.  In  the  next  subsection 
we  report  a  novel  phenomenon  that  we  have  observed  in  this  context  [65]. 

4.1  Noise  and  'stochastic  coherence'  in  a  CML 

We  take  the  system  of  eq.  (16)  with  an  additional  noise  term  as  follows: 

*,+i(0  =  (1  -e)Ffe(O)  +~[F(xt(i-  1))  +  F(xt(i  +  1))]  +  77,,  (17) 

where  r?f  is  the  familiar  delta-correlated  noise.  The  logistic  function  F(x)  =  px(l  ~  x)  is 
used  as  local  dynamics.  For  noise  rjt  we  have  used  a  uniformly  distributed  random 
number  bounded  between  -W  and  +W,  with  the  constraint  that  0  <  xt+i  (i)  <  1;  we  call 
W  the  noise-strength  parameter.  Values  of  y,,  e,  and  L  (size  of  the  lattice)  are  chosen  so 
that  the  resultant  dynamics  of  the  system  is  chaotic. 

Now,  we  define  a  structure  as  a  region  of  space  such  that  the  difference  in  the  values  of 
the  variables  of  neighbouring  sites  within  this  region  is  less  than  a  predefined  small 
positive  number,  say  8,  i.e.,  \xt(i)  -  xt(i  ±  l}\  <  6.  We  call  8  the  structure  parameter.  We 
look  for  such  patterns,  or  structures,  to  appear  in  the  course  of  evolution  of  the  model 
given  by  eq.  (17). 

Figure  3  shows  a  plot  (on  a  log-log  scale)  of  the  distribution  of  the  number  n(l]  vs. 
length  /  of  the  structures  for  different  values  of  W,  with  fj,  =  4.0,  e  =  0.6,  8  =  0.0001  and 
L  =  1000,  and  open-boundary  conditions.  The  power-law  nature  of  the  decay  of  n(/)  is 

Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
280  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


Figure  3.  Plot  of  variation  of  number  n(l)  of  structures  with  length  /  for  a  lattice 
with  size  L  =  1000,  for  different  values  of  noise-strength  W  as  indicated.  Parameters 
chosen  are  e  =  0.6,  6  =  0.0001,  and  /^  =  4.0.  Open-boundary  conditions  are  used. 
Data  are  obtained  for  50000  iterates,  each  for  10  initial  conditions. 


clearly  evident,  which  has  a  form 
n(t)  oc  ra, 


(18) 


where  a  is  the  power-law  exponent.  This  indicates  that  the  system  does  not  have  any 
intrinsic  length  scale.  It  may  be  noted  that  in  the  absence  of  noise  (W  =  0.0)  the  decay  is 
manifestly  exponential  [66].  The  exponent  a  is  seen  to  depend  on  the  noise-strength  W, 
with  a  minimum  for  W  around  0.6.  We  define  the  average  length  7  of  a  structure  as 
1—  Z)*n(0/]Cn(0-  I*1  figure  4  we  plot  the  variation  of  T  with  W  for  values  of 
parameters  as  in  figure  3.  The  plot  exhibits  a  bell-shaped  nature  within  a  fairly  narrow 
range  of  W  around  the  value  0.6  (one  may  note  the  surprising  similarity  between  figures  4 
and  2,  though  they  refer  to  completely  different  phenomena).  It  may  be  noted  that  the 
minimum  of  a  also  occurs  for  W  quite  close  to  this  value,  as  expected. 

We  call  the  phenomenon  observed  above  stochastic  coherence.  This  is  similar  to 
stochastic  resonance  which  shows  a  bell-shaped  behaviour  of  the  temporal  response  as  a 
function  of  the  noise-strength,  as  mentioned  before.  However,  one  may  note  that  our 
system  does  not  have  any  intrinsic  length-scale,  whereas  in  stochastic  resonance  the  noise 
resonates  with  a  given  time-scale;  hence  our  use  of  the  word  coherence  rather  than 
resonance.  In  stochastic  resonance  noise  transfers  energy  to  the  system  at  a  characteristic 
frequency,  whereas  in  stochastic  coherence  noise  induces  coherence  in  the  system. 

To  study  the  evolutionary  aspects  of  these  structures  we  obtained  the  distribution  of  the 
number  w(r)  of  structures  vs.  their  lifetimes  r  for  different  W.  n(r)  is  observed  to 
decrease  with  r  with  a  stretched  exponential  type  of  decay  having  a  form 

n(r)  oc  exp(— (const. )r^),  (19) 
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Figure  4.    Plot  of  variation  of  average  length  /  of  structure  with  noise-strength  W, 
with  parameters  as  stated  in  figure  3. 


Figure  5.    Variation  of  average  lifetime  T  of  structures  with  noise-strength  W  shown 
for  parameters  as  stated  in  figure  3. 


where  £  depends  on  W.  We  define  the  average  lifetime  r  of  a  structure  as  T  =  ]T)  r  n(r}/ 
X)  n(r).  In  figure  5  we  plot  f  vs.  W  for  parameter  values  as  in  figure  3.  The  graph  shows 
a  bell-shaped  feature  with  a  maximum  for  W  around  0.6. 

In  order  to  ascertain  the  chaotic  nature  of  the  system  evolution  we  have  calculated  the 
Lyapunov  exponent  spectra  for  our  system.  We  find  a  number  of  Lyapunov  exponents  to 
be  positive,  implying  that  the  underlying  evolution  is  chaotic.  The  maximum  Lyapunov 
exponent  A  shows  a  minimum  around  noise-strength  0.6.  The  fact  that  we  have  observed 


IfMVT   /Yl~ 


other  extrema  for  similar  W  may  make  it  appear  that  the  origin  of  these  behaviours 
probably  lies  in  the  reduction  of  A  due  to  noise  [67]  (though  it  should  be  noted  that  the 
reduction  is  not  monotonic).  To  explore  this  possibility  we  have  studied  the  variation  of  A 
with  the  coupling  parameter  e.  We  find  that  A  remains  fairly  constant  for  0.2  <  e  <  0.8 
for  the  entire  range  of  W.  On  the  other  hand,  a  plot  of  the  variation  of  the  average  length  7 
with  e  for  fixed  W  shows  a  monotonically  increasing  behaviour,  quite  contrary  to  what  is 
expected  from  A.  This  implies  that  the  Lyapunov  exponent  alone  cannot  be  used  for 
proper  characterization  of  the  spatio-temporal  features  of  the  system  (unlike  the 
synchronization  phenomenon  discussed  earlier,  where  A  is  sufficient  to  fully  characterize 
the  behaviour). 

To  conclude  this  section,  we  have  reported  a  new  phenomenon  observed  in  a 
chaotically  evolving  one-dimensional  CML  driven  by  an  identical  noise,  which  we 
termed  stochastic  coherence.  It  is  observed  that  there  is  a  phenomenal  increase  in  the 
abundance  of  coherent  structures  of  all  scales  due  to  noise.  The  distribution  of  these 
structures  shows  a  power-law  decay  with  the  length  of  the  structure.  The  average  length 
as  well  as  the  average  lifetime  of  these  structures  exhibit  characteristic  maxima  at  certain 
noise-strength  values. 

5.  Summary  and  conclusion 

We  have  reviewed  the  work  that  has  been  carried  out  on  the  effect  of  noise  on 
evolutionary  dynamics  of  chaotic  systems.  In  the  case  of  uncoupled  systems,  we  have 
encountered  a  synchronization  phenomenon  for  different  phase  space  trajectories  driven 
by  identical  noise  above  a  certain  strength.  We  have  also  seen  nonstatistical  behaviour  for 
an  ensemble  of  chaotic  trajectories  driven  by  an  identical  sequence  of  parametric  noise. 
For  coupled  systems,  noise  above  a  given  strength  has  been  observed  to  suppress  chaos  in 
a  random  neural  network  with  infinite  range  interactions.  Spatiotemporal  stochastic 
resonance,  the  first  case  of  a  stochastic  resonance  phenomenon  observed  in  systems  with 
spatial  extension,  has  been  reported  for  a  two-dimensional  square  array  of  firing  devices. 
In  the  context  of  CML,  we  have  observed  an  interesting  phenomenon,  stochastic 
coherence,  when  the  entire  lattice  is  driven  by  identical  noise. 

As  noted  earlier  in  the  text,  understanding  the  formation  and  evolution  of  patterns  and 
structures  in  spatially  extended  systems  is  still  incomplete.  In  real  physical  situations, 
noise  almost  ubiquitously  influences  the  system  behaviour.  So  while  studying  the 
ordering  phenomena,  incorporating  noise  as  a  part  of  the  evolving  'supersystem'  and 
studying  the  mutual  interaction  of  the  two  subsystems  offers  a  more  practical  way  of 
looking  into  the  problem.  The  inherent  nonlinearity  in  the  system,  which  on  the  one  hand 
gives  rise  to  chaos  and  on  the  other  hand  ordered  behaviour,  is  surely  again  playing  its 
role  in  enhancing  order  under  the  influence  of  noise;  how  so  is  still  not  quite  clear.  We 
have  encountered  two  kinds  of  noise-induced  ordering  phenomena:  global  ordering  (such 
as  synchronization)  and  local  ordering  (spatiotemporal  stochastic  resonance  and 
stochastic  coherence).  Stochastic  coherence,  in  particular,  may  turn  out  to  be  a  promising 
way  towards  achieving  any  degree  of  understanding  of  the  structure  formation  in 
extended  systems,  although  a  lot  of  work  remains  to  be  done.  While  it  is  definitely  not 
claimed  that  all  questions  will  be  answered  with  this  approach,  we  certainly  hope  to 
bridge  a  few  gaps  in  the  existing  knowledge. 
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Abstract.  We  describe  the  rich  spectrum  of  spatio-temporal  phenomena  emerging  from  a  class  of 
models  incorporating  adaptive  dynamics  on  a  lattice  of  nonlinear  (typically  chaotic)  elements.  The 
investigation  is  based  on  extensive  numerical  simulations  which  reveal  many  novel  dynamical 
phases,  ranging  from  spatio-temporal  fixed  points  and  cycles  of  all  orders,  to  parameter  regimes 
displaying  marked  scaling  properties  (as  manifest  in  distinct  1  //  spectral  characteristics  and  power 
law  distributions  of  spatial  quantities). 

Keywords.     Spatio-temporal  chaos;  lattice  dynamical  models. 
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1.  Introduction 

A  basic  aim  of  modelling  in  physics  is  to  provide  suggestive  conceptual  frameworks  for 
understanding  complex  phenomena,  which  are  generic  in  physical  systems.  One  seeks  to 
construct  simple  yet  effective  models  which  are  capable  of  capturing  the  essence  of 
complicated  dynamic  processes  as  they  occur  in  nature.  For  instance,  one  dimensional 
unimodal  maps  can  serve  as  a  versatile  paradigm  for  low  dimensional  dissipative  chaos, 
as  they  yield  an  extraordinarily  rich  spectrum  of  behaviours.  From  low  dimensional 
nonlinear  systems,  which  are  now  reasonably  well  understood,  we  reach  out  towards 
nonlinear  extended  systems.  The  hope  is  to  build  prototypes  for  such  composite  complex 
systems  —  prototypes  which  can  yield  a  repertoire  of  dynamical  behaviour  reminiscent  of 
well  known  behaviour  in  a  wide  range  of  fields  where  spatial  extent  is  important,  for 
instance  turbulence,  pattern  formation  (such  as  dislocation  dynamics,  intermittency  in 
space-time,  spatial  patterns),  Josephson  junction  arrays,  neural  dynamics,  coupled 
systems  of  optically  bistable  devices,  electron-hole  plasmas  and  even  parallel  computing 
and  evolutionary  biology!  [1] 

So  complex  systems  research  has  emerged  as  a  field  of  vital  importance  in  the  last 
couple  of  decades,  especially  with  the  advent  of  high  powered  computing.  It  has  had 
impact  on  several  disciplines,  as  it  investigates  fundamental  dynamic  processes  which 
occur  in  areas  of  physics,  chemistry,  and  biology.  This  field  of  research  involves  the 
identification  of  dynamic  processes  and  concepts  which  can  be  employed  to  understand 
large  families  of  complex  systems.  It  searches  for  principles  of  synthesis  of  dynamic 
processes  at  one  level,  which  produce  important  dynamic  phenomena  at  another 
collective  level.  Thus,  in  general  terms,  this  field  of  research  seeks  to  describe  and 
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understand  the  origins  of  complex  dynamic  activities  and  to  uncover  the  methods  of 
modelling  and  possibly  controlling  various  types  of  hierarchical  dynamics. 

One  important  prototype  of  extended  complex  systems  is  a  nonlinear  dynamical  system 
with  spatially  distributed  degrees  of  freedom,  or  alternately  a  spatial  system  composed  of 
large  numbers  of  low  dimensional  nonlinear  systems.  There  are  two  principal  variations 
of  this  scheme:  first,  the  local  dynamics  can  be  simple  and  the  interactions  complicated  in 
form;  secondly,  we  can  have  complex  local  dynamics  and  simple  mode  of  transmitting 
information  between  'sites'  (or  'degrees  of  freedom').  Here  we  will  be  concentrating  on 
the  latter  situation  where  locally  there  is  strong  nonlinearity.  The  basic  ingredients  of 
such  systems  are:  (i)  creation  of  local  chaos  or  local  instability  by  a  low  dimensional 
mechanism  and  (ii)  spatial  transmission  of  energy  and  information  by  diffusion  or 
transport  (this  interconnectivity  can  work  as  a  global  organizing  principle  in  composite 
systems). 

In  principle  spatio-temporal  physics  is  completely  described  by  the  appropriate  partial 
differential  equations  (PDE).  Unfortunately,  though,  such  equations  are  extremely 
difficult  to  solve,  and  practically  intractable  without  simplifying  assumptions  and  without 
employing  distributed  computing  techniques.  In  order  to  follow  complex  systems 
dynamically  over  long  times  and  over  large  spatial  extents  without  resorting  to  heavy  duty 
computing,  physicists  have  used  their  imagination  and  evolved  a  hierarchy  of  models 
based  on  different  discretization  schemes.  These  computationally  efficient  schemes 
capture  some  of  the  essential  physics  of  a  very  extensive  range  of  natural  phenomena.  Two 
of  the  most  succesful  schemes  are  cellular  automata  (CA)  models  where  space,  time  and 
state  variables  are  all  discrete,  and  lattice  dynamical  systems  (LDS)  where  space  and  time 
are  discrete  and  state  variables  take  continuous  values.  These  models  have  led  to  the 
discovery  of  so  many  novel  phenomena  that  they  are  now  not  merely  the  poor  man's 
PDE,  but  bonafide  models  of  complexity  in  space-time  dynamics  in  their  own  right. 

The  aim  of  all  these  models  is  to  capture  in  increasing  degree  the  complexity  inherent 
in  nature  and  provide  fitting  descriptions  of  large  interactive  systems  comprising  of  many 
elements.  In  particular,  they  address  two  widely  occuring  phenomena:  spatial  fractality, 
i.e.,  the  spontaneous  evolution  of  self  similarity  in  space,  and  low  frequency  noise  or  I// 
noise  which  represents  the  lack  of  a  natural  scale  in  time,  therefore  a  kind  of  "fractality" 
in  time.  It  is  observed  that  many  functional  dependences,  arising  in  a  variety  of 
experimental  contexts,  look  like  straight  lines  when  plotted  on  a  log-log  scale,  over  a 
significantly  large  range.  So  these  functions  do  not  have  natural  length/time  scales,  that 
is,  they  are  scale  invariant  or  self  similar.  This  is  exemplified  in  space  by  spatial  fractality, 
which  is  ubiquitous  in  nature,  present  as  it  is  in  spatial  structures  as  diverse  as  mountain 
ranges,  coastlines,  clouds,  colloidal  aggregates,  patterns  of  fracture,  dielectric  breakdown, 
porosity  of  soil  and  branching  of  roots  [2].  In  the  temporal  regime,  we  have  I//  noise: 
S(f)  ~  I//,  where  /  is  the  frequency  and  S(f)  is  the  power.  (This  is  also  known  as 
"flicker  noise"  in  astronomy,  in  the  context  of  light  from  quasars  for  instance  [3]).  Such 
"noise"  (or  from  some  viewpoint  "signal")  is  characterized  by  variations  on  all' 
timescales,  and  appears  in  events  as  varied  as  resistance  fluctuations,  sand  flow  in  an 
hourglass  and  even  traffic  and  stock  market  movements  [3].  The  generality  and  ubiquity 
of  these  two  phenomena  have  puzzled  scientists  for  a  long  time.  The  study  of  fractality, 
though,  is  mostly  confined  to  the  level  of  characterization  (for  example  plotting  results  of 
experiments  on  log-log  scales  and  associating  the  slope  with  some  kind  of  fractal 
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dimension).  What  theorists  would  really  like  to  do  of  course,  is  to  find  the  dynamical 
origin  of  these  phenomena.  So,  the  success  of  many  of  the  models  suggested  should  be 
judged  with  respect  to  their  ability  to  dynamically  yield  space-time  fractality. 

In  this  article  we  will  focus  on  a  class  of  complex  networks,  proposed  very  recently  by 
Sinha  and  Biswas  [4].  These  models  incorporate  adaptive  dynamics  on  a  lattice  of 
nonlinear  (typically  chaotic)  elements,  and  are  relevant  as  prototypes  for  complex 
nonlinear  self  regulatory  processes  in  physics  and  biology.  We  will  describe  the  wealth  of 
spatiotemporal  structures  these  models  yield,  and  characterize  their  "phases"  and  scaling 
patterns. 

2.  Model 

Here  we  describe  the  essentials  of  the  proposed  class  of  models,  which  falls  under  the 
general  category  of  lattice  dynamical  systems.  First,  we  focus  on  the  one  dimensional 
unidirectional  model  where  time  is  discrete,  labelled  by  n,  space  is  discrete,  labelled  by  /', 
i  =  I,//,  where  N  is  system  size,  and  the  state  variable  xn(i)  (which  in  physical  systems 
could  be  quantities  like  energy,  velocity,  pressure  or  concentration)  is  continuous.  Each 
individual  site  in  the  lattice  evolves  under  a  suitable  nonlinear  map/(z).  For  instance,  the 
local  map/(;t)  can  be  chosen  to  be  the  logistic  map,  which  has  widespread  relevance  as  a 
prototype  of  low-dimensional  chaos,  i.e. 

/(jc)  =  1  —  ax2, 

x  G  [—1.0,  1.0],  with  the  nonlinearity  parameter  a  chosen  in  the  chaotic  regime,  for 
instance  a  =  2.0  in  all  numerical  experiments  in  [4].  Another  possible  choice  in  the  local 
dynamics  is  the  circle  map  which  is  relevant  in  systems  involving  nonlinear  oscillatory 
behaviour  [6,  7].  This  is  given  by 

K 

f(x]  =  x  —  — -sin(27r;c) 
2?r 

with  K  determining  the  strength  of  the  nonlinearity. 

Now,  on  this  nonlinear  lattice  a  self  regulatory  threshold  dynamics  is  incorporated.  The 
adaptive  mechanism  is  triggered  when  a  site  in  the  lattice  exceeds  the  critical  value  xc  i.e., 
when  a  certain  site  xn(i]  >  xc.  The  supercritical  site  then  relaxes  (or  "topples")  by 
transporting  its  excess  8x  =  (xn(i)  ~xc)  to  its  neighbour  as  follows: 

xn(i)  -+xc 
xn(i+l)-^xn(i+l}  +  8x.  (1) 

This  algorithm  thus  induces  a  unidirectional  nonlinear  transport  down  the  array  (by 
initiating  a  domino  effect).  The  boundary  is  open  so  that  the  "excess"  may  be  transported 
out  of  the  system.  Note  that  the  adaptive  ("toppling")  mechanism  in  the  model  is  locally 
conservative,  whereas  the  intrinsic  dynamics  of  the  elements  is  dissipative.  This  kind  of 
threshold  mechanism  imposed  on  local  chaos,  makes  the  above  scenario  especially 
relevant  for  certain  mechanical  systems  like  chains  of  nonlinear  springs,  as  also  for  some 
biological  systems,  such  as  synapses  of  nerve  tissue  (note  that  individual  neurons  display 
complex  chaotic  behaviour  and  have  step  function-like  reponses  to  stimuli). 


algorithm  [8],  or  the  "sandpile"  model,  which  gives  rise  to  self  organized  criticality 
(SOC).  This  model  is  however  significantly  different,  the  most  important  difference  being 
that  the  self  regulatory  mechanism  now  occurs  on  a  nonlinear  "substrate" ,  i.e.  there  is  an 
"intrinsic"  or  "internal"  deterministic  dynamics  at  each  site.  Further,  the  state  variable 
analogous  to  the  integer  "height"  variable  z  in  the  sandpile  model  is  continuous  here.  All 
this  accounts  for  enhanced  complexity  and  this  system  is  thus  capable  of  exhibiting  a 
wider  repertoire  of  dynamics.  So,  unlike  1D-SOC,  the  one  dimensional  model  here  is  not 
trivial  and  can  give  rise  to  many  interesting  features,  including  a  host  of  dynamical 
phases. 

The  dynamics  depends  on  the  algorithm  for  autonomously  updating  each  site  and 
propagating  threshold  coupling  between  sites.  The  first  possible  case' is  one  where  these 
two  evolutionary  steps  are  carried  out  separately.  Here  the  system  is  allowed  to  relax 
completely  before  each  chaotic  update.  One  self-regulatory  update  of  all  existing 
supercritical  sites  in  the  lattice,  performed  according  to  (1)  constitutes  one  relaxation  time 
step.  This  relaxation  step  may  in  turn  create  some  other  supercritical  sites.  After  A  such 
relaxation  steps,  the  system  undergoes  the  next  chaotic  update.  In  some  sense  then,  time  n 
associated  with  the  chaotic  dynamics  is  measured  in  units  of  A.  When  A  is  large  enough 
(A><9(AQ)  as  in  [4],  the  adpative  dynamics  begins  after  each  step  in  the  site  dynamics 
and  continues  till  the  system  has  reached  a  steady  state  where  all  sites  are  less  than 
critical,  i.e.  all  x(i)  <  xc,  and  the  system  is  stationary,  after  which  the  next  step  in  the  site 
dynamics  takes  place.  So  the  time  scales  of  the  two  dynamics,  the  intrinsic  chaotic 
dynamics  of  each  lattice  site  and  the  adaptive  relaxation,  are  adiabatically  separable.  The 
relaxation  mechanism  is  much  faster  than  the  chaotic  evolution,  and  this  enables  the 
system  to  relax  completely  before  the  next  chaotic  iteration.  The  complete  dynamical 
picture  then  is  as  follows:  after  every  time  step  in  the  chaotic  evolution,  n,  the  system 
undergoes  a  self  regulatory  relaxation  leading  to  stable  "under-critical"  configurations 
{xn(i}},  after  which  the  next  chaotic  iteration  of  the  lattice  takes  place,  governed  by  the 
nonlinear  evolution  mapping:  xa+i  (r)  =f(xn(i)).  One  can  also  consider  the  local  chaos  in 
the  maps  to  be  an  "intrinsic"  or  "internal"  perturbation  (as  opposed  to  "external" 
perturbation  in  the  "sandpile"  model,  which  is  effected  by  dropping  "sand"  from 
outside).  In  this  sense  then,  the  system  above  (as  its  allowed  to  relax  fully  between 
chaotic  updates)  operates  in  the  "dilute"  perturbation  limit. 

A  random  driving  force  can  also  be  introduced  in  the  model.  Under  this,  the  system  is 
perturbed  at  some  site;  in  the  lattice:  xn(j]  -» xn(j)  +  a  where  a  is  the  strength  of  the 
perturbation  and  j  is  chosen  at  random.  The  spatial  extent  of  the  perturbation  can  be 
gauged  by  the  number  of  such  random  sites  affected  at  any  given  update  (which  we 
denote  by  ATr).  The  random  driving  force  is  operative  at  time  scales  comparable  to  the 
chaotic  dynamics.  When  this  is  much  slower  than  the  adaptive  dynamics,  as  in  [4],  the 
scenario  is  similar  to  the  SOC  algorithm,  where  the  driving  force  (perturbation)  is  very 
dilute,  and  the  system  is  allowed  to  relax  completely  before  the  next  pertubation  and  time 
is  usually  measured  in  units  of  the  perturbing  force  (for  example,  in  units  of  grains  of 
"sand"  added  in  the  sandpile  model  of  SOC)  and  the  configurations  studied  are  the 
relaxed  configurations  after  each  perturbation  step.  This  is  quite  different  from  the 
coupled  map  lattice  (CML)  dynamics  [9],  where  the  coupling  is  incorporated  in  the  map 
evolution  step. 
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At  the  other  end  of  the  spectrum  is  the  limit  of  very  small  A — which  is  practically 
analogous  to  CML.  Here  the  local  nonlinear  dynamics  and  chaotic  updates  take  place 
almost  simultaneously.  It  is  evident  that  lowering  A  essentially  allows  us  to  move  from  a 
picture  of  adiabatically  separated  adaptive  processes  to  one  where  adaptive  processes 
overlap,  and  disturbances  do  not  die  before  the  subsequent  chaotic  update,  i.e.  the  tails  of 
the  relaxation  processes  carry  over  to  the  next  event  and  there  are  "interference"  effects. 
The  effect  of  relaxation  time  on  dynamical  characteristics  was  investigated  in  detail  in  [7]. 

The  relevant  parameters  in  the  model  are  the  local  nonlinearity  parameter,  (for 
instance,  a  in  the  case  of  logistic  maps,  and  K,  in  the  case  of  circle  maps),  strength  of 
perturbation,  a  and  the  relaxation  time  A.  The  simulations  are  done  with  random  initial 
conditions  for  the  x(i)  and  all  transients  are  allowed  to  die.  Two  of  the  most  interesting 
features  to  be  examined  are  (i)  the  temporal  evolution  of  local  quantities,  such  as  the 
individual  sites  xn(i],  and  (ii)  the  temporal  evolution  of  globally  defined  quantities,  such 
as  "avalanches",  which  are  defined  as  the  total  number  of  "active"  sites,  i.e.  sites  that 
have  "toppled"  during  the  adaptive  relaxation,  denoted  by  s.  The  spatial  aspects  of 
interest  are  the  distribution  of  x(i)  and  the  presence  of  clustering  and  coherence  in  space 
as  indicated  by  the  cluster  distribution  at  any  point  in  time  (or  snapshot).  Besides  these 
generic  spatio-temporal  measures,  some  other  quantities  may  be  of  interest  in  certain 
contexts.  For  instance,  the  dynamics  and  distribution  of  the  "drop  rate",  i.e.  the  "excess" 
transported  out  of  the  lattice  edge  during  relaxation,  may  be  significant  in  some  physical 
situations  (such  as  in  phenomena  reminiscent  of  the  "dripping  faucet"  scenario). 

Note  that  xc  is  the  most  significant  parameter  in  the  system.  For  instance,  for  the  case 
of  the  local  logistic  map  (f(x}  =  l—  2.x2),  where  — 1.0  <jtc  <  1.0,  numerical 
simulations  show  the  presence  of  the  many  "phases"  in  xc  space  [4].  For  example,  for 
xc  <  0.5  the  dynamics  goes  to  a  fixed  point.  When  0.5  <  xc  <  1.0,  the  dynamics  is 
attracted  to  cycles  whose  periodicities  depend  on  the  value  of  xc.  By  tuning  xc  one  thus 
obtains  cycles  of  varying  orders.  So  in  a  sense,  this  model  (by  variation  of  a  single 
parameter)  provides  a  wide  repertoire  of  dynamics  for  lattice  dynamical  systems,  just  as 
the  logistic  map  does  for  one  dimensional  systems  (by  variation  of  the  nonlinearity 
parameter).  Thus  it  may  be  used  as  a  paradigm  for  complex  behaviour  in  extended 
systems. 

Note  another  important  feature  here.  If  the  same  adaptive  dynamics  was  imposed  on  a 
random  lattice,  that  is  a  bunch  of  noisy  elements,  we  would  not  have  recovered  any  of  the 
above  mentioned  phases.  For  these  phases  to  occur  we  crucially  require  a  deterministic 
dynamics,  even  if  entirely  chaotic.  The  difference  between  stochastic  noise  and 
deterministic  chaos  is  of  vital  importance  here. 

We  can  also  introduce  the  bidirectional  model.  This  is  given  by  the  following 
algorithm:  if  xn(i)  >  xc,  the  supercritical  site  then  relaxes  to  xc  by  transporting  the  excess 
(jcn(i)  —  xc)  equally  to  its  two  neighbours: 

xn(i)->xc 

Xn(i  +  1)  -»  Xn(i  +  1)  +  &C 

xn(i-l)->xn(i-l)+6x  (2) 

where  Sx  —  (xn(i)  —  xc)/2.  Now  disturbances  in  the  course  of  self-regulation,  can  spread 
like  "ripples"  and  "refract".  The  dynamics  of  the  bidirectional  model  looks,  by  and 
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large,  like  a  "fuzzier"  or  "noisier"  version  of  its  unidirectional  counterpart.  The  details 
will  be  discussed  in  the  following  sections. 

Before  we  move  on  to  a  specific  example  in  §  3,  we  will  write  down  the  generalized 
model  in  d  dimensions  with  m  state  variables.  Let  each  state  variable  defined  at  a  certain 
site  in  the  lattice  be  denoted  as  xj(i},  where  i  is  a  d  dimensional  vector  representing  the 
coordinates  of  the  site  in  the  d  dimensional  lattice,  and  jf  represents  a  particular  state 
variable,  taking  values  from  1  to  m.  If  a  certain  state  variable,  say  jc-'(i)  at  a  certain  site  i 
in  the  d  lattice  exceeds  the  critical  value,  x{,  then  the  excess  is  transported  equally  to  its 
neighbours.  (Note  that  in  general  we  may  have  different  critical  values  for  different  state 
variables,  and  these  are  given  by  m  numbers,  x{,  j  =  1,  2,  .  .  .  ,  m  ).  Also  note  that  in  d 
dimensions  the  number  of  neighbours  (assuming  isotropic  transport)  is  Id.  Given  these 
the  adaptive  dynamics  then  is 


6x,  (3) 

where  the  quantity  Sx  gives  the  excess,  which  is  equally  distributed  among  the  Id 
neighbours:  6x  =  (xj(i)  —  x{}/2d.  The  vector  6  is  d-dimensional,  comprising  of  only  one 
nonzero  element  (=  1  or  —  1)  and  it  gives  the  coordinates  of  the  nearest  neighbours  of  the 
critical  site  to  which  the  excess  is  transported. 

Some  other  variations  to  the  model  are  also  possible.  For  instance,  we  can  investigate 
the  effect  of  changing  boundary  conditions.  In  particular,  we  can  incorporate  the  adaptive 
dynamics  on  a  lattice  with  periodic  boundary  conditions,  that  is  on  a  ring  of  nonlinear 
elements.  Further,  for  instance,  one  can  implement  the  adaptive  dynamics  (or  transport) 
on  a  hierarchical  lattice,  in  order  to  evoke  scaling  phenomena  reminiscent  of  energy 
cascade  in  turbulent  media.  In  general  then,  several  variations  of  this  class  of  models  can 
be  invoked  in  order  to  tailor  the  generalities  to  suit  the  specific  physics  at  hand  and 
account  for  particular  phenomena. 

In  this  article  we  will  focus  on  one  representative  example,  namely,  adaptive  dynamics 
on  logistic  maps.  We  give  a  detailed  description  of  the  system  and  its  emergent 
phenomena  in  the  section  below. 


3.  Adaptive  dynamics  on  logistic  maps 

In  this  model  the  local  on-site  dynamics  is  given  by  logistic  maps.  The  specific  form  of 
the  logistic  map  used  (without  any  loss  of  generality)  is:  f(x)  =  1  -  ax2,  x  e  [-1,1], 
operating  in  the  chaotic  regime,  i.e.,  a  =  1.  We  deal  with  the  case  of  A  large,  which 
allows  complete  relaxation  of  the  active  sites  before  the  subsequent  chaotic  update.  First, 
we  describe  the  unidirectional  case,  with  and  without  external  noise,  and  then  describe 
the  bidirectional  model. 


3.1  Unidirectional  model:  Phenomenology 

As  noted  in  §2,  the  most  significant  parameter  in  the  system  is  the  critical  xc 
(*c  e  (-1, 1)),  and  by  tuning  xc  one  obtains  the  following  phases:  The  first  phase  is  the 


fixed  point  region  which  occurs  when  xc  <  0.5.  Here  the  system  goes  to  a  coherent  state 
where  all  sites  xn(i)  =xc  for  all  times  (after  a  short  transience).  This  spatiotemporal 
invariance  is  achieved  for  all  perturbation  strengths  and  system  sizes.  The  size  of  the 
avalanches  is  always  equal  to  N(N  +  l)/2  and  the  duration  of  the  avalanches  equal  to  N 
[4].  In  this  parameter  regime,  then,  the  adaptive  mechanism  suppresses  the  underlying 
chaos  in  the  lattice  and  yields  spatiotemporal  regularity. 

The  model  (in  the  range  xc  <  0.5)  may  then  be  considered  as  a  tool  for  "controlling" 
[10]  an  ensemble  of  chaotic  elements,  as  it  can  very  effectively  force  the  system  to  a 
temporally  invariant  and  spatially  coherent  state:  xn(i)  =  xc,  i  =  1, . . . , N  for  all  n.  Note 
that  this  spatiotemporal  control  is  robust  with  respect  to  system  size  and  perturbation 
strengths.  Couched  in  the  language  of  control  theory,  we  can  consider  xc  to  be  the 
"desired  state"  of  the  system  [10].  The  "error  signal"  is  then  determined  by  the 
difference  between  the  existing  state  and  the  desired  state.  A  positive  error  signal  triggers 
off  the  self  regulatory  feedback  mechanism  which  drives  the  system  to  adaptively  control 
its  dynamics  back  to  the  desired  state  (when  the  error  is  zero). 

When  xc  —  0.5  we  still  have  a  coherent  state  with  all  x(i)  =  xc.  When  there  is  no  noise 
the  avalanches  are  of  size  zero  as  there  are  never  any  active  sites  in  the  lattice,  as  x  =  0.5 
is  a  fixed  point  of  the  map  f(x).  Under  a  random  driving  force  (Nr  ^  0  and  a  ^  0)  the 
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Figure  la.     (Continued) 

Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


293 


12 


10 


•s     8 


(b) 


0.99 


I 
0.992 


0.994 


,   _  I 
0.996 


I 
0.998 


Figure  1.  Order  p  of  the  cycle  obtained  at  various  values  of  the  parameter  xc,  in  a 
unidirectional  adaptive  model  on  a  lattice  of  logistic  maps  (cr  =  0),  depicted  for  the 
ranges:  (a)  xc  e  [0.5, 1.0]  (k  up  to  20  is  shown)  and  (b)  xc  €  [0.99, 0.9999]  (k  up  to  12 
is  shown). 

avalanches  are  triggered  off,  and  they  have  a  random  distribution  of  sizes,  as  the 
avalanche  size  is  simply  determined  by  the  distances  of  the  perturbed  sites  from  the  edge 
of  the  lattice,  and  from  each  other,  and  these  are  statistically  random  quantities. 

When  0.5  <  xc  <  1.0,  the  temporal  evolution  of  the  lattice  sites  (and  avalanches)  is 
attracted  to  a  cycle  whose  periodicity  (order)  p  depends  on  xc  [4] .  For  example,  by  tuning 
xc  one  obtains  the  following  dynamical  phases:  for  0.5  <  xc  <  0.809..,  we  get  2-cycles, 
for  0.809..  <  jcc  <  0.85  we  get  4-cycles,  for  xc  ~  0.86  we  get  6-cycles,  for  xc  ~  0.88  we 
get  7-cycles,  for  :cc  ~  0.9  we  get  10-cycles  and  for  xc  ~  0.98  we  get  4-cycles,  and  so 
forth.  When  there  is  no  external  perturbing  force  (a  =  0)  these  cycles  are  exact .  From 
figure  1  (which  gives  the  order  of  the  cycle,  p,  supported  at  various  values  of  xc~)  it  is 
clearly  evident  that  the  system  yields  a  rich  repertoire  of  cyclic  patterns.  Note  here  that  it 
is  possible  to  obtain  a  detailed  and  rigorous  analytical  picture  of  the  spatial  and  temporal 
characteristics  of  the  various  cycles:  see  [5].  This  analysis  [5]  provides  a  dynamical  basis 
for  the  various  phases  found  in  this  model.  Further,  the  analysis  also  gives  exact  scaling 
relations  for  the  position  and  width  of  periodic  windows  in  xc  space.  Due  to  constraints  of 
space,  however,  we  will  not  touch  upon  those  analytical  results  in  this  article,  but 
concentrate  instead  on  the  numerical  results  found  principally  in  [4]. 
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Temporally,  then,  we  obtain  cycles  of  varying  orders.  Now,  spatially,  we  would  like  to 
discern  the  degree  of  "coherence"  in  the  spatial  profile  at  any  instant  of  time  (or 
snapshot)  by  examining  the  spatial  patterns  in xn(i),i  =  1, 2, . ..  ,N.  For  instance,  we  can 
quantify  spatial  clustering  in  the  landscape  by  defining  the  size  c  of  a  spatial  cluster  as  the 
number  of  consecutive  sites  for  which  the  inequality  \xn(i  +  1)  —  xn(i)\  <  e  holds,  where 
e  is  a  small  number.  (The  results  are  not  very  sensitive  on  the  choice  of  e).  Very  large 
cluster  sizes,  i.e.  c  ~  O(N),  indicate  a  smooth,  almost  coherent,  profile,  whereas  profiles 
with  maximum  cluster  size  cmax  ~  10  reflects  a  rough  profile. 

It  is  observed  that  for  values  of  xc  supporting  low  order  periodicities,  there  often  exists 
one  very  large  cluster.  In  addition  to  this  large  (isolated)  cluster,  there  exist  many  smaller 
clusters  which  are  distributed  approximately  as  a  power  law.  For  instance,  for  the  case  of 
xc  =  0.98  which  supports  a  4-cycle,  we  usually  find  one  large  cluster,  cmax  ~  Af/2,  and  a 
host  of  smaller  clusters  (see  figure  2).  The  coarse  grained  distribution  of  the  small  cluster 
sizes,  P(c},  appears  to  be  a  power  law:  P(c)  ~  c~^c,  with  exponent  <£c  depending  on  the 
specifics  of  the  model  [11].  On  addition  of  noise  the  spatial  profile  gets  more  uneven,  as 
the  clusters  break  into  smaller  clusters.  For  instance,  for  xc  =  0.98,  a  =  0.2,  cmax  ~  N/4, 
that  is  the  largest  cluster  is  around  half  as  large  as  in  the  a  =  0  case. 

For  larger  xc  supporting  cycles  of  high  order  (for  example,  xc  =  0.99)  we  find  that 
the  landscape  consists  of  several  small  clusters  of  size  c,  occuring  over  a  range  of  scales, 
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Figure  2.  A  plot  of  state  variable  x(i)  vs  site  index  i,  in  a  unidirectional  adaptive 
model  on  a  lattice  of  logistic  maps,  at  xc  =  0.98,  a  =  0,  for  a  lattice  of  size  JV  =  5000. 
In  (a)  we  depict  the  entire  lattice:  i  =  1 , . . . ,  5000.  Note  the  existence  of  one  very  large 
cluster  (all  sites  with  2648  <  i  <  5000  have  jc(z')  =  0.98).  In  (b)  we  depict  a  blown  up 
version  of  the  section  of  the  lattice  with  1  <  i  <  500.  Note  the  range  of  cluster  sizes. 


up  to  cmax  ~  10.  The  profile  is  now  very  rough  (see  figure  3).  Also,  note  that  the  addition 
of  noise  now  does  not  appreciably  alter  the  picture.  The  distribution  P(c)  follows  a  power 
law:  jP(c)  ~  c~^c,  where  the  exponent  0C  depends  on  the  particulars  of  the  model. 


3.2  Unidirectional  dynamics  under  weak  noise 

Very  interesting  phenomena  emerge  on  the  addition  of  weak  noise.  When  noise  is  very 
weak  (i.e.,  a  is  small,  and  number  of  affected  sites,  NT  is  small,  say  NT/N  <  0.1)  then  the 
dynamics  remains  regular,  especially  for  low  xc  values  supporting  low  order  cycles.  On 
adding  moderate  noise,  cr  ~  0.01,  and  NT/N  ~  0.1,  we  get  interesting  results  for  certain 
xc  values:  namely  large  xc  with  a  reasonably  wide  window  of  low  periodicity  associated 
with  it.  At  these  parameters  the  system  yields  a  well  defined  Iff  spectrum:  S(f)  ~  l//^, 
4>  ~  1.0.  So  the  time  profile  now  has  variations  on  all  scales,  with  interesting  "cyclic" 
looking  broad  trends.  Figure  4  displays  a  representative  example  of  xc  =  0.98,  a  =  0.01, 
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Figure  3.  A  plot  of  state  variable  x(i)  vs  site  index  i,  in  a  unidirectional  adaptive 
model  on  a  lattice  of  logistic  maps,  at  xc  —  0.99,  a  =  0,  for  a  lattice  of  size  N  =  1000. 
Note  the  rough  looking  profile. 

that  there  is  a  large  peak  in  the  spectrum  at/  =  |,  which  is  the  period  associated  with  xc 
when  a  ~  0. 

This  kind  of  temporal  scale  invariance  is  reflected  in  the  distribution  of  avalanche  size 
s,  which  falls  as  a  power  law:  P(s)  ~  s~^'.  It  is  also  observed  that  P(j)  is  considerably 
peaked  at  large  values  of  s  (s  ~  O(N}}.  This  hump  at  large  s  reflects  the  very  large  events, 
which  occur  approximately  periodically,  with  the  period  given  by  the  a  =  0  model. 
Figure  5  gives  the  P(s)  vs  s  curve  for  the  representative  case  of  xc  =  0.9.  The  exact  value 
of  the  exponent  fa  depends  on  the  system  parameters,  and  usually  has  values  close  to  1.0 
(here  for  xc  =  0.9,  fa  ~  1.2  ±  0.2). 

A  further  quantity  of  interest  would  be  the  sizes  of  active  clusters,  k,  at  any  given 
instant  of  time  (or  snapshot)  after  the  relaxation  has  taken  place.  This  is  determined  by 
the  number  of  consecutive  sites  that  move  (or  become  active)  during  the  course  of  an 
adaptive  process.  This  number  can  vary  from  1  to  N. 

Numerical  experiments  indicate  that  the  distribution  P(k)  follows  a  power  law: 
P(k)  ~  k?k .  The  exponent  fa  depends  on  the  particulars  of  the  system  and  varies  in  the 
range  ~  1.0  to  2.0.  For  instance,  from  figure  6  (which  shows  a  representative  example  of 
xc  =  0.88,  with  P(k)  obtained  by  averaging  over  2  x  104  realizations)  it  is  clearly  evident 
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Figure  4.  Power  spectrum  of  avalanches  in  a  unidirectional  adaptive  model  on  a 
lattice  of  logistic  maps,  with  xc  =  0.98,  a  —  0.01,  and  NT  —  1,  N  =  20.  We  have 
averaged  over  8  time  runs  of  2048  each.  The  abcissa  has  In  /,  where/  is  the  frequency 
(/  €  (0,0.5]),  and  the  ordinate  has  lnS(/),  where  S(f)  is  the  power. 


that  there  exists  well  defined  scaling,  with  the  exponent  4>k  ~  1-1-  This  scaling  reflects 
the  fact  that  responses  to  perturbation  (or  energy  dissipation)  in  this  model  takes  place 
over  a  range  of  length  scales,  starting  from  small  local  rearrangements  to  large  events 
where  the  transport  activity  is  at  a  wider  scale  involving  all  the  elements  of  the  lattice. 
For  large  values  of  xc  supporting  very  high  order  cycles,  the  spectra  is  noisy  and  flat 
(•$"(/)  ~  l//°)  even  f°r  very  weak  noise.  Also,  the  distribution  of  avalanche  size  s  is  a 
gaussian.  So  the  system  now  behaves  more  like  a  bunch  of  random  elements. 


3.3  Bidirectional  model:  Phenomenology 

Bidirectional  transport  mimics  its  unidirectional  counterpart  closely,  but  seems  to  be  a 
noisier  or  "fuzzier"  version  of  it.  Thus  we  find  that  for  xc  <  0.5  all  the  sites  are  randomly 
distributed  with  a  small  deviation  around  the  mean  which  is  almost  equal  to  xc.  The 
avalanches  are  all  of  the  maximum  possible  size  smax  =  A  x  N.  When  0.5  <  xc  <;0.8  the 
sites  get  attracted  to  noisy  2-cycles,  as  is  evident  through  the  power  spectrum  of  the 
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Figure  5.  Plot  of  the  distribution  of  avalanche  sizes:  lnP(s)  vs  Ins,  in  a 
unidirectional  adaptive  model  on  a  lattice  of  logistic  maps,  for  N  =  1000,  xc  =  0.9, 
and  a  =  0.1.  We  sample  2  x  104  configurations. 

temporal  evolution  of  the  individual  sites  which  exhibit  a  large  (but  not  clean)  peak  at 
1/2.  The  avalanches  evolve  as  exact  2  cycles,  alternating  between  0  and  smax-  Further,  as 
we  tune  xc  we  obtain  a  noisy  4-cycle,  and  so  forth. 

As  xc  gets  larger  we  have  a  rather  complex  scenario  with  a  host  of  scaling  properties 
emerging  in  space  and  time.  Firstly,  for  certain  :cc  values,  namely  large  xc  with  a 
reasonably  wide  window  of  low  periodicity  associated  with  it,  we  observe  a  time  profile 
having  variations  over  a  wide  range  of  scales,  giving  rise  to  a  well  defined  I//  spectrum. 
For  instance,  in  the  example  shown  in  figure  7,  the  system  (xc  =  0.98)  displays  a 
prominent  low  frequency  spectrum,  with  0  ~  1.54  ±  0.06.  Note  also,  the  existence  of  a 
sharp  peak  at  /  =  i,  which  is  the  frequency  of  the  exact  cycle  associated  with  the 
unidirectional  a  =  0  system.  (It  is  then  evident  from  comparing  figures  4  and  7,  that  the 
spectrum  for  the  bidirectional  model  is  strikingly  similar  to  that  obtained  from 
unidirectional  dynamics  under  weak  noise). 

The  active  clusters,  typically,  now  come  in  all  sizes,  indicating  again  that  activity  is 
taking  place  on  several  scales,  starting  from  small  rearrangements  to  large  events 
sweeping  through  the  lattice.  The  distribution  of  active  cluster  sizes  P(k),  displays  a 
clearly  defined  power  law:  P(k)  ~  k~^k  .  The  exponent  depends  on  the  particular  system 
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Figure  6.  Plot  of  the  distribution  of  active  clusters:  In  P(k)  vs  In  k,  in  a  unidirectional 
adaptive  model  on  a  lattice  of  logistic  maps,  for  xc  =  0.88,  a  =  0.2,  N  =  500.  We 
have  averaged  over  2  x  104  configurations.  Notice  the  enhanced  probability  of  finding 
large  active  clusters,  as  evident  from  the  rising  P(k)  around  fcmax  ~  N/5. 

we  examine.  For  instance,  for  xc  =  0.9  and  a  in  the  range  0.0  to  0.2,  we  obtain  fa  ~  4.0, 
and  for  xc  =  0.98,  for  noise  levels  a  =  0.0  to  0.2,  we  obtain  fa  ~  2.0.  This  scaling 
reflects  the  fact  that  responses  to  perturbation  (or  energy  dissipation)  in  this  model  takes 
place  over  a  range  of  length  scales. 

A  further  quantity  of  interest  would  again  be  the  distribution  of  avalanches.  Here  we 
focus  only  on  the  coarse  grained  distribution,  as  the  range  of  values  of  s,  from  0  to 
Smax  =  A  x  N,  is  rather  large.  First,  it  is  observed  that  there  is  a  considerable  hump  at 
large  values  of  s  (s  -+  5max).  This  is  related  to  the  fact  that  there  are  large  events  which 
occur  approximately  periodically  (with  the  same  period  as  that  of  the  system's 
unidirectional  counterpart).  For  instance,  for  xc  =  0.98,  approximately  one  in  every  four 
avalanches  is  very  large.  (Such  avalanches  actually  take  infinitely  long  to  settle  down,  and 
so  their  size  is  bound  only  by  finite  A,  i.e.  s  =  smax_  =  A  x  N.)  For  small  sizes,  it  was 
found  that  the  distribution  falls  as  a  power  law:  P(s)  ~  s~&.  While  this  power-law  trend 
persists  for  weak  noise,  under  addition  of  moderate  noise  the  system  often  yields  an 
exponentially  falling  distribution:  P(s)  ~  e~***.  Further,  very  high  xc  (xc  ->  1.0)  also 
yields  such  exponentially  decaying  P(s). 
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Figure  7.  Power  spectrum  of  avalanches  in  a  bidirectional  adaptive  model  on  a 
lattice  of  logistic  maps,  for  xc  =  0.98,  N  =  20,  and  a  =  0.01, Nr  =  3.  We  average  over 
8  time  runs  of  2048  each.  The  abcissa  has  In  /,  where  /  is  frequency  (/  6  (0, 0.5]), 
and  the  ordinate  has  lnS(/),  where  S(f)  is  the  power. 

Spatially,  we  obtain  power  law-like  distributions  for  spatial  cluster  sizes  c.  We  find  a 
host  of  small  clusters  up  to  a  size  of  cmax  ~  N/25.  Note  that  the  addition  of  noise  does  not 
appreciably  alter  the  roughness  of  the  profile.  Note,  also,  that  unlike  its  unidirectional 
counterpart,  there  are  no  large  clusters  here.  The  coarse  grained  distribution  of  the  small 
cluster  sizes  P(c]  appears  to  be  a  power  law,  with  exponent  0C  depending  on  the  specifics 
of  the  model. 


4.  Summary 

In  this  article,  we  have  described  a  novel  class  of  models  incorporating  adaptive  dynamics 
on  a  lattice  of  nonlinear  (typically  chaotic)  elements.  We  demonstrate  the  wide  repertoire 
of  dynamical  patterns  these  systems  are  capable  of  supporting.  The  spectrum  of  emergent 
spatio-temporal  phenomena  ranges  from  spatio-temporal  fixed  points  and  cycles,  to 
parameter  regimes  marked  by  scaling  properties,  as  manifest  in  clearly  defined  I// 
spectral  characteristics  and  power  law  distributions  of  spatial  quantities.  The  capacity  of 
these  models  to  yield  wide  ranging  dynamical  patterns  by  tuning  just  one  parameter, 


Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


301 


Sudeshna  Sinha 

underscores  their  utility  as  versatile  paradigms  of  spatio-temporal  complexity  in  extended 
systems. 
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Control  of  spatiotemporal  chaos:  A  study  with  an 
autocatalytic  reaction-diffusion  system 
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Abstract.  The  characterization  of  chaotic  spatiotemporal  dynamics  has  been  studied  for  a 
representative  nonlinear  autocatalytic  reaction  mechanism  coupled  with  diffusion.  This  has  been 
carried  out  by  an  analysis  of  the  Lyapunov  spectrum  in  spatially  localised  regions.  The  linear 
scaling  relationships  observed  in  the  invariant  measures  as  a  function  of  the  sub-system  size  have 
been  utilized  to  assess  the  controllability,  stability  and  synchronization  properties  of  the  chaotic 
dynamics.  The  dynamical  synchronization  properties  of  this  high-dimensional  system  has  been 
analyzed  using  suitable  Lyapunov  functionals.  The  possibility  of  controlling  spatiotemporal  chaos 
for  relevant  objectives  using  available  noisy  scalar  time-series  data  with  simultaneous  self- 
adaptation  of  the  control  parameter(s)  has  also  been  discussed. 

Keywords.  Spatiotemporal  chaos;  reaction-diffusion  systems;  autocatalysis;  Lyapunov  expo- 
nents; synchronization  and  control;  parametric  self-adaptation. 

PACS  Nos    05.70;  82.20;  47.54;  05.45 


1.  Introduction 

Spatially  extended  nonlinear  systems  are  known  to  exhibit  a  wide  variety  of  complex 
dynamics  ranging  from  stable  stationary  states  to  coherent  travelling  waves,  solitons  and 
spatiotemporal  chaos.  Studies  on  this  topic  have  been  reviewed  by  Cross  and  Hohenberg 
[1]  from  a  global  perspective  and  their  importance  in  many  physical,  chemical,  biological 
and  social  systems  have  been  summarized.  To  cite  a  few  examples,  complex  patterns  have 
been  experimentally  observed  in  hydrodynamic  fluids,  nonlinear  optics,  crystallization  of 
solids,  chemically  reacting  systems,  morphogenesis,  self-replication  of  living  cells,  neural 
systems,  population  dynamics,  and  economics.  Even  though  these  systems  are  high- 
dimensional  due  to  mechanisms  such  as  diffusion,  convection,  etc.,  concurrently 
occurring  in  the  spatial  domain,  a  wide  range  of  complex  phenomena  exhibited  by  these 
systems  may  be  modelled  by  simple  partial  differential  equations  or  coupled  maps. 
Considerable  application  potential  exists  if  it  were  possible  to  devise  strategies  that  would 
regulate  and  control  these  dynamical  patterns  to  exhibit  the  desired  behaviour.  This  is 
especially  true  for  systems  exhibiting  spatiotemporal  chaotic  patterns  which  are  highly 
sensitive  to  the  initial  conditions  and  parameter  settings  [2]  and  their  robust  control  needs 
highly  efficient  and  newer  techniques  of  analysis  to  be  developed. 

Systems  exhibiting  chaotic  behaviour  offer  the  advantage  of  controlling  the  system 
dynamics  in  a  variety  of  dynamical  patterns  with  low  energy  inputs  [3].  Typically,  an 
experimenter  may  wish  to  stabilize  the  spatiotemporally  chaotic  dynamics,  or  switch 
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it  over  to  a  different  dynamical  pattern  (which  may  also  be  chaotic),  using  limited 

available  information  in  the  form  of  scalar  time-series  data  in  the  spatial  domain. 

Possible  control  strategies  in  such  situations  would  require  controllers  to  be  dispersed 

in  the  spatial  domain  with  each  one  perturbing  the  value  of  the  chosen  control 

parameter(s),  or  alternatively,  introducing  appropriate  forcing  in  the  form  of  dynamical 

waveforms  from  outside  the  system.  Recent  studies  using  these  methodologies  [4-7] 

have  shown  that  it  may  be  possible  to  suitably  tune  the  spatiotemporally  chaotic 

dynamics  to  a  stabilized  mode  or  a  periodic  orbit  of  the  homogeneous  system,  devoid 

of  spatial  effects.  A  dense  distribution  of  the  controllers  may  be  necessary  to  account 

for  the  large  number  of  unstable  degrees  of  freedom  in  high-dimensional  systems. 

However,  if  some  information  about  the  basic  mechanism  governing  the  system  dynamics 

is  known,  then  ingenious  strategies  involving'  simple  localized  control  algorithms  may 

be  employed.  For  instance,  the  spatiotemporal  patterns  observed  in  cardiac  tissues  [8], 

reaction-diffusion  models  for  catalysis  [9,10],  and  Ginzburg-Landau  equation  [11-13] 

arise  because  of  unstable  topological  defects  nucleating  spontaneously  from  an  initial 

disordered  state.  In  such  situations  it  has  been  shown  that  the  stabilization  of  even 

one  wave-generating  defect  may  interact  and  control  all  the  others  right  up  to  the 

system  boundary  [13].  In  the  event  when  the  spatiotemporal  system  is  subjected  to 

noise  and  a  periodic  forcing  function,  the  cooperative  effects  that  may  arise  between 

the  inherent  nonlinear  dynamics,  the  external  noise  and  periodic  forces  may  stabilize 

the  system  dynamics  due  to  the  phenomenon  of  stochastic  resonance.  In  fact,  in  a 

recent  experimental  study  with  linearly  coupled  array  of  nonlinear  oscillators,  it  has 

been  shown  that  ordered  periodic  states  can  be  achieved  by  the  optimization  of  noise 

levels  and/or  forcing  frequency  [14, 15].  Thus,  while  working  with  chaotic  systems, 

even  unconventional  methods  may  be  rationally  applicable  for  regulating  the  system 

dynamics. 

The  control  of  nonlinear  chaotic  systems  may  also  be  looked  at  from  a  different 
viewpoint.  Given  a  process  from  which  we  monitor  scalar  time-series  data,  can  we  use  it 
to  drive  a  response  system  so  that  its  dynamics  in  all  the  other  non-monitored  variables 
becomes  synchronized  with  that  of  the  process?  Despite  the  fact  that  the  chaotic 
behaviour  exhibits  rapid  decorrelation  of  nearby  orbits,  it  has  been  observed  that  it  may 
be  possible  to  synchronize  the  response  system  dynamics  to  that  of  the  process  [16]. 
Considerable  attention  has  been  focussed  on  this  topic  in  the  recent  years  and  conditions 
for  obtaining  synchronization  have  been  studied  when  the  response  system  is:  1)  a  replica 
of  the  process  with  incorrect  knowledge  of  the  initial  conditions  and  2)  a  non-replica  of 
the  process  with  unknown  initial  conditions  and  incorrect  parameter  settings.  In  particular 
it  has  been  shown  that  for  synchronization  in  initial  conditions  to  occur,  the  response 
system,  excluding  the  driving  variables  should  possess  only  negative  Lyapunov  exponents 
[17-26].  When  the  response  system  is  a  non-replica  of  the  process,  synchronization  may 
be  possible  by  suitable  perturbations  in  the  response  parameter(s)  (depending  upon  local 
stability  considerations)  [27, 28]  -  much  in  the  same  fashion  as  suggested  by  OGY  for 
controlling  chaos  [29,30]. 

It  may  be  noted  that  the  above  synchronization  and  control  studies  refer  to  systems 
described  by  low-dimensional  chaotic  systems.  On  the  other  hand,  spatio-temporal 
chaotic  systems  may  be  thought  of  as  a  network  of  chaotic  elements  with  a  large  number 
of  positive  Lyapunov  exponents,  and  hence,  for  their  dynamical  synchronization  a  large 
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number  of  driving  signals  may  be  necessary.  Furthermore,  it  would  be  computationally 
difficult  to  characterize  the  system  dynamics  and  clearly  demarcate  the  stable  and 
unstable  directions  associated  with  the  attractor  because  of  the  high  dimensions  involved 
[7,31-33,35,37-39].  It  is  then  worthwhile  to  see  if  simpler  and  practical  ways  of 
analyzing  these  systems  can  be  devised.  A  possible  strategy  may  be  to  first  assess  the 
complexity  and  stability  properties  of  the  high-dimensional  system  in  a  lower 
dimensional  framework.  Our  earlier  studies  on  this  topic  have  shown  that  interesting 
scaling  relationships  in  the  system  invariant  measures  (such  as  Lyapunov  exponents, 
entropy,  etc.)  as  a  function  of  sub-system  size  exists  for  the  spatiotemporally  chaotic 
dynamics  in  nonlinear  reaction-diffusion  systems  [40].  An  analysis  of  these  relationships 
may  yield  valuable  information  about  the  requirements  of  spatial  time-series  data  for  the 
characterization  and  control  of  spatiotemporal  systems. 

In  this  paper,  we  shall  discuss  the  control  of  spatiotemporally  chaotic  dynamics  in 
nonlinear  reaction-diffusion  systems  for  limitations  in  the  knowledge  of  the  system.  The 
reaction-diffusion  system  considered  here  belongs  to  a  fairly  general  class  of  chemically 
reacting  systems  involving  nonlinearity  by  way  of  autocatalytic  steps  [45,  49]  and  in  the 
following  section  we  shall  elaborate  on  the  specifics  of  the  system  studied.  Subsequently, 
in  §  3  we  summarize  the  possible  methodology  to  characterize  the  spatiotemporal  chaotic 
behaviour  in  terms  of  sub-system  invariant  measures  like  Lyapunov  exponents,  entropy, 
etc.  Section  4  studies  the  synchronization  and  control  of  replica  and  non-replica  response 
systems  with  simultaneous  estimation  of  the  response  parameter(s)  and  §  5  briefly 
summarizes  the  scope  of  the  results. 

2.  Model  description 

In  recent  times  considerable  interest  has  been  aroused  in  the  study  of  nonlinear  reaction- 
diffusion  systems  with  the  experimental  observance  of  complex  spatiotemporal  patterns, 
e.g.,  striped,  hexagonal,  travelling  waves,  and  spatiotemporal  chaos  in  a  simple  gel- 
reactor  uncontaminated  by  convection  effects  [41,  44].  Pattern  formation  in  these  systems 
arises  as  a  result  of  spontaneous  symmetry  breaking  transitions  with  intrinsic  length 
scales  dependent  on  the  diffusion  coefficients  and  the  reaction  rate  constants  [42,  43].  An 
interesting  class  of  spatial  patterns  observed  include  spots  which  have  an  uncanny 
resemblance  to  what  is  observed  in  many  biological  systems  in  the  sense  that  they 
replicate,  grow  and  die  [46, 47].  These  replicating  patterns,  do  not  have  the  high  degree  of 
sensitivity  associated  with  the  chaotic  dynamics  and  hence  their  control  is  less  difficult. 
In  fact,  in  an  earlier  study  with  a  simple  autocatalytic  reaction-diffusion  model,  using 
controllers  dispersed  in  the  spatial  domain,  we  have  shown  that  the  control  of  these 
patterns  in  desired  dynamical  states  may  be  possible  [48].  In  the  present  study  we 
consider  a  similar  reaction  network  with  an  additional  mechanistic  step  involving  a  new 
species  D,  i.e.,  a  three-step  parallel  autocatalytic  reaction  mechanism  [49]  with 
competing  interactions  between  the  chemical  species  A,  D,  and  B,  viz., 

A+2B— +3B,     ri=fciCACS,  (I) 

D  +  2B— >3£,      r2  =  k2CDC2B,  (II) 

*— >C,       r3=k3CB,  (IH) 

Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  305 


Nita  Parekh  et  al 
(a) 


Flow  in 


Flow  out 


OO 
Well- Mixed 


0.6  i 


0.4 


0.2  - 


0.0 


(b) 


0.0 


0.4 


0.8 


1.2 


Figure  1.  (a)  Schematic  diagram  of  a  well-mixed  continuous  stirred  tank  reactor 
(CSTR).  (b)  Temporal  chaos  arising  for  the  autocatalytic  reaction  (1)  in  a  CSTR, 
shown  as  a  phase-plane  plot;  a  =1.5;  /5  =  2.93;  Da,  =  18000;  Da2  =  400; 
Da3  =  80. 


and  the  rate  expressions  r/,  z  =  1, 2, 3  for  each  step,  given  alongside.  Nonlinear  feedback 
in  this  mechanism  occurs  due  to  the  rates  of  formation  of  species  B  being  autocatalytic  in 
steps  (I)  and  (II),  while,  in  (III)  the  effect  is  inhibitory.  For  a  continuous  flow  well-mixed 
cell  (shown  in  figure  la),  the  model  description  for  the  above  reaction  takes  the  following 
dimensionless  form 


dr 

d*2 
dt 


=  1  -  Xi  - 


d 


=  1-(1  + 


(1) 


where  Z£-,  i=  1,2, 3,  respectively,  represent  the  dimensionless  species  concentrations  of 
A,  D,  and  B  relative  to  their  concentrations  at  the  inlet  to  the  cell.  Here,  Oafs  are  the 
dimensionless  kinetic  parameters  associated  with  the  reaction  steps  (I-ffl)  and  the 
parameters  a  and  /3  are  the  feed  concentration  ratios  of  species  B  and  D  with  respect  to  A. 
An  analysis  of  the  dynamical  properties  of  this  cell  shows  features  such  as  multi- 
stationarity,  oscillatory  behaviour  and  chaos  in  chosen  parameter  space  of  a,  J3  and  Da^s 
[49].  In  figure  Ib  a  typical  phase  portrait  diagram  of  X2  vs  ^3  depicting  the  chaotic 
dynamical  behaviour  of  (1)  (with  maximum  Lyapunov  exponent,  Amax  ~  1.36)  is  shown. 
On  introducing  a  diffusive  transport  mechanism  for  a  system  of  N  coupled  autocatalators 
(1),  the  system  dynamics  may  be  described  by  the  following  set  of  PDE's,  viz., 


(r,f) 


dt 


=  1  -  X,  (r,  0  - 


r,OX|(r,0+<*i 


dr2 
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/  \ 


Figure  2.  (a)  Schematic  of  N  CSTRs  coupled  bi-directionally  via  diffusion  in  one 
spatial  dimension  with  periodic  boundary  conditions,  (b)  Spatiotemporal  chaos  arising 
for  a  perturbation  given  at  /  =  40.  DI  =  D2  =  1.0;  D3  =  0.01;  a  =  1.5;  /3  =  2.93; 
Da\  =  18000;  Da^  =  400;  Da3  =  80;  [X}  axis-scale:  (0.0,0.07). 


=  /3-X2(r,r)- 


dt 


dr2 


r,0 


dt 


,  t} 


(r,  f) 


(2) 


where  r  =  (x,y,z)  is  the  spatial  vector;  and  dt,  i  =  1, 2, 3  are  the  dimensionless  diffusion 
coefficients  of  the  respective  species  A,  D  and  B,  Autocatalytic  reaction-diffusion  systems 
of  the  type  (2)  are  known  to  possess  a  rich  variety  of  dynamical  patterns  with  properties 
such  as  excitability,  wave  reflection  and  wave  splitting,  propagating  fronts,  etc.  [50,  51]. 
In  this  paper,  all  studies  were  carried  out  for  N  independent  autocatalators  coupled  via 
bi-directional  diffusion  on  a  one-dimensional  spatial  lattice  (schematically  shown  in 
figure  2a).  On  Euler  discretizing  the  Laplacian,  d2/dx2,  in  the  spatial  dimension  x,  viz., 
we  obtain 


dt 


dt 


Da^(L  t}x,^  t] 

llt)-2X2(j,t)+X2(j-  I,?)], 


dt 


=  l-(l+Dfl3)X3(7,f)  + 


1,0  -2X3(7, 


(7,  t)  +  Da2X2(j,  t}]Xl(j,  t] 


(3) 


with  D,-  =  J,-/(Ajc)2,  AA:  being  the  spatial  mesh  size;  j  =  1, 2, . . . ,  AT.  The  number  of 
degrees  of  freedom  is  significantly  increased  to  3N  and  the  characterization  of  the  system 
dynamics  is,  now,  not  a  trivial  task.  In  this  study,  the  diffusion  coefficients  of  the  species 
A  and  D  are  assumed  equal  throughout  and  much  greater  than  that  for  the  species  B,  i.e., 
DI  =  DI  >  D^.  The  unequal  rates  of  diffusion  and  the  local  nonlinearity  may  then  result 
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in  the  formation  of  complex  spatiotemporal  patterns  including  spatiotemporal  chaos 
depending  on  the  parameter  values.  In  figure  2b  is  shown  the  spatiotemporally  chaotic 
dynamics  exhibited  by  (3)  for  N  =  64  and  assuming  periodic  boundary  conditions. 
Initially,  all  the  N  cells  were  assigned  identical  initial  conditions  and  the  parameter  values 
were  chosen  corresponding  to  single  cell  (1)  exhibiting  chaotic  dynamics  (figure  Ib).  The 
extended  system  (3)  was  then  simulated  using  the  fourth  order  Runge-Kutta  algorithm 
with  time  step  A?  =  0.0002.  For  time  t  <  40  (in  dimensionless  units),  the  system 
dynamics  is  seen  to  be  spatially  correlated  though  temporally  it  is  uncorrelated  and 
chaotic.  Note  that  the  diffusion  mechanism  is  not  active  in  this  region  because  of  identical 
initial  conditions  assumed  over  the  entire  spatial  domain,  and  therefore  all  the  cells 
evolve  synchronously.  Now  at  time  t  =  40,  a  perturbation  was  given  to  the  set  of  eleven 
cells  lying  in  the  central  region  of  the  lattice.  This  perturbation  assumed  that  the 
dynamics  of  the  chosen  cells  progressively  went  out  of  phase  with  each  other  by  time 
t  =  0.002.  A  spread  of  this  perturbation  to  the  system  boundaries  by  diffusion,  with  a 
complete  loss  in  the  spatial  correlation,  is  qualitatively  seen  in  figure  2b  and  this 
dynamical  behaviour  was  seen  to  persist  indefinitely.  Characterization  of  this  loss  in 
correlation  and  their  effects,  however,  need  to  be  studied  quantitatively  to  make  more 
meaningful  interpretations.  The  following  section  discusses  a  means  of  doing  so. 

3.  Characterization  of  reaction-diffusion  dynamics 

The  characterization  of  spatiotemporal  dynamics  in  terms  of  the  Lyapunov  exponents 
[52]  which  quantify  the  growth  rate  of  trajectories  for  infmitesimally  small  perturbations 
will  be  discussed  in  this  section.  For  the  spatially  discretized  system  (3)  with  three 
chemical  species  interacting  on  a  one-dimensional  lattice  of  size  N,  there  are  3/V 
dependent  variables.  Hence  the  complete  Lyapunov  spectrum  would  have  37V  exponents 
and  their  calculation  would  be  computationally  very  intensive,  especially  for  large  TV.  To 
alleviate  this  difficulty,  in  an  earlier  study  we  discussed  the  analysis  of  the  Lyapunov 
spectrum  for  sub-systems  of  size  ns(<  TV)  and  studied  the  behaviour  of  the  system 
invariants  as  ns  -»  N  [40].  For  the  sake  of  completeness  in  discussing  issues  related  to  the 
control  of  these  systems,  we  shall  now  review  this  approach,  highlighting  the  major 
results  obtained. 

For  the  extended  system  (3)  with  K  denoting  the  set  of  fixed  parameters,  the  temporal 
evolution  of  the  concentrations  of  each  of  the  species  for  chosen  values  of  £>,-  may  be 
functionally  written  as 

^^  =  MX1(kJ0,...JX3(*,0>XI-(fe±llO,«),  (4) 

where  the  first  index  i  =  1, 2, 3  of  the  functionals  F^  denotes  the  respective  species  and 
the  second  index  fc=l,...,ns  represents  a  sub-system  lattice  site.  The  Lyapunov 
exponents  of  the  sub-system  of  size  ns  may  then  be  calculated  in  a  manner  similar  to  that 
for  the  full  system  [52],  viz., 

AfUtosupIlni^,     /=C..,3ns,      .  (5) 
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wnere  OA.(IO)  rerers  to  tne  mrmitesimai  perturDation  rrom  a  reference  state  at  t  —  to  and 
<5X'(?)  its  evolution  at  a  time  t.  While  calculating  these  exponents  for  (4),  open  boundary 
conditions  were  assumed  at  the  sub-system  boundary  sites,  i.e.,  at  k  =  1  and  k  =  ns.  The 
open  boundary  conditions  indicate  that  only  for  the  purposes  of  evaluating  the  A/'s,  the 
flow  of  information  from  the  outer  site  of  the  sub-system  boundaries,  may  be  likened,  to 
presence  of  noise  in  the  analysis.  Alternatively,  for  the  Id  lattice,  we  may  characterize  the 
sub-system  dynamics  on  excluding  the  outermost  sub-system  boundary  sites,  although 
time-series  signals  at  these  boundary  sites  are  available.  The  effective  sub-system  size  for 
characterization  is  now  rcs  —  2  ,  and  the  invariant  measures  of  (4)  can  be  studied 
accurately. 

The  characterization  of  the  sub-system  dynamics  may  now  be  carried  out  by  applying 
the  well-known  Kaplan  and  Yorke  (KY)  conjecture  [53]  to  the  spectrum  of  sub-system 
Lyapunov  exponents,  Az  .  For  doing  so,  we  define  the  effective  sub-system  Lyapunov 
dimension,  a£  ,  as 


/+l       i=l 

where  /  is  the  largest  integer  for  which  the  sum  of  the  exponents,  Aj  H  ----  +  A,  >  0.  If 
AI  <  0,  then  4  =  0  and  if  /  =  3ns,  then  d^  =  3ns  [54].  Further  reduction  in  the 
computational  effort  is  possible  as  it  may  not  be  necessary  to  evaluate  all  the  3ns  sub- 
system exponents.  A  relationship  in  the  variation  of  d^  with  ns  would  then  help  in 
establishing  the  number  of  effective  variables  required  to  determine  the  long-term 
behaviour  of  the  complete  system  dynamics.  From  a  knowledge  of  of*  we  can  define  the 
sub-system  dimension  density,  p^,  as 

wOO 
,«=^,  (7) 

"S 

by  normalizing  the  sub-system  Lyapunov  dimension  with  its  size  ns.  The  behaviour  of  p® 
as  a  function  of  ns  may  then  importantly  help  in  identifying  a  critical  sub-system  size,  nsc, 
required  for  accurate  characterization  of  the  complete  system  dynamics. 

Another  important  invariant  measure  is  the  Kolmogorov-Sinai  (KS)  entropy  used  for 
quantifying  the  mean  information  production  and  the  growth  of  uncertainty  in  a  system 
subjected  to  small  perturbations  [53].  The  KS  entropy  is  defined  as  the  sum  of  the 
positive  Lyapunov  exponents  A+  [52,  55].  For  regular  predictable  behaviour,  the  KS 
entropy  is  zero  while  for  chaotic  systems  it  takes  a  finite  positive  value  and  tends  to 
infinity  for  a  stochastic  process.  In  the  present  analysis  we  shall  use  this  relationship  for 
evaluating  the  sub-system  KS-entropy,  h®,  as 


and  the  normalized  sub-system  entropy,  (h®),  as 

?/«.•  (9) 


We  now  discuss  the  role  of  these  sub-system  invariants  in  the  characterization  of 
the  full  spatiotemporal  system.  In  figure  3a  is  shown  a  plot  of  the  sub-system  dimension 
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Figure  3.  Behavior  of  the  sub-system  invariants  as  a  function  of  its  size  ns:  (a) 
Lyapunov  dimension,  d^  ,  (b)  entropy,  h^;  (c)  dimension  density  p^;  (d)  normalized 
entropy,  (/i(s)). 


a£  as  a  function  of  its  size  ns.  A  clear  linear  scaling  relationship  in  the  sub-system 
dimension  is  seen  for  increasing  sub-system  size  (figure  3a).  This  suggests  that  it  may 
indeed  be  possible  to  determine  the  effective  dimensionality  of  the  whole  system  from  an 
analysis  of  a  relatively  small  sub-system.  Some  generic  conjectures  based  on 
dimensionality  have  also  been  seen  [34].  A  clear  linear  scaling  relationship  with 
increasing  sub-system  size  was  also  observed  for  the  sub-system  KS  entropy,  h^s\  shown 
in  figure  3b.  The  sub-system  dimension  density,  p(&\  is  however,  seen  to  saturate  to  a 
constant  value  for  ns  greater  than  a  critical  sub-system  size  nsc  (figure  3c).  This 
converging  behaviour  of  p®  for  ns  >  nsc,  may  importantly  help  in  assessing  the  number 
of  degrees  of  freedom  required  for  the  characterization  of  the  whole  system.  The 
saturating  behaviour  observed  for  the  normalized  sub-system  entropy,  (h®),  (figure  3d) 
suggests  that  even  though  the  total  entropy  of  the  sub-system  may  increase  with  its 
size,  the  average  rate  of  information  loss/gain  levels  off  for  ns  >  nsc.  These  results 
suggest  that  sub-system  analysis  may  suffice  for  the  characterization  of  the  full 
system  dynamics.  Similar  scaling  relationships  in  the  system  invariants  as  a  function 
of  sub-system  size  have  also  been  observed  for  coupled  maps  with  near  neighbour 
interactions  indicating  possible  generalizations  [35,  36].  The  knowledge  of  these  sub- 
system properties  may  be  used  in  identifying  favourable  conditions  for  controlling 
the  spatiotemporal  system  and  in  the  following  section  we  shall  further  elaborate  on 
this  aspect. 


310 


Pramana  ~  J,  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


4.  Synchronization  and  control  of  spatiotemporal  chaos 

In  this  section,  we  discuss  the  synchronization  and  control  of  the  spatiotemporally  chaotic 
dynamics  of  a  response  system  driven  by  monitored  time-series  signals  from  a 
spatiotemporal  process  for  various  objectives  like  stabilizing  the  chaotic  behaviour  of  the 
response  system  and  its  ability  to  mimic  the  process  dynamics  under  different  situations 
[16].  In  practice,  there  exists  constraints  in  the  monitoring  of  scalar  time-series  data  from 
the  entire  spatial  domain  of  the  process,  and  moreover,  this  data  may  generally  be  noisy 
due  to  errors  arising  during  measurement.  It  would  be  desirable  to  consider  the  control  of 
the  spatiotemporal  dynamics  of  replica  and  non-replica  response  systems  for  these 
limitations. 


4.1  Synchronization  in  replica  systems 

To  begin  with,  we  first  consider  the  response  system  to  be  a  replica  of  a  spatiotemporally 
chaotic  process,  i.e.,  the  response  and  the  process  systems  are  assumed  to  be  identical  in 
all  respects  except  in  their  initial  conditions,  Xi(j,  0)  ^  Xf(j,  0),  where  X/(y,  0)  denote  the 
response  system  variables.  The  response  system  model  may  then  be  written  in  its 
variables,  Xt(j,t),  as 


(7+1,0-  2#i  (;,  0  +  Xj  (j  -  1  ,  f)], 


+  D2[X2(j+l,t}-2X2(j:t}+X2(j-l,t)l 

=  ^3(#i(7\0,*2(y,0,#3aO) 

=  1  -(1  +  Da,}X3(j,t}  +  oi[ 


+  1,0  -2*3(7,0  +XiU  -1,01,  (10) 

where,  j  —  1  ,  2,  .  .  .  ,  N.  If  the  response  system  (10)  is  now  assumed  to  be  driven  by  scalar 
time-series  signals,  X3(jf,  t),j=  1,2,  ...  ,./V,  from  the  process,  its  dynamics  follow 


and  was  found  to  completely  synchronize  with  the  process  dynamics,  in  all  the  3N 
variables.  This  is  clearly  seen  in  figure  4a  from  the  diminishing  behaviour  of  the  relative 
error  in  the  response  and  process  dynamics,  i.e.,  e'2(j,  t)  =  [X2(j,t]  —X2(j,  t)}/ 
X2(j,t)  —  >  0.  (Similar  synchronized  behaviour  was  observed  in  the  other  two  variables 
also). 
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(a.) 


(b) 


Figure  4.  Synchronization  in  a  replica  system  for  parameter  values  in  figure  2.  The 
relative  space-time  error  behaviour  for  e'2(j,  t)  shown,  (a)  Complete  synchronization 
in  the  spatio-temporally  chaotic  dynamics  of  the  response  system  obtained  on  using 
driving  signals,  Xj,(j, t},  j  =  1, . .  .  ,Af,  from  the  process;  e'2  axis-scale:  (—0.005, 
0.0007).  (b)  On  using  sub-system  driving  signals,  X^k,  t),  k  =  1, . . . ,  ns,  for  ns  =  1 1, 
from  the  central  region  (shown  by  tics),  synchronization  was  observed  only  within  the 
sub-system.  e'2  axis-scale:  (—1.11,6.35). 


We  now  show  that  it  may  be  possible  to  explain  the  observed  synchronization 
behaviour  of  the  combined  response-process  dynamics  using  Lyapunov  stability  analysis. 
The  method  suggests  that  a  given  system  is  stable,  if  any  continuously  derivable  positive 
definite  function,  L,  (called  the  Lyapunov  function)  can  be  defined  along  a  trajectory  with 
the  property  that  its  time  derivative,  L  <  0,  as  t  — >  oo  [56].  This  method  may  be 
advantageously  used  since  it  affords  analytical  reasoning  of  the  stability  properties  of  the 
system.  In  the  present  study,  we  show  by  constructing  suitable  Lyapunov  functionals  in 
terms  of  error  variables,  et(j,  t)  =  X{(j,  t}  -  Xi(j,  t},  that  synchronization  in  the  dynamics 
between  the  response  and  the  process  may  be  assessed.  The  dynamical  equations  for  the 
error  function  variables  e{(j,  t}  are 


(12) 


(13) 


(14) 


+  D2[e2(j-  l,f)  -2*2(7,0  +«2C/+  1,01. 


j,  0  +  Da2e2(j,  t)]Xl(j,  t), 


written  using  (3)  and  (10).  Then,  for  a  chosen  positive  Lyapunov  functional  L  defined  as 
^  —  Z)i  Z);  e«2(7» f)'  its  derivative  L  is 


(15) 


on  simplification 


TV 

—  D\  V"^  [ei(7,  t)  —  e\(j  +  150]2J  (16) 

7=1 

/v  /v 

7=1  7=1 

'N 


(j,t)}X2(j,t)}.     (18) 

7=1  7=1 

It  may  be  noted  from  (12)  and  (13)  that  the  error  dynamics  of  e\(j,i]  and  e2(j:t)  are 
dependent  only  on  their  respective  error  function  variables,  and  on  the  square  of  the 
driving  signals,  X-$(j,  f).  Equations  (12)  and  (13)  are  decoupled  in  this  sense  and  may  be 
considered  independently  for  assessing  the  synchronization  behaviour  of  X\  and  X2  by 
independent  Lyapunov  functions  L{  =  Y^j  ej(h  t),i=  1,2.  Then,  from  (16)  and  (17),  it  is 
clear  that  the  L/'s  are  negative  definite,  and  e\(j,t),e2(j^t}  —  *  0  as  t  —  *  oo  by  the 
Lyapunov  stability  theorem.  Using  these  conditions  in  (15),  L  is  strictly  nonpositive  and 
indicates  the  stable  convergence  of  the  error  dynamics  (12-14)  to  the  origin.  That  is,  the 
response  system  would  dynamically  synchronize  with  the  process  dynamics  in  all  its 
variables. 

On  the  other  hand,  if  the  time-series  signals  are  available  from  a  sub-system  of  size 
ns(<yV),  then  following  the  above  Lyapunov  stability  analysis,  synchronization  should 
be  possible  only  within  the  sub-system  region.  This  is  because  for  the  variables  outside 
the  sub-system,  the  error  function  variables  are  governed  by 

*!  (/,  f)  -  -«i  (/,  0  -  Dal  [X,  (I,  t)X2(l,  t)  -  X,  (/,  t}X2(l,  01 

+  Di[«i(Z-  1,0  -2«i(/,0+«i(f  +  1,01.  (19) 

e2(l,  t)  =  -62(1,  t}  -  Da2[X2(l:  *)*?(/,  t)  -  X2(l,  t)X2(l,  t}} 

+  D2[e2(l  -1,0-  2«2(f,  0  +  «a(/  +  1,  01.  (20) 


+  aDa,  [X,  (/,  0*f  (/,  0  -  ^1  (f>  O^f  ('.  01 

+  aDa2(X2(l,  OXf  (/,  0  -  *z(/,  t}X2(l,  f)} 

+  ei(l  +  l,t)},  (21) 


where  /  denotes  the  lattice  sites  outside  the  sub-system,  i.e.,  /  =  1,  2,  .  .  .  ,  N,  excluding  the 
ns  sub-system  lattices  sites  from  the  central  region.  Note  that  the  terms  (19)  and  (20)  are. 
now  not  necessarily  negative  and  synchronization  in  these  variables  cannot  be  guaranteed. 
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This  behaviour  was  numerically  observed  on  driving  the  response  system  (10)  by  scalar 
time-series  signals  from  a  sub-system  of  the  process  of  size  ns  =  1 1 ,  from  the  central 
region  of  the  lattice.  The  space-time  plot  of  the  error  function  e'2(j,  t)  in  figure  4b  clearly 
shows  only  sub-system  synchronization. 

In  situations  when  a  suitable  Lyapunov  functional  cannot  be  constructed,  the 
synchronization  capability  of  the  response  system  with  that  of  the  process  may  be 
inferred  from  an  analysis  of  the  conditional  Lyapunov  exponents  of  the  response  system 
[18, 19].  Following  the  results  of  the  sub-system  invariants  discussed  in  §3,  it  may  be 
sufficient  to  analyse  the  conditional  Lyapunov  exponents  of  only  a  sub-system  of  size 
«s(>  »sc)  of  the  response  system.  The  procedure  for  calculating  the  conditional  Lyapunov 
exponents  of  a  sub-system  is  similar  to  that  of  the  sub-system  Lyapunov  exponents 
discussed  in  the  §  3,  but  now  we  need  to  monitor  the  growth  rate  of  only  2ns  sets  of 
orthonormal  vectors  (<5X,(fc,r),  i  =  1,2;  k  =  1,2, . . .  ,ns)  in  a  linearized  region  of  (11). 
The  maximum  conditional  Lyapunov  exponent  for  a  sub-system  of  size  ns  =  1 1  of  the 
response,  system  (11)  was  found  to  be  negative  (Amax  ~  —0.07),  as  expected.  Thus,  the 
sub-system  conditional  exponents  may  be  used  to  assess  the  synchronization  capability  of 
the  response  system  (see  figure  4a). 

4.2  Synchronization  in  non-replica  systems 

Next,  we  consider  the  situation  when  the  response  system  is  a  non-replica  of  the  process, 
i.e.,  the  response  and  the  process  are  assumed  to  be  operating  in  different  parametric 
regimes.  Synchronization  of  the  response  dynamics  with  that  of  the  process  in  this 
situation  may  be  difficult  because  of  inaccuracies  in  its  parameter  values.  For  example, 
consider  both  the  process  and  the  response  system  exhibiting  spatiotemporal  chaos,  but 
with  a  chosen  control  parameter,  (3,  of  the  process  having  a  value  2.93  (which  may  not  be 
known  a  priori  in  a  given  experimental  situation),  and  the  corresponding  response 
parameter,  J3,  set  incorrectly  at  2.73  at  t  =  0.  The  hence  response  and  the  process  evolve 
asynchronously  because  of  this  parametric  inaccuracy,  and  simple  driving  as  employed  in 
subsection  4.1  for  replica  systems,  was  found  to  be  insufficient  for  the  complete 
dynamical  synchronization.  There  exists  a  need,  therefore,  to  correctly  estimate  the 
parametric  inaccuracy.  To  do  so,  we  focus  attention  on  a  convenient  strategy  which  allows 
to  correctly  estimate  the  inaccuracy  in  the  control  parameter  via  suitably  chosen 
functional  forms  for  parametric  self-adaptation.  For  low-dimensional  chaotic  systems  it 
has  been  shown  that  many  simple  choices  of  the  functional  forms  may  indeed  be  used  for 
parametric  self-adaptation  and  control  [57-65]. 

For  parametric  self-adaptation  of  spatiotemporal  systems,  we  begin  by  introducing 
space-time  dependence  in  the  chosen  response  parameter,  i.e.,  J3(j,  t}  and  define  a  new 
variable,  A/3(7,if),  denoting  the  tune-dependent  parametric  corrections  to  be  made  such 
that,  fi(j,  r)  =  [/3(y,  0)  +  A/3(y,  t)}  — >  (3  as  t  — >  oo.  For  convenience,  it  was  assumed  that 
at  t  =  0,  /3(j,  0)  =  /3,  and  A/5( 7, 0)  =  0,  for  ;  =  1 , 2, . . . ,  N  with  the  following  simple 
functional  form  for  the  dynamical  corrections,  A/3(j,  t),  viz., 

=  -&C/,0  -X3t/,0]  =  -«3(j,0-  (22) 

It  would  be  interesting  to  see  if  a  Lyapunov  functional  can  predict  the  stability  of  the  error 


dynamics  with  the  parametric  self-adapter  equation  (22).  The  error  dynamics  in  this  case 
follows 


+  Di[«i(j-  1,0  -2*i(./,0  +«iO'+M)],  (23) 
,  0  =  0(7,  0)  +  A/3(y,  0)  -  £  -  *2(7,  0  [1  +  Ofl2X32(./,  0] 

+  D2[e2(j  -1,0-  2*2(7,  0  +  ^2(7  +1,0],  (24) 

]Xl(j,t).  (25) 


It  may  be  noted  that,  similar  to  the  analysis  for  the  replica  system,  the  error  dynamics  of 
ci(7,0  is  again  dependent  only  on  e\(j,t)  and  the  square  of  the  time-series  signals, 
Xj(j,  0-  That  is,  we  can  define  a  Lyapunov  function  L\  —  ]Cj<22(7,  0  sucn  mat 


O-.O-^C/'+l.O]2  (26) 

is  negative  definite.  Hence,  following  the  Lyapunov  stability  theorem,  e\  (j,  0  ~>  0, 
irrespective  of  the  dynamical  behaviour  of  the  time-series  signals,  X$(j,t)  and  the 
remaining  variables  of  the  response  system.  This  implies  that  synchronization  in  X\  (j,  t) 
is  guaranteed  and  this  feature  may  be  used  for  studying  the  synchronization  behaviour 
with  respect  to  the  remaining  variables.  Thus,  on  defining  a  positive  Lyapunov  functional 
of  the  form  L  =  £;*?(j,0  +  *?  (7,0^(7,0  +  e\(j,t)  +  A/32(./,0]>  we  find  that  its 
derivative,  L,  is  strictly  nonpositive  as  t  —  »  oo.  Hence,  dynamical  synchronization  of  the 
response  system  with  that  of  the  process  is  likely  with  the  corrections,  A/3(y,  t}, 
appropriately  estimated.  A  numerical  study  confirmed  the  above  prediction  (results  not 
shown). 

It  is  interesting  to  note  that  the  above  Lyapunov  functional  analysis  is  valid  even  if  the 
monitored  time-series  signals  are  noisy  due  to  measurement  errors.  That  is,  we  assume 
the  response  model  to  be  driven  by  noisy  scalar  time-series  data,  X'3(j,t},  of  the  form 

X3(7,0=*3(7,  0+7??(7,0,     7=1,...,^,  (27) 


where  7  is  the  intensity  of  Gaussian  noise,  77(7,  0-  The  inherent  tendency  of  the  response 
to  reduce  the  errors  ei(j,t],  i—  1,2,3  still  persists  with  a  noise  reduction  capability 
depending  on  the  noise  level  and  frequency.  Our  studies  showed  that  when  the  driving 
signals  X'3(j,t),  were  assumed  to  be  available  from  all  the  lattice  sites,  synchronization 
in  the  response  variables  was  observed  over  the  entire  spatial  domain  even  for  noisy 
time-series  signals.  This  is  depicted  in  figures  5a  and  b,  by  the  diminishing  behaviour 
of  the  relative  error  signals  *!(./',  0  =  &(7?0  ~xi(j,  0]  /*/(./>  0>  *'=1>2,  with  the 
convergence  of  A/3(y,  0  to  the  mean  correction,  J3  -  /3(j,  0)  =  0.2,  shown  in  figure  5c. 
Note  that  A/5(7,  0  fluctuates  around  this  mean  correction  in  an  attempt  to  filter  the  effects 
of  noise  on  the  dynamics  of  the  response  system.  In  keeping  with  the  observations  made 
earlier  on  studying  synchronization  properties  for  data  monitored  from  a  subsystem, 
the  results  obtained  with  noisy  driving  signals  from  a  sub-system,  i.e.,  X'3(k,t), 
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Figure  5.  Synchronization^  in  a  non-replica  system  for  an  initial  inaccuracy  in 
response  parameter  J3,  i.e.,  J3  =  2.73,  while  process  parameter  (3  =  2.93.  The  study 
assumed  the  response  system  to  be  driven  by  time-series  signals,  X'3(j,t), 
j  =  1,2, ...  ,Af,  corrupted  with  measurement  noise  of  intensity  7  =  0.00002.  (a,b) 
The  relative  error  signals,  ei(7,r),e2(7,0  ->  0,  showing  the  synchronization  of  the 
response  dynamics  with  the  process;  e\  axis-scale:  (-0.051,0.316);  e'2  axis-scale: 
(-0.051,0.316).  (c)  Space-time  behaviour  of  the  correction,  A/3(y,r),  converging 
accurately  to  0.2  over  the  entire  spatial  domain  is  seen;  A/3  axis-scale: 
(-0.051,0.251). 

k=  1,2, ...,ns  showed  that  synchronization  was  again  possible  only  within  the  sub- 
system region.  This  is  shown  in  figures  6a  and  b,  where  the  relative  synchronization . 
error,  e[(j,t),  i  =  1,2,  has  been  plotted.  Note  that  for  the  lattice  sites  outside  the  sub- 
system (and  not  covered  by  the  adapter  equations  (22)),  an  average  adaptation 
(A/3)  =  ^fc  A/?(fc,  r)/ns  was  employed.  The  dynamical  convergence  of  this  average 
correction,  (A/3),  to  0.2  is  shown  in  figure  6c.  These  results  suggest  that  even  with 
limited  and  inaccurate  sub-system  information,  it  may  be  possible  to  correctly  estimate 
the  inaccuracy  in  the  chosen  control  parameter,  though,  complete  dynamical 
synchronization  of  the  response  in  all  its  variables  may  be  a  difficult  task. 


4.3  Stabilization  of  spatiotemporal  chaos 

Frequently,  it  may  be  necessary  to  stabilize  the  spatiotemporally  chaotic  dynamics  and 
in  this  subsection  we  present  the  results  obtained  on  driving  a  response  system 
using  limited  and  noisy  time-series  signals  from  a  process  exhibiting  time-independent 
stable  dynamics.  The  response  system  was  assumed  to  exhibit  spatiotemporal  chaos 
with  D«3  =  80  and  the  aim  was  to  self-adapt  this  parameter  so  that  the  response  system 
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Figure  6.  Synchronization  behaviour  of  a  spatiotemporally  chaotic  dynamics  of  the 
response  when  driven  with  noisy  data  from  a  sub-system  of  size  ns  =  11,  from  the 
central  region  of  the  lattice,  (a,  b)  Synchronization  observed  only  within  the  sub- 
system region  where,  e\(j,t},ei(j,t}  ->  0;  e(  axis-scale:  (-24.89,182.8);  e'2  axis- 
scale:  (-1.117,8.316).  (c)  The  temporal  behaviour  of  (A/5),  is  seen  to  accurately 
estimate  the  correction  required. 


a  priori).  Thus,  in  this  case,  we  study  the  inverse  problem  of  locating  the  control 
parameter  value  which  yields  a  desired  output  in  the  dependent  variables  of  the 
response  system.  The  dynamical  equations  for  the  error  functions  ei(j,  t},  in  this  situation 
follows 


(28) 
(29) 


e2(j,t)  =  -e2(i, 


j  -  1,  f)  -  2e2(j,  0 


j,  t), 


(30) 


on  introducing  space-time  dependence  in  the  chosen  response  parameter,  i.e.,  Dai(j,f) 
with  Da$(j, 0)  —Da^.  As  before,  we  now  define  a  new  variable,  ADa.3(j,t),  for  the 
parametric  corrections  required  to  be  made  to  the  response  parameter  Da^  such  that  the 
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Figure  7.  Stabilization  of  the  spatiotemporal  chaos  with  noise  reduction,  (a)  The 
relative  error  e'3(j,  t)  -»  0  indicating  stabilization  of  the  response  system  at  the  desired 
state;  e'3  axis-scale:  (-2.88,  27.89).  (b)  The  convergence  of  {A£>a3}  to  the  initial 
difference  Daj  -  Das  =  -50. 

true  process  value  Da3  is  realized  as  t  — >  oo.  For  convenience,  we  assign  at  t  =  0, 
ADa3(;,0)  =  0  and  to  bring  out  the  flexibility  in  choosing  a  dynamical  form  for  the 
parametric  self-adaptation,  we  now  consider 


,  0  -  X3(;,  r)]X3(  j,  0  =  e3(j,  0X3(7,  0- 


at 
Then,  for  a  positive  Lyapunov  functional  L  defined  as 


(31) 


(32) 


its  derivative,  L,  is  again  seen  to  be  strictly  nonpositive  and  from  the  Lyapunov  stability 
theorem  it  follows  that  the  response  system  would  completely  synchronize  with  the  stable 
process  dynamics  for  appropriate  dynamic  corrections,  AD#3(7,  f).  This  was  also 
numerically  confirmed  for  noisy  driving  signals,  X'3(k,t],  from  a  sub-system  of  the 
process.  However,  unlike  in  the  previous  studies,  we  now  observe  synchronization  over 
the  entire  spatial  domain.  This  is  clearly  seen  in  figure  7a  for  the  relative  error  signal, 
63(7,  ?)  — >  0,7  =  1, . . . ,  N,  when  the  response  system  was  assumed  to  be  driven  by  time- 
series  signals  from  a  sub-system  of  size  ns  —  11,  from  the  central  region  of  the  process 
(following  (27)),  viz.,  X'3(k,  t),  k  =  1,2, . . .  ,ns.  As  in  the  previous  study,  for  the  lattice 
sites  outside  the  sub-system  (and  not  covered  by  (31)),  an  average  adaptation 
(ADa3)  =  ^  ADa3(fc,  ?)/ns  was  employed.  The  convergence  of  (AD«3}  to  the  initial 
difference  in  the  response  and  process  parameters,  Da3  —  Da3  =  —50,  is  seen  in 
figure  7b.  Similar  results  were  observed  on  varying  the  noise  strength  7.  Thus,  the  self- 
adaptation  mechanism  can  be  effectively  used  even  in  the  presence  of  reasonable  extents 
of  noise.  These  results  also  suggest  that  it  may  be  possible  to  stabilize  the  spatiotemporal 
chaos  in  real-life  experiments,  using  only  scalar  driving  signals  from  a  subsystem  of  an 
experimental  process  along  with  an  appropriate  self-adaptation  mechanism  as  controller 
feedback.  Importantly,  a  knowledge  of  the  nonlinear  mathematical  model  or  a  process 


In  the  different  situations  discussed  above,  inaccuracies  were  assumed  in  only  a  single 
system  parameter.  However  in  many  situations,  the  inaccuracies  may  exist  in  more  than 
one  parameter  and  for  synchronization  to  occur,  simultaneous  correction  in  all  the 
inaccurate  parameters  would  then  be  required.  This  topic  is  being  actively  pursued  and 
different  strategies,  e.g.,  parametric  self-adaptation  [64,66],  optimization  [68],  neural 
networks  [69,70],  etc.  have  been  reported.  In  this  subsection,  we  show  that  multi- 
parameter  control  of  a  spatiotemporally  chaotic  system  via  judicious  self-adaptation  is 
feasible  and  advantageous  because  of  its  simplicity.  For  the  sake  of  convenience,  we 
rewrite  the  process  equations  as 

X!  (7,  0  =1  -  *i  (7,  0  -  0*1*1  (7,  t)Xl(j\  0 


*2(7,  0  =£  -  *2(7,  0  ~  002X2(7,  f)Xf  (7,  0 

+  02[X2(7+  1,0  -2X2(7,0  +*2(7-  1,0], 

*3(7',  0  =&i  -A2Xi(j,t}+Ai[Da]X}(j,t)+Da2X2(j,t}}Xl(j,t) 

+  D3[X3(j+  1,0  -2X3(7,  0+*»(7  -1,0],  (33) 

where  AI,  A2,  and  A3  are  the  chosen  control  parameters  and  may  be  identified  as 
A!  =  1.0,  A2  =  1  +1)0(3  =  81,  and  A3  =  a  =  1.5  in  (3).  Let  us  assume  that  these  are 
incorrectly  set  to  A  i  =  1.1,  A2  =  82,  and  AS  =  1  .7,  respectively,  in  the  response  system 
driven  by  the  time-series  signals,  Xi(j,  t),j=l,2,...,N.  Thus, 

X,  (7,  0  =1  -  Xi  (7,  0  -  Da  A  (7,  0*3  C/>  0 

+  01[X1(7-+l,0-2X1(</,0+#i(7-l,0], 
#2(7,  0  =13  -  X2(j,  0  -  ZtezXaU,  0*3  (./,  0 


^3(7,0  =Ai  -^2X3  (7,0  +  A3[Da,X,  (7,0  +002*2(7,  01^(7,0 

+  D3(X2(j+  1,0  -2X3(7,0  +X*U-  1,01-  (34) 

The  aim  is  then  to  control  the  response  system  dynamics  by  proper  estimation  of  the 
parametric  corrections,  viz.,  AAj,  AA2  and  AAs.  We  first  show  that  for  an  appropriate 
choice  of  driving  signals  and  functional  forms  for  self-adaptation,  a  Lyapunov 
functional  may  be  formulated  to  assess  the  synchronization  and  control  capability  of  the 
response  system.  The  dynamical  behaviour  of  the  error  functions,  e,-(y,  0  = 


i(;-l,0-2«i(7,0  +  «i(7  +  l,0],  (35) 

e2(j,  0  =  ~ez(J,  0[1  +  Oa2Xj(7,  t}} 

+  D2[e2(j  -1,0-  2^2(y,  ?)  +  ez(j  +1,0],  (36) 
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tfaU.O  =  AAiO',0  -  AA2(j,  OX3(;,  0 

+  Dai  [AA3  (  j,  OXi  (7,  0  +  A3ei  (7,  ?)]X3  (7.  0 

+  Dfl2[AA3(7,OX2(7,0  +  A3«2(7,0]X3(7,0>  (37) 

with  the  dynamical  equations  for  the  parametric  corrections,  AA,-(7,  f),  i  =  1,  2,  3,  chosen 
respectively,  as 

AAiC/,0  =  -[X3(/,0  -X3(7,01  =  -«3(7,0,  (38) 

AA2(7,  0  =  [X3(y,  0  -  X3(7,  0]X3(7,  0  =  £3(7,  t)X3(jt  0, 


01*3(7,  0  -  *3  (7,  01X1(7,  0 
=  [Da!*!  (7,  0  +  Da2X2(j:  0]«3(7,  OX?  (7,  0-  (4») 

Then  the  derivative  of  a  positive  Lyapunov  functional,  L  =  Y^i  ]C/[e?C/>  0  +  AA2(7,  f)], 


that  is, 

(1/2)L=  - 


7=1 

7,  01  -  SzfeO,  0  -  «2(7  +  1,  Ol2} 


]X32(7,0-  '     (42) 

It  may  be  noted  that  (35)  and  (36)  are  again  decoupled  systems,  and  following  the 
arguments  presented  in  the  previous  subsections,  we  obtain  ei(j,i)  —  >  0  as  t  —  >•  oo,  for 
i  =  1,2.  Using  these  results  in  (41),  it  is  clear  that  L  is  non-positive  and  dynamical 
synchronization  of  the  response  with  the  process  should  be  possible.  This  was  confirmed 
numerically  and  below  we  present  our  results  obtained  on  using  driving  signals  only  from 
a  sub-system  of  the  process,  i.e.,  in  X^(k,  t),  k  =  1,2,  .  .  .  ,ns.  As  before,  we  begin  by 
introducing  space-time  dependence  in  the  chosen  control  parameters,  and  assume  that  at 
t  =  0,  Ai(7,  0)  =  A,-  7^  Aj.  The  parametric  corrections,  AA,-(fc,  f)'s,  within  the  sub-system 
are  assumed  to  be  governed  by  (38-40).  For  the  lattice  sites  outside  the  sub-system,  an 
average  adaptation,  (AA,-)  =  X^fc  AA,-(fc,f)/ns,  were  implemented.  Figures  8a,  8b  and  8c 
respectively  show  the  convergence  of  (AAi)  -»  -0.1,  (AA2)  -»•  -1.0  and  (AA3)  —  »• 
-0.2,  the  required  corrections.  Thus,  these  results  suggest  that  it  may  be  possible  to 
simultaneously  adapt  more  than  one  system  parameter  using  limited  scalar  time-series 
data  from  a  sub-system  of  the  process  for  appropriate  choices  of  the  functional  forms  for 
parametric  corrections.  It  may  be  clarified  that  when  ns  =  Nt  not  only  adaptation  of  the 
parameters  but  control  via  synchronization  of  the  complete  space-time  dynamics  of  the 
response  system  may  be  possible. 
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Figure  8.     (a,  b,  c)  Multi -parameter  control  via  self-adaptation  showing  the  accurate 
estimation  of  the  corrections,  (AA/),  /  =  1, 2,3. 


5.  Conclusion 

The  analysis  of  sub-system  dynamics  for  a  nonlinear  autocatalytic  reaction-diffusion 
system  as  a  function  of  increasing  sub-system  size  showed  interesting  linear  scaling 
relationships  with  respect  to  the  invariant  measures  like  the  Lyapunov  dimension  and  KS- 
entropy,  while  the  dimension  densities  and  normalized  entropy  were  found  to  become 
independent  of  the  sub-system  size  beyond  a  certain  critical  size  nsc.  These  results 
suggest  that  characterization  of  the  complete  system  dynamics  may  be  possible  from  an 
analysis  of  the  Lyapunov  spectrum  of  its  sub- system  of  size  ns  >  nsc  with  considerable 
reduction  in  the  computational  effort.  Our  preliminary  results  for  the  autocatalytic 
reaction-diffusion  system  on  a  two-dimensional  lattice  has  also  shown  similar  trends  for 
the  system  invariants  as  a  function  of  its  sub-system  size.  Thus,  the  approach  of  analyzing 
sub-system  behaviour  may  be  advantageously  used  in  the  characterization  of  more 
complex  higher-dimensional  systems.  It  may  be  noted  that  the  studies  discussed  in  this 
paper  were  carried  out  for  nearest-neighbor  diffusive  coupling.  However,  in  most 
practical  situations,  other  transport  mechanisms  such  as  convection  may  also  concurrently 
occur.  It  would  then  be  worthwhile  to  study  the  validity  of  the  scaling  relationships  in  the 
sub-system  invariant  properties  for  systems  involving  more  complex  and  long-range 
coupling. 

We  have  used  the  above  characterization  results  to  analyse  the  synchronization  and 
controllability  of  spatiotemporal  systems  exhibiting  chaotic  dynamics.  The  control 
objectives  were  carried  out  for  inaccurate  knowledge  of  the  initial  conditions,  parameter 
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settings,  and  limitations  in  the  monitoring  of  time-series  data  from  the  spatial  domain 
which  may  additionally  be  corrupted  with  noise.  Our  results  using  sub-system 
information  show  that  it  may  be  possible  to  correctly  estimate  the  inaccuracies  in  the 
control  parameter(s)  via  suitably  chosen  forms  of  parametric  self-adaptation(s). 
Suppression  of  chaos  and  stabilization  of  the  spatiotemporally  chaotic  dynamics  to  a 
desired  stable  state  was  also  possible  using  sub-system  driving  signals  even  in  the 
presence  of  reasonable  extents  of  noise.  However,  for  the  synchronization  of  complex  and 
chaotic  dynamics,  time-series  data  from  the  entire  spatial  domain  may  be  required. 
Further,  it  has  been  shown  that  it  may  be  possible  to  assess  the  synchronization  capability 
of  a  spatiotemporal  system  from  the  stability  analysis  of  suitably  constructed  Lyapunov 
functionals.  This  approach  offers  the  advantage  of  a  priori  knowledge  of  the 
synchronization  behaviour  under  different  situations. 
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Abstract.  We  review  some  advances  in  the  theory  of  homogeneous,  isotropic  turbulence.  Our 
emphasis  is  on  the  new  insights  that  have  been  gained  from  recent  numerical  studies  of  the  three- 
dimensional  Navier  Stokes  equation  and  simpler  shell  models  for  turbulence.  In  particular,  we 
examine  the  status  of  multiscaling  corrections  to  Kolmogorov  scaling,  extended  self  similarity, 
generalized  extended  self  similarity,  and  non-Gaussian  probability  distributions  for  velocity 
differences  and  related  quantities.  We  recount  our  recent  proposal  of  a  wave- vector- space  version  of 
generalized  extended  self  similarity  and  show  how  it  allows  us  to  explore  an  intriguing  and 
apparently  universal  crossover  from  inertial-  to  dissipation-range  asymptotics. 
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1.  Introduction 

If  a  fluid  is  forced  through  a  grid  (say  in  a  wind  tunnel)  or  through  an  orifice,  the  flow 
downstream  is  turbulent  [1].  Most  of  the  energy  is  contained  in  the  large  scales  of  the 
flow;  transport  too  is  dominated  by  the  large  scales.  Nevertheless,  there  has  been  great 
interest  in  the  study  of  turbulence  at  small  scales;  these  include  the  dissipation  range,  in 
which  energy  is  dissipated,  and  the  inertial  range,  that  lies  in  between  dissipation  and 
large  scales.  There  are  both  practical  and  fundamental  reasons  for  this  continuing  interest. 
We  concentrate  on  the  latter  here  and  refer  the  reader  to  the  recent  article  by  Sreenivasan 
and  Antonia  [2]  for  a  more  detailed  discussion.  Other  recent  reviews  include  those  by 
Siggia  [3]  and  Nelkin  [4],  the  brief  overview  by  L'vov  and  Procaccia  [5],  and  the  books 
by  McComb  [6],  Frisch  [7],  and  Bohr,  Jensen,  Paladin,  and  Vulpiani  [8];  older  material 
can  be  found  in  the  classics  by  Batchelor  [9],  Tennnekes  and  Lumley  [10],  and  Monin  and 
Yaglom  [11]. 

There  are  many  fundamental  reasons  for  studying  the  statistical  properties  of  small- 
scale  turbulence,  which  is  (in  many  cases,  e.g.,  far  from  boundaries,  etc.),  to  a  good 
approximation  [2, 12, 13],  homogeneous  and  isotropic:  (1)  These  properties  are  believed 
to  be  universal,  i.e.,  independent  of  the  details  of  the  flow;  specifically,  the  asymptotic 
behaviours  of  certain  correlation  functions  (the  structure  functions  defined  below)  are 
universal  in  a  way  that  is  reminiscent  of,  but  perhaps  richer  than,  the  universality  of 
correlation  functions  at  continuous  phase  transitions  [14].  (2)  From  the  point  of  view  of 


dynamical  systems,  turbulence  is  the  quintessential  example  of  spatiotemporal  chaos  [8] 
in  an  extended,  deterministic,  driven  system.  (3)  To  the  extent  that  the  chaos  generated  by 
turbulence  can  be  mimicked  by  an  external  noise  [15],  one  can  try  to  obtain  the  properties 
of  correlation  functions  by  studying  effective  stochastic  equations,  which  are  amenable  to 
analytical  studies  that  use  renormalization-group,  field-theoretical,  or  closure  techniques 
[16-20].  (4)  And  last,  the  scale-invariance  of  correlation  functions  in  the  inertial  range  is 
an  example  of  self-organized  criticality  [21],  though  very  little  has  been  gained  by 
thinking  about  turbulence  in  this  manner. 

This  paper  is  organized  as  follows:  Section  2  gives  a  brief,  elementary  overview  of  the 
phenomenology  of  homogeneous,  isotropic  turbulence  to  establish  notations.  Section  3 
introduces  some  of  the  deterministic  models  that  have  been  used  in  theoretical  and 
numerical  work  and  reviews  what  has  emerged  from  such  studies.  We  make  some 
concluding  remarks  hi  §  4. 

2.  The  phenomenology  of  homogeneous  isotropic  turbulence 

Experiments  on  turbulent  flows  often  measure  the  velocity  v(r,  t)  or  the  energy  dissipa- 
tion e  at  a  given  spatial  point  r  at  time  t,  by  using  probes  such  as  hot-wire  anemometers. 
These  are  also  augmented  by  flow- visualisation  measurements  [1].  Therefore,  a  typical 
experiment  yields  a  time  series  for  v  or  e  at  a  given  spatial  point.  Most  experiments  are 
done  with  a  mean  background  flow  with  velocity  U  and  the  resulting  temporal  informa- 
tion is  converted  into  spatial  information  by  using  Taylor's  frozen-flow  hypothesis:  This 
states  that  the  turbulence  is  advected  by  the  mean  flow  without  significant  distortion,  so  a 
temporal  delay  At  can  be  converted  into  a  spatial  separation  (along  the  mean  flow) 
Ax  =  AtU  to  obtain  spatial  correlation  functions.  Such  a  hypothesis  is  reasonable  when 
|U|  »  Vrms,  the  root-mean-square  fluctuations  in  the  velocity  because  of  the  turbulence; 
for  a  more  critical  evaluation  of  its  validity  we  refer  the  reader  to  refs  [2,4,7].  The 
Reynolds  number  Re  =  UL/v  is  the  relevant  control  parameter,  which,  when  large,  leads 
to  turbulence;  here  U  and  L  are  typical  velocity  and  length  scales,  respectively,  and  v  is 
the  kinematic  viscosity.  The  Taylor-microscale  Reynolds  number  Re^,  which  is  useful  in 
comparing  widely  different  systems,  will  be  defined  in  §  3. 

Another  important  approximation  is  made  in  obtaining  the  energy  dissipation 
6  =  fSry^'Y/'  +  ^/V')  »  where  the  subscripts  i  andy  denote  Cartesian  components  and 
v  is  the  kinematic  viscosity:  It  turns  out  to  be  difficult  to  measure  e,  so,  typically,  the 
surrogate  e!  ~  (dui/dt}2  is  measured,  with  u\  the  velocity  in  the  streamwise  direction; 
again  we  recommend  [2]  for  a  discussion  of  the  validity  of  using  this  approximation. 

Given  such  data,  a  variety  of  information  can  be  extracted  about  the  turbulent  flow  in 
question.  We  concentrate  on  measurements  that  can  be  classified  into  three  rough 
categories: 

1.  The  first  is  predominantly  qualitative  information  about  coherent  structures,  such  as 
vortex  streets,  eddies,  etc.  These  are  helpful,  as  we  will  see  below,  in  developing 
physically  appealing  scenarios  for  turbulent  cascades,  intermittency,  and  multifractal 
dissipation. 

2.  The  second  is  the  measurement  of  spatial  correlation  functions  (typically  via  the 
Taylor  hypothesis),  the  most  important  ones  being  the  order-p  velocity  structure 
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Sp(r)  =  <[v,(x  +  r)-v,(x)n.  (1) 

The  angular  brackets  imply,  in  principle,  a  spatiotemporal  average  over  the  statistical 
steady  state  that  obtains  in  the  turbulent  fluid.  In  experiments,  often  only  the 
component  parallel  to  the  mean  flow  is  used.  Some  workers  use  a  modulus  instead  of 
the  square  brackets  in  (1),  but  others  [2]  discourage  its  use;  it  clearly  makes  no 
difference  for  even  p  but  can  affect  exponents  [22]  for  odd  p.  Other  commonly 
measured  correlation  functions  include  the  energy  spectrum  E(k)  =  47r£2(|v(k)|2}, 
where  v(k)  is  the  spatial  Fourier  transform  of  the  velocity,  k  the  wave  vector,  and 
*=|k|. 

3.  The  third  is  the  measurement  of  probability  distribution  functions  for  velocity 
differences,  such  as  6  v,-  =  [v,-(x  +  r)  —  v/(r)]  (in  experiments  i  is  often  the  direction 
of  the  mean  flow),  the  energy  dissipation  e,  etc. 


2.1  Flow  visualization 

The  first  category  of  measurements,  namely,  flow  visualization,  have  shown  that 
fluid  turbulence  is  typically  associated  with  coherent  structures.  These  can  be  eddies, 
as  in  mixing-layer  experiments,  or  plumes,  as  in  Rayleigh-Beenard  systems,  etc.  [1], 
These  structures,  though  clear  to  the  eye,  are  difficult  to  quantify  and  define.  In  recent 
elegant  experiments  [23, 24, 25]  high-vorticity,  low-pressure  filaments,  of  the  type  seen 
in  direct  numerical  simulations  (§  3),  have  been  visualized  directly  in  different  turbulent 
flows. 

Many  years  ago  Richardson  [26]  suggested  the  following  cascade  picture:  Turbulent 
flow  is  considered  to  be  made  up  of  a  hierarchy  of  eddies  (disturbances  or 
inhomogeneities  in  the  flow)  of  various  length  scales.  Energy  is  added  to  the  turbulence 
in  the  integral  range,  which  corresponds  to  large  length  scales  or  small  wave  numbers, 
via  large-scale  processes  like  stirring.  The  nonlinear  interactions  in  the  fluid  cause  this 
energy  to  flow  towards  small  scales  (large  wave  numbers),  by  the  formation  of 
successively  smaller  eddies:  large  eddies  lose  stability,  form  smaller  eddies,  which  in  turn 
lose  stability,  break  into  even  smaller  eddies,  and  so  on.  In  this  process,  the  energy 
cascades  from  large  to  small  length  scales,  with  negligible  energy  dissipation  in  the 
inertial  range.  Viscous  processes  lead  to  significant  dissipation  once  the  sizes  of  eddies 
become  comparable  to,  or  smaller  than,  the  dissipation  scale  r)d  =  (^/(e))  . 


2.2  Structure  functions 

If  the  Reynolds  number  Re  is  sufficiently  large  then  there  is  a  substantial  inertial  range,  in 
which  there  can  be  universal  scaling  behaviour  as  noted  first  by  Kolmogorov  [27]  in  1941 
(henceforth  K41).  His  theory  is  based  on  the  following  premises: 

1.  On  small  scales  and  far  away  from  boundaries  fluctuations  in  a  turbulent  flow  are 
homogeneous  and  isotropic,  statistically  steady,  and  universally  determined  by  the 
mean  energy  dissipation  rate  (per  unit  mass  of  fluid)  e  and  the  kinematic  viscosity  v. 
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Figure  1.  A  plot  comparing  data  for  the  inertial-range  exponent  £p  with  different 
theoretical  predictions.  Experimental  data  are  from  Anselmet  et  al  (1984)  and  Benzi 
et  al  (1993).  Theoretical  predictions  are  from  K41,  the  Kolmogorov  log-normal  model 
[42],  with  //  =  0.2,  the  $  model  (with  D  =  2.8),  and  the  SL  model  (see  text).  Clearly 
the  SL  prediction  agrees  best  with  experimental  data. 

2.  In  the  inertial  range,  viscosity  does  not  play  a  significant  part,  so  the  statistical 
properties  at  a  given  length  scale  I  are  determined  by  e  (which  is  finite,  and  non-zero) 
and  the  length  scale  I  itself. 


Given  these  assumptions,  correlation  functions  assume  simple  and  universal  scaling 
forms  in  the  inertial  range,  which  follow  via  dimensional  analysis.  (Note  that  the 
dimensions  of  e  and  the  energy  spectrum  E(k),  which  measures  the  distribution  of  energy 

'.)  Specifically 


over  different  length  scales,  are,  respectively,  L2T  3  and  L3T  2 


Sp(r]  «  CP((W 


(2) 


where  Cp  are  universal  amplitudes;  the  case  p  =  3  yields  the  Kolmogorov  4/5  law 


(C3  =  -4/5).  Also 


E(k)  =  C(e)2/V5/3, 


(3) 


which  is  the  celebrated  -5/3  law. 

The  second  category  of  experimental  measurements  comprise  studies  of  such  structure 
functions.  They  find  that  low-order  structure  functions  (p  <  3)  are  in  close  agreement 
with  the  predictions  of  K41  [28].  In  particular,  the  -5/3  law  has  been  checked  [2, 13, 29] 
over  several  decades  of  wavenumber  k;  the  Kolmogorov  constant  is  found  to  be  [2, 30] 
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Figure  2.  Typical  signals  of  a  representative  component  of  the  surrogate  for  the 
energy  dissipation  e'  (normalized  by  its  mean)  versus  time  t  from  an  atmospheric 
boundary  layer  at  a  moderate  Reynolds  number  (after  Meneveau  and  Sreenivasan 
[50]). 

C  =  0.5  ±  0.05.  However,  the  kurtosis  K  =  S$(r} / (e>2(r))2 ,  which  is  a  measure  of  the 
flatness  of  the  distribution  of  velocity  differences,  has  values  larger  than  the  K41 
prediction  as  r  — >  0  [11],  which  indicates  that  the  velocity  gradients  are  intermittent  (i.e., 
large  values  of  velocity  differences  are  more  probable  than  in  K41).  This  suggests,  in 
turn,  that  the  energy  dissipation  is  nommiform,  contrary  to  one  of  the  assumptions  made 
by  Kolmogorov  in  K41. 

The  validity  of  K41  can  be  tested  experimentally  by  the  measurement  of  high-order 
(p  3>  3)  structure  functions,  which  can  capture  rare  events.  Such  experiments  show  that 

SP(r)~rfc;  (4) 

the  K41  prediction  is  (p  =  £*41  =/?/3.  Even  though  the  measurement  of  high-order 
structure  functions  is  a  difficult  task,  there  are  good  data  from  the  recent  experiments 
[31v,  32, 33]  which  show  that:  (1)  the  (^,'s  are  monotonically  increasing,  nonlinear 
functions  of  p  (i.e.,  we  have  multiscaling);  (2)  the  deviation  from  the  K41  prediction 
S£p  =  p/3  —  £p  is  an  increasing  function  of  p  and  starts  becoming  significant  for  p  >  4 
(figure  1). 

We  mention  in  passing  that  some  experimental  studies  [34]  indicate  that  6£p  decreases 
with  increasing  Re,  as  also  suggested  by  some  theories  [35, 36];  there  are  also  suggestions 
[37]  of  (In  Re)"1  corrections  to  K41.  However,  we  believe  it  is  fair  to  say  that  the  general 
experimental  position  [2, 33]  is  that  the  multiscaling  corrections  to  K41  persist.  A  careful 
study  over  a  wide  range  of  Re,  possible,  e.g.,  in  flows  in  Helium  gas  [38, 39],  should  be 
able  to  resolve  this  potentially  significant  point.  Another  intriguing  issue  has  been  raised 
by  recent  experiments  on  both  open  [38]  and  closed  [39]  flows  in  Helium  gas:  these 
provide  some  evidence  of  a  transition  or  a  crossover  (in,  e.g.,  the  kurtosis  K)  with 
increasing  Re.  Careful  studies,  already  under  way  [40],  must  resolve  whether  these 
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transitions  are  real  or  arise  because  of  the  limitations  of  responses  of  the  probes  used  in 
these  measurements. 

How  might  multiscaling  corrections  to  K41  arise?  Experimental  data  (figure  2)  show 
that  the  energy  dissipation  rate  e  is  strongly  intermittent.  This  is  ignored  in  the  K41 
theory.  High-order  velocity  structure  functions  are  sensitive  to  the  rare  but  large  events 
which  lead  to  intermittency,  hence  the  discrepancy  between  K41  and  the  experimental 
C/s  for  large  p. 

Many  phenomenological  theories  have  been  proposed  to  account  for  the  intermittency 
of  e.  The  oldest  is  the  refined  similarity  hypothesis  (RSH)  of  Kolmogorov  [41],  which 
replaces  (e)  in  (2)  by  er,  the  energy  dissipation  rate  averaged  over  a  sphere  of  radius  r,  i.e., 


SP('}  -  WV73-  (5) 

The  validity  of  the  RSH  has  been  tested  to  some  extent  [42-45],  but  the  issue  is  certainly 
not  closed  as  noted  by  Chen  et  al  [46]  on  the  basis  of  direct  numerical  simulations  (§  3) 
and  Sreenivasan  and  Antonia  [2].  However,  it  is  often  used  as  a  good  working  hypothesis. 
If,  in  the  inertial  range,  we  further  assume  the  scaling  form 

W)-^,  '        (6) 

it  follows  from  (4)  and  (5)  that 

(p=p/3+rp/3.  (7) 

Note  that  the  RSH  yields  a  relation  between  the  statistics  of  velocity  differences  and  the 
unknown  behaviour  of  {e*!/3}.  An  assumption  about  the  latter  yields  rp  and  hence  £p.  In 
his  1962  theory,  Kolmogorov  assumed  that  er/(e)  has  a  log-normal  distribution,  which, 
along  with  the  RSH,  and  the  constraint  r\  =  0,  gives 


(8) 
or 

yer,  D  D    /  7}\ 

cr=f+Mf(i-f),  (9) 

where  the  intermittency  exponent  fj,  =  -r2  c±  0.2.  This  yields  [47]  better  agreement  with 
experiments  (figure  1)  than  K41;  however,  for  large  p,  deviations  are  noticeable  and,  as 
many  authors  have  noted  [2],  there  are  several  problems  with  a  log-normal  distribution 
for  er/(e). 

Other  phenomenological  theories  include  the  J3  model,  proposed  by  Frisch,  Sulem  and 
Nelkin  [48],  and  the  multifractal  description  of  turbulence,  postulated  by  Parisi  and  Frisch 
[49].  The  J3  model  yields 


Here  D  =  In  /3/  In  b,  identified  with  \hefractal  dimension  of  the  space  filled  by  the  eddies 
(at  each  stage  of  the  cascade  the  linear  size  of  the  eddy  is  scaled  down  by  a  factor  b  >  1 
and  j3  is  the  ratio  between  the  volumes  of  mother  and  daughter  eddies),  is  used  as  a  fitting 
parameter.  Figure  1  shows  a  fit  to  £p  with  D  =  2.8;  clearly,  the  /3-model  fit  overestimates 


£p  for  8  <  p.  It  also  yields  an  inertial-range  energy  spectrum  which  is  steeper  than  the 
-5/3  spectrum  of  K41. 

It  is  easy  to  see  [8]  that  the  Navier-Stokes  equation  (see  below)  for  an  incompressible 
fluid  is  invariant  under  the  scale  transformation  r  —  >  br,  v  —  >  bhv,  v  —  »  bl+hv,  t  —  >  bl~ht, 
with  an  arbitrary  exponent  h.  It  has  been  suggested  [49]  that,  at  high  Re,  i.e.,  small  v  ,  one 
must  account  for  multifractality  with  many  exponents  h  as  has  been  shown  explicitly  [50] 
for  e  (see  below):  Global  scale-invariance  (as  in  K41)  is  replaced  by  local  scale-invariant 
structures,  i.e.,  the  scaling,  or  Holder,  exponent  h  varies  at  different  points  in  the  fluid, 
which  is  considered  to  be  a  superposition  of  fractal  subsets,  each  with  an  /i-dependent 
dimension  D(h}.  Structure  functions,  which  now  have  contributions  from  each  fractal 
subset,  can  be  written  as 


Sp(r]  ~      (r/L-dh,  (11) 

where  L/  is  the  integral  scale.  A  saddle-point  estimate  yields  (cf.,  (4)) 

Cp  =  rmnh[hp  +  3-D(h}}  (12) 

or 

/i-CP].  (13) 


The  K41  theory  corresponds  to  a  single  exponent  h  =  1/3  with  D(ti)  =  3. 

The  emergence  of  multifractal  scaling  for  e  through  an  asymmetric  breakdown  of 
eddies  has  been  discussed  in  detail  and  substantiated  experimentally  in  the  elegant  studies 
of  Meneveau  and  Sreenivasan  [50].  In  their  picture,  an  eddy  of  size  t  in  d  dimensions 
breaks  up  into  2d  eddies  of  equal  size  1/2.  The  key  assumption  in  this  model  is  that 
the  energy  flow  to  these  smaller  eddies  takes  place  asymmetrically.  The  simplest 
nontrivial  choice  is  that  a  fraction  p\  of  the  total  energy  gets  distributed  equally  to  half  of 
these  new  2d  eddies;  the  remaining  fraction  (p2  =  1  —  p\)  of  energy  goes  to  the  other 
half.  These  eddies,  in  turn,  break  up  into  smaller  eddies,  and  this  process  is  repeated  with 
fixed  pi,  until  one  obtains  eddies  of  the  order  of  the  dissipation  scale.  A  one-dimensional 
section  of  the  resulting  multifractal  set  should  look  qualitatively  similar  to  the  energy 
dissipation  signal  shown  in  figure  2.  Indeed,  Meneveau  and  Sreenivasan  [50]  have  shown 
that  such  a  multiplicative  process  with  p\  =  0.7  yields  a  multifractal  set  which  has  the 
same  singularity  spectrum  as  the  one  obtained  by  a  multifractal  analysis  of  the  data  in 
figure  2. 

Recently,  She  and  Leveque  (SL)  [51]  proposed  a  simple  model  which  leads  to  a 
prediction  for  £p  in  very  good  agreement  with  experimental  results.  This  model  is  based 
on  a  set  of  hypotheses  about  the  statistics  of  small-scale  fluctuations  in  turbulence.  These 
lead  to  a  formula  for  £p,  which  has  no  adjustable  parameters.  The  SL  hypotheses  are: 

(i)  The  RSH  (5)  and  (6)  is  assumed  to  hold. 

(ii)  Furthermore  the  moments  of  the  energy  dissipation  are  assumed  to  obey  the 
hierarchy 

0  </?<!,  (14) 
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where  Ap  are  constants  and  ej00^  =  limp_>00(e^+  )/{eJ}  is  a  quantity  that  is  sensitive 
to  the  tail  of  the  distribution  of  Q,  i.e.,  to  very  intermittent  structures.  SL  postulate 
that  the  hierarchy  originates  from  some  hidden  statistical  symmetry  of  the  Navier- 
Stokes  equation. 

(iii)  The  quantity  4°°^  appearing  in  the  moment  hierarchy  (14)  is  associated  with  the 
most  intermittent  dissipative  structures  and  SL  suggest  that  it  has  a  divergent  scale 
dependence,  namely, 

e'^-r*,     1-+Q.  (15) 

They  argue  further  that  ef°]  ~  SE°°/tt,  where  6E°°  ~  81$  ~  (e)2/3^/3  is  the  largest 
amount  of  energy  available  for  dissipation  (i.e.,  the  energy  in  the  largest 
structures)  and  the  scale-dependent  energy-transfer  time  fy  ~  (e)"1'3^2/3.  The  last 
asymptotic  relation  follows  from  dimensional  analysis  and  the  assumption  that  tg 
shows  no  anomalous  scaling.  Finally  one  obtains  A  =  2/3. 

(iv)  The  most  intermittent  dissipative  structures  are  assumed  to  be  filamentary,  i.e.,  of 
spatial  dimension  1  or  codimension  2. 

Equations  (6)  and  (14)  yield  a  difference  equation  for  rp  which,  when  solved  with  the 
boundary  conditions  that  follow  from  the  SL  hypotheses,  gives 

rf  =  -lp  +  2[1-  ®a  (16) 

or,  via  (7), 


The  most  important  boundary  condition  follows  from  (14)  and  (15)  and  is  limp-,00 
(TP+I  —  Tp)  =  —2/3,  i.e.,  rp  —  >  —  2^/3  +  Co  as  p  —  *  oo;  it  is  easy  to  see  from  (13)  that  Q 
is  the  codimension  of  the  most  singular  structure,  i.e.,  the  one  with  the  smallest  Holder 
exponent  h^n.  Assumption  (iv)  yields  Co  =  2  which  corresponds  to  h^n  =  1/9. 

The  SL  formula  is  in  good  agreement  with  the  experimental  values  of  £p  as  can  be  seen 
from  figure  1.  The  success  of  this  formula  is  indeed  remarkable,  since  the  validity  of  the 
assumptions  made  is  not  yet  clear.  Of  the  assumptions  (i)-(iv),  (i)  is  generally  accepted  to 
be  a  good  working  approximation.  Direct  numerical  simulations  indicate  that  high- 
vorticity  regions  are  filamentary  [52,53],  which  is  used  to  motivate  assumption  (iv);  we 
return  to  this  point  in  §3.  Assumption  (iii)  awaits  detailed  experimental  and  numerical 
verification.  Recently,  this  hierarchy  has  been  verified  experimentally  [54]  to  some  extent 
using  wind-tunnel  data.  Dubrulle  [55]  mentions  that  a  similar  hierarchy  has  been  shown 
to  exist  in  the  GOY  shell  model  (see  below). 

Benzi  et  al  [32]  have  proposed  a  very  interesting  way  of  extending  the  region  over 
which  inertial-range  exponents  may  be  extracted  from  structure  functions.  The  extended 
self-similarity  (ESS),  as  the  procedure  is  called,  states  that  the  real-space  structure 
functions  present  an  extended  inertial  scaling  range  in  log-log  plots  of  one  structure 
function  against  another.  From  (4)  it  follows  that 

S,(r)~S,(r)^';  (18) 

and  Benzi  et  al  [32]  have  shown,  by  using  experimental  data,  that  the  inertial  scaling 
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region  extends  up  to  r  ~  5rjd  in  ESS  plots.  (By  contrast,  plots  of  Sp(r]  versus  r  show 
inertial  range  scaling  only  up  to  r  ~  20  77^.)  It  is  conventional  to  use  q  =  3  in  (18),  so 
Cp/Cs  ~  C,p  since  £3  is  very  nearly  equal  to  unity,  when  the  flow  is  homogeneous  and 
isotropic.  (Indeed  the  Karman-Howarth  relation  [7]  yields  £3  =  1.)  Thus  ESS  provides  an 
effective  method  for  determining  £p  from  data  for  which  the  extent  of  the  inertial  range  is 
often  restricted.  However,  the  reason  behind  the  success  of  ESS  is  still  unclear  (but  see 
[56, 57]  for  some  progress  on  this  front). 

Although  ESS  is  an  effective  method  for  determining  inertial-range  scaling  behaviour 
in  some  turbulent  flows,  it  does  not  work  well  for  all  such  flows,  particularly  in  the 
presence  of  a  strong  shear.  For  such  systems,  Benzi  et  al  [58]  have  proposed  the 
generalized  extended  self  similarity  (GESS)  ansatz,  which  they  claim  extends  the  scaling 
down  to  the  smallest  resolvable  spatial  separations.  They  begin  with  the  dimensionless, 
real-space  structure  functions 


If  K41  were  true,  Qp(r}  would  be  independent  of  r  in  the  inertial  range.  However,  because 
of  multiscaling,  Qp(r)  displays  nontrivial  scaling  behaviour.  Specifically,  Benzi  et  al  [58] 
propose  that  Qp(r)  is  self-scaling  over  all  resolvable  length  scales,  i.e., 


From  the  r-dependence  of  Sp    (4)  it  follows  that 


Thus,  if  log  Gp(r]  is  plotted  against  log  Qq(r)  one  should  get  a  straight  line  with  slope  ppit} 
up  to  all  resolvable  length  scales.  This  has  been  checked  to  some  extent  by  Benzi  et  al 
[58]  from  numerical  and  experimental  data  for  (p,q)  =  (5,6). 

Dissipation-range  behaviors  of  structure  functions  have  been  studied  less  than  their 
counterparts  in  the  inertial  range.  There  are  some  data  for  the  energy  spectrum  E(k}.  In 
particular,  K41  suggests  that 


E(k)  =  (e}r53/(*Ad),  (22) 

where  the  dissipation  scale  &J1  =  ria  and  the  scaling  function  f(k/kd)  —  »  C,  for  small 
k/k^.  The  general  form  of/  (£774)  is  not  known,  but  the  consensus  from  experiments  [2,13] 
and  direct  numerical  studies  [59]  is  that,  in  the  far  dissipation  range,  i.e.,  for  large  k/kd, 
E(k]  ~  fc<5exp(—  ck/kd),  a  form  first  suggested  by  Kraichnan  [60],  The  exponent  8  and 
the  constant  c  are  not  easy  to  determine;  the  best  numerical  studies  [59]  yield  6  ~  3.3  and 
c  ~  7.1.  It  is  also  worth  noting  that  /(fc/fcd)  does  not  decay  monotonically  with  k/k&\ 
rather,  it  increases  mildly  in  the  inertial  range  and  then  starts  decreasing  in  the  dissipation 
range  leading  to  a  slight  pile  up  in  the  energy  spectrum  in  the  crossover  regime.  This  has 
been  referred  to  as  a  bottleneck  phenomenon  [61]. 

Soon  after  the  introduction  of  ESS  it  was  pointed  out  by  Stolovitzky  and  Sreenivasan 
[22]  that  ESS  plots  cross  over  from  their  inertial-range  asymptotic  behaviour  to  another 
asymptotic  behaviour  in  the  far  dissipation  range  that  is  characterized  by  a  steeper  power 
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law  (cf.  (18)).  This  has  not  been  explored  experimentally  in  the  GESS  context.  We  will 
study  this  in  detail  in  §  3  in  the  context  of  our  wave-vector-space  (henceforth  k  space) 
generalizations  of  ESS  and  GESS. 

2.3  Probability  distributions 

The  third  category  of  measurements  yield  probability  distributions  for  quantities  such  as 
the  velocity,  components  of  velocity  differences  (i.e.,  £v,-  =  [v,-  (x  +  r)— v,-  (x)]),  etc. 


n  i  i  |  t  t  i  i  |  ii  i  -i  i  i  i  i  i  i  i  i  i  i  I 
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Figure  3.  PDFs  of  velocity  differences  for  separations  r  within  the  inertial  range  for 
(a)  wind-tunnel  data:  r/r)A  =  37.5  (o),  351  (a),  and  1870  (A);  (b)  atmospheric  surface 
layer:  r/r)d  =  61.3  (o),  408  (a),  and  2900  (A).  The  solid  lines  correspond  to  best  fits 
which  are  of  the  exponential  form  (after  Praskovsky  and  Oncley  [62]). 
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Figure  4.  PDFs  of  band-pass-filtered  signals  with  fc  =  (a)  0.07  kHz,  (b)  0.15  kHz, 
(c)  0.70kHz,  (d)  3.0kHz,  and  (e)  10kHz  (see  text).  Solid  curves  show  Gaussian 
distributions  (after  Katsuyama,  Horiuchi  and  Nagata  [34]). 


There  is  considerable  evidence  [62]  that  the  distribution  of  velocity  differences  is  not 
Gaussian  at  small  length  scales  in  fully  developed  turbulence:  Experimental  data  show 
that  the  probability  distribution  functions  (PDFs)  of  velocity  gradients  over  small 
distances  (i.e.,  large  wavenumbers)  have  a  tail  which  falls  more  slowly  than  that  of  a 
Gaussian  distribution  [62,63].  It  is  often  said  [64]  that  such  non-Gaussian  behaviour  is 
associated  with  the  presence  of  small-scale  coherent  structures  which  lead  to 
intermittency.  Whereas  this  might  well  be  true,  it  is  worth  noting  [8]  that  even  K41 
yields  a  non-Gaussian,  stretched-exponential  distribution  for  the  distribution  of  velocity 
gradients;  however,  the  stretching  predicted  by  K41  is  less  than  that  observed 
experimentally.  Figure  3  shows  a  plot  of  the  PDFs  of  velocity  differences  obtained  by 
Praskovsky  and  Oncley  [62]  from  wind-tunnel  experiments  and  from  the  atmospheric 
surface  layer;  they  fit  well  to  an  exponential  form.  However,  a  general  theoretical 
understanding  of  the  precise  relation  between  small-scale  coherent  structures  and  such 
PDFs  is  still  lacking. 

Katsuyama,  Horiuchi  and  Nagata  [34]  have  studied  the  behaviour  of  PDFs  obtained 
from  a  band-pass-filtered  velocity  signal.  In  this  study,  the  velocity  time  series,  recorded 
in  wind-tunnel  experiments,  was  band-pass  filtered  to  obtain  the  record  V(t\  /c),  where  fc 
is  the  midband  frequency  of  the  filter.  The  PDFs  of  V(f,  /c)  are  Gaussian  at  small  fc,  but 
decay  more  slowly  than  Gaussian  distributions  as  fc  is  increased  (figure  4).  A  number  of 
authors  had  suggested  earlier  [63, 65]  that  such  PDFs  can  be  approximated  by  stretched 
exponentials  of  the  form  P(V)  oc  exp(— c^Vl"1).  For  their  band-pass-filtered  signals, 
Katsuyama  et  al  [34]  measured  the  stretching  exponent  m  and  found  that,  for  small /c, 
corresponding  to  integral-range  time  scales,  m  ~  2,  i.e.,  a  Gaussian  PDF;  as  fc  was 
increased  (times  corresponding  to  inertial-range  timescales),  m  started  falling  below  2. 


This  fall  became  more  and  more  significant  tor  mgn  /c's,  corresponding  10  aissipauun- 
range  timescales. 

Katsuyama  el  al  used  another  method  for  characterising  the  deviation  of  the  PDF 
V(t\  fc]  from  a  Gaussian  PDF:  They  calculated  the  normalized  moments 


where  (2p  -  1)!!  =  (2p  -  l)(2p  -  3)  •  •  •  3.1.  For  a  Gaussian  distribution,  72^  =  0.5  for 
all  p.  Even  a  very  weak  deviation  from  a  Gaussian  distribution  becomes  apparent  if  we 
calculate  72p  for  sufficiently  large  p.  The  band-pass-filtered  data  show  that  72p  goes 
slightly  below  0.5  for  /c's  corresponding  to  inertial-range  timescales.  This  is  followed  by 
a  more  rapid  decay  in  dissipation-range  timescales  and,  as  anticipated,  the  deviation  from 
Gaussian  behaviour  is  clearly  more  prominent  for  large  p. 

Recall  that,  in  his  1962  theory  [42],  Kolmogorov  assumed  that  er/(e)  has  a  log-normal 
distribution.  This  is  not  substantiated  by  experiments,  which  yield  data  that  can  be  fit  [50] 
to  a  stretched-exponential  form  with  a  stretching  exponent  of  1/2;  however,  this  latter 
form  violates  the  Carleman  criterion,  i.e.,  such  a  PDF  is  not  determined  uniquely  by  its 
moments  [2,  66].  It  has  also  been  shown  recently  [55,  67]  that  the  SL  assumptions  lead  to 
distributions  in  the  log-Poisson  class.  We  are  not  aware  of  a  detailed  experimental 
verification  of  this. 

3.  Deterministic  models  for  the  study  of  homogeneous  isotropic  turbulence 

The  tasks  before  a  theory  of  homogeneous,  isotropic  turbulence  are  many.  Given  the 
perspective  adopted  in  this  paper,  these  must  include  an  understanding  of  the  multiscaling 
behaviours  of  the  structure  functions  and  probability  distributions  defined  in  §  2.  It  is  also 
important  to  elucidate  the  role,  if  any,  of  coherent  structures  in  determining  these 
properties.  At  the  moment  we  are  very  far  from  this  goal.  However,  progress  has  been 
made  over  the  past  decade  by  using:  (a)  approximate  analytical  methods  to  study 
stochastic  models,  which  are  discussed  in  the  companion  paper  by  Bhattacharjee  [20]; 
and  (b)  direct  numerical  studies  of  deterministic  models.  We  concentrate  on  the  latter  in 
this  section,  which  is  organised  as  follows:  It  begins  with  an  introduction  to  direct 
numerical  solutions  of  the  Navier-Stokes  equation  and  the  GOY  shell  model,  which  are 
used  in  studies  of  homogeneous,  isotropic  turbulence.  This  is  followed  by  a  brief 
overview  of  what  has  been  learnt  about  intermittent  structures,  the  scaling  of  structure 
functions  (including  ESS  and  GESS),  and  various  PDFs  from  such  studies.  Our  emphasis 
is  on  showing  how  studies  of  both  the  Navier-Stokes  equation  and  reduced  models  like 
the  GOY  shell  model  can  augment  each  other  as  we  have  shown  in  recent  studies  [89]  of 
fe-space  ESS  and  GESS. 

3.1  Models 

The  model  of  choice  for  the  study  of  fluid  turbulence  is,  of  course,  the  deterministically 
forced  Navier-Stokes  (NS)  equation  for  an  incompressible  fluid.  If  we  restrict  ourselves 
to  low-Mach-number  flows  and  cases  where  the  equation  for  the  energy  density  can  be 
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Table  1.  Parameters  v  (viscosity),  i^  (hyperviscosity),  ReA  (Taylor-microscale  Reynolds 
number),  re  (box-size  eddy-turnover  time),  rav  (averaging  time),  rt  (transient  time)  and  k^ 
(dissipation-scale  wavenumber)  for  our  3d  NS  runs  NS1-4  (L  =  TT,  &max  =  64)  and  GOY-model  runs 
Gl-8  (£max  =  222/t0).  The  step  size  (8t)  used  is  0.02  for  NS  1-4,  10~4  for  Gl-4,  and  2.10~5  for  G5-8. 
In  our  NS  runs  we  estimate  k&  =  r]^]  by  solving  the  nonlinear  equation,  obtained  by  balancing 
dissipative  and  convective  time  scales,  z^J2  +  ^h^4  —  (6)  *ld  •>  which  reduces  to  the  formula 
given  above  for  77^  if  the  hyperviscosity  v^  —  0. 

Run       v  VH         ReA  Te/6t  7i/re      Tav/re        &maxAd 


NS1 

5, 

.io-4 

0 

— 

3.5 

-3.104 

-  1 

2 

-4 

NS2 

2. 

,io-4 

0 

— 

8 

-3.104 

—  1 

-2.5 

—  2 

.3 

NS3 

5. 

,10~4 

5.10~6 

— 

3.5 

-3.104 

—  1 

-  1 

—  6 

.5 

NS4 

5. 

,io-4 

1Q-6 

~ 

22 

-3.103 

~  10 

-7 

—  2 

Gl-4 

5. 

,io-6-  i 

0~7    0 

4. 

104-3.105 

^(1.5- 

2.0)  IO4 

-500 

-2500 

—  2 

5  _  2* 

G5-8 

5. 

,10~8-1 

0~9    0 

3. 

5.105-3.106 

-  (0.7  - 

1)105 

-500 

-2500 

—  2 

:J-  1 

neglected,  the  NS  equation  is 

a,v+(v-V)v  =  i/V2v-V/>/p  +  f,  (24) 

where  v  is  the  velocity  field,  P  the  pressure,  p  the  density,  v  the  kinematic  viscosity,  f  the 
external,  deterministic  force,  and  incompressibility  is  enforced  by  the  condition 

V-v  =  0.  (25) 

Equations  (24)  and  (25)  must  be  solved  with  suitable  boundary  conditions  and  initial 
conditions;  periodic  boundary  conditions  are  convenient  in  studies  of  homogeneous, 
isotropic  turbulence.  At  high  Re,  little  can  be  said  analytically  about  the  statistical  steady 
state  of  this  driven  equation  beyond  the  phenomenological  theories  described  in  §  2  or 
by  using  stochastic  models  [20];  one  must,  perforce,  obtain  information  about  this 
steady  state  from  direct  numerical  solutions  (DNS).  In  studying  the  properties  of 
homogeneous  and  isotropic  turbulence,  it  is  sometimes  convenient  to  work  in  k  space; 
here  the  equation  is, 


d;^(p)  +  iM,jk(p)        ty(q>*(p  -  q)  =  -vp2Vi(p)  +//(p),  (26) 

q 

with  p  and  q  denoting  wave  vectors,  Mijk(p)  =  [pjPik(p]  +pkPij(p]}/2,  PIJ(P]  = 
[Sij  -pipj/p2}  the  transverse  projector,  and  Fourier  transforms  are  implied  by  wave- 
vector  arguments.  Typically  one  uses  periodic  boundary  conditions  and,  if  a 
hyperviscosity  VH  (table  1)  is  used,  vp2  in  (26)  is  replaced  by  (v  +  ^//p2)/?2.  In  numerical 
studies  it  is  advantageous  to  use  a  pseudo-spectral  scheme  that  evaluates  certain  terms  in 
real  space  and  others  in  k  space.  Such  studies  began  with  the  pioneering  work  of  Orszag 
and  Patterson  [68].  Subsequent  studies,  carried  out  by  many  groups,  include 
refs.  [28,  52,  53,  59,  69-76].  We  follow  Meneguzzi  and  Vincent  [73]  for  our  NS 
simulation. 

In  spite  of  rapid  advances  in  computing  power,  direct  numerical  studies  of  the  NS 
equation  remain  daunting  at  high  Re.  The  largest  studies  have  used  5123  Fourier 
modes  for  the  velocity  field  and  achieved  a  Taylor-microscale  Reynolds  number 
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ReA  =  vmsX/v  ~  220;  here  the  Taylor-microscale  is  [73]  A  =  [/ E(k}dk/f  k2E(k}dk]l/2 
and  vms~[2f^E(k}dk}l/2  is  the  root-mean-square  velocity.  Larger  studies  are  not 
feasible  yet  because  computational  costs  scale  as  Re9/4,  for  storage  requirements,  and 
Re1/2  for  the  number  of  arithmetic  operations.  Important  time  scales  include  the  integral 
and  box-size  eddy-turnover  times  defined,  respectively,  as  r,  =  L//vTms  and  re  =  L/vrrns, 
where  L,  =  [/ k~lE(k)dk/ / E(k}dkf/2  is  the  integral  scale  and  L  is  the  box  size  .  For  our 
four  runs  for  the  NS  equation  (table  1)  re  ~  817;  we  report  all  times  in  terms  of  the 
former. 

Given  these  computational  constraints  it  is  useful  to  work  with  simplified  models  that 
are  physically  motivated  and  capture  as  many  aspects  of  fluid  turbulence  as  possible. 
Though  such  models  must,  perforce,  make  several  approximations  (see  below),  they  can 
allow  us  to  explore  the  large-Re  regime,  which  is  crucial  for  the  resolution  of  many 
questions  about  the  asymptotic  behaviours  of  structure  functions,  etc.  The  most 
successful  models  of  this  type  are  shell  models.  Of  these  the  one  due  to  Gledzer  [77]  and 
Ohkitani  and  Yamada  [78],  infelicitously  called  the  GOY  model,  has  been  studied  most 
over  the  past  five  years;  and,  as  we  show  below,  the  GOY  model  yields,  in  spite  of  its 
simplicity,  results  that  are  not  only  physically  instructive  but  also  in  remarkably  good 
agreement  with  experiments. 

The  Richardson-cascade  picture  suggests  that  the  scaling  behaviour  seen  in  fluid 
turbulence  might  well  arise  in  a  simplified  dynamical  system  with  a  reduced  number  of 
degrees  of  freedom  arranged  hierarchically.  This  key  idea  motivated  the  seminal  studies 
of  Obukhov  [79],  Gledzer  [77],  and  Desnyansky  and  Novikov  [80],  who  postulated  shell 
models  quite  similar  to,  though  not  as  succesful  as,  the  GOY  model,  on  which  we 
concentrate  here.  Other  more  complicated  shell  models  are  mentioned  briefly  below;  we 
refer  the  reader  to  ref.  [8]  for  a  more  extensive  coverage  of  these. 

Such  shell  models  must  have  the  symmetries  of  the  NS  equation.  Unfortunately,  it  is 
not  clear  whether  we  know  all  the  relevant  symmetries;  but  one  can  at  least  ensure  the 
essential  conservation  laws  (of  energy  and  helicity)  hold  in  the  inviscid  and  unforced 
limit.  Furthermore,  to  attain  much  larger  Reynolds  numbers  than  is  possible  in  direct 
numerical  simulations  of  the  NS  equation,  the  "shells"  in  k  space  are  logarithmically 
spaced.  In  the  GOY  model  they  are  the  one-dimensional  set  of  wave  vectors  kn,  labelled 
by  shell  numbers  n: 

ft^ftbA",     n=l,2,...,AT,  (27) 

where  A  is  the  ratio  between  wave  vectors  in  nearest-neighbour  shells,  fc0  sets  the  scale  of 
wave  vectors,  and  N  is  the  total  number  of  shells.  The  dynamical  variables  are  the 
complex  scalar  velocities  vn  associated  with  the  shells;  these  can  be  considered  to 
represent  the  Fourier  transform  of  the  velocity  characterising  an  eddy  of  scale  ~  k~l .  The 
evolution  equation  for  vn  is 

d  , 

fcVn  +  vk-vn=Fn+iCn,  (28) 

where  i/  is  the  kinematic  viscosity,  Fn  is  a  forcing  term,  and  Cn  is  the  nonlinear  coupling 
between  different  shells.  Thus  the  NS  equation  is  approximated  by  a  dynamical  system 
comprising  2N  ordinary  differential  equations  (each  vn  has  both  real  and  imaginary 
parts).  Rigid  boundary  conditions  are  imposed  on  the  <s,  i.e.,  vn  =  0  forn  <  1  and 
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i  >  N.  The  forcing  term  has  a  constant  amplitude  and  is  typically  chosen  to  act  on  a 
jarticular  shell  n',  i.e., 

Fa=F06a^}  (29) 

•vith  n'  the  index  of  the  shell  on  which  the  force  acts.  As  described  in  earlier  studies 
78,  81-85],  the  usual  choice  is  n'  =  4,  but  Kadanoff,  Lohse  and  Schbrghofer  [86]  note 
hat  the  choice  n'  =  1  (which  we  adopt  in  our  dynamical  studies  below)  does  not  change 
:he  behaviour  of  the  model  in  the  static  case.  To  mimic  the  advection  terms  in  the  NS 
equation,  the  nonlinear  term  Cn  in  (28)  should  have  terms  like  knvn'Vn»;  but,  since  the 
Rjchardson-cascade  picture  suggests  that  an  eddy  of  a  particular  length  scale  interacts 
significantly  only  with  eddies  of  nearby  scales,  such  interaction  terms  are  chosen  to  be 
.ocal  in  k  space  (i.e.,  n1  and  n"  are  chosen  to  be  close  to  n).  The  GOY  form  of  Cn  is 

Cn  =  anknv*n+lv*n+2  +  &„£„_!  v*_!U*+1  +  c,,^l,_2i;*_1i;*_2,  (30) 

with  ayy_i  =  UN  =  b\  =  b/y  —  c\  =  c^  =  0;  thus  the  velocity  of  a  shell  is  affected  only  by 
:he  velocities  of  its  nearest-  and  next-nearest-neighbour  shells.  To  specify  Cn  completely 
*ve  must  fix  the  constants  an,bn,cn.  The  first  constraint  is  that  the  kinetic  energy 
E  ~  Y^n  K|2/2  must  be  conserved  in  the  inviscid  (y  =  0)  and  unforced  (Fn  —  0  for  all  rc) 
limit.  This  yields,  an  •+- bn+\  +  cn+2  =  0.  These  coefficients  are  defined  modulo  a 
multiplicative  factor,  so  that  one  can  fix  an  =  1  without  loss  of  generality,  whence 

an  =  l;     bn  =  -6;      cn  =  -l+6.  (31) 

The  parameter  8  does  not  have  an  analogue  in  NS  dynamics,  but  it  plays  an  important  role 
in  the  GOY  shell  model  since  it  determines  the  ratio  between  the  backward  (towards 
smaller  k)  and  forward  (towards  larger  k)  energy  cascades  as  shown  by  Biferale,  Lambert, 
Lima,  and  Paladin  [87].  The  choice  A  =  (1  -  8}~~l  (in  most  studies  5=1/2  and  A  =  2), 
in  addition  to  conserving  the  kinetic  energy,  also  conserves  H  =  5Z(— )n&n|i>H  in  the 
inviscid  and  unforced  limit  [88].  Kadanoff,  Lohse,  Wang  and  Benzi  [84]  have  noted  that 
H  is  the  GOY-model  analogue  of  the  helicity,  which  is  conserved  in  the  inviscid,  unforced 
NS  equation.  Note  also  that  the  term  Cn  in  the  GOY  model  conserves  the  phase-space 
volume,  i.e.,  the  total  volume  in  the  2W-dimensional  velocity  space,  since  dCn/dvn  =  0. 
And,  as  in  the  NS  equation,  the  GOY  model  is  forced  near  small,  k,  but  the  dissipation 
occurs  principally  at  large  fc's,  which,  along  with  the  nonlinear  coupling,  ensures  a 
cascade  of  energy  from  small  to  large  wavenumbers.  Of  course  the  GOY  shell  model  is 
not  derivable  from  the  NS  equation  in  any  obvious  way.  But  its  simplicity  and  numerical 
tractability  have  inspired  quite  a  few  studies,  principally  numerical  simulations,  some  of 
which  we  discuss  below.  Some  limitations  of  the  GOY  shell  model  are  discussed  towards 
the  end  of  this  section. 

It  is  easy  to  see  that  the  Kolmogorov  scaling  relation  vn  ~  k~1/3  for  the  GOY  model  is 
a  solution  of  the  static  problem  in  the  inviscid  and  unforced  limit.  It  has  been  shown  [87] 
that  static  solutions  of  the  GOY  model,  in  the  inviscid,  unforced  limit,  are  the  fixed  points 
of  the  map  qn  =  6/2  +  (1  —  8)/4qn_i,  where  qn  =  vn+3/vn.  The  K41-type  solution, 
vn  ~  &"1/3,  corresponds  to  the  fixed  point  q  =  1/2.  For  the  dynamic  but  unforced  and 
inviscid  case,  this  K41  fixed  point  becomes  unstable  (for  8  =  1/2,  see  below),  but 
continues  to  play  a  crucial  role  for  the  dynamics  with  finite  viscosity  and  forcing.  Before 
embarking  on  a  dynamical  study  of  the  GOY  model,  it  is,  therefore,  useful  to  understand 
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the  nature  of  its  static  solutions.  We  refer  the  reader  to  refs  [87,92]  for  detailed 
discussions  of  such  solutions. 

Though  the  GOY-model  equations  look  simple,  they  cannot  be  solved  analytically. 
Thus  one  must  resort  to  direct  numerical  studies,  which  must  be  carried  out  carefully 
since  the  equations  form  a  stiff  system.  We  follow  the  numerical  scheme  of  ref.  [83]. 

In  addition  to  the  choice  of  the  constants  a,b,  c  and  A  given  in  eq.  (31),  we  must 
specify  the  forcing  term  (29).  We  choose:  FQ  =  5(1  +  i}W3 ,  «'  =  1,  i.e.,  we  force  the 
first  shell,  not  the  fourth  as  in  most  earlier  studies  [81-84].  However,  we  have  checked 
explicitly  that  the  scaling  behaviour  of  the  model  is  the  same  with  n'  —  4  and  n'  —  1.  The 
advantage  of  using  n'  =  1  is  that  the  inertial -range  is  broader,  so  we  obtain  better 
estimates  of  the  inertial-range  exponents.  Furthermore,  we  set  &0  =  2~4  and  work  with 
N  =  22.  Since  we  set  A  =  2,  this  leads  to  a  wavenumber  range  of  2~3  <  kn  <  218.  Earlier 
studies  [82-84]  have  shown  that  the  choice  N  —  22  leads  to  a  broad  wavenumber  regime, 
which,  depending  on  Re,  yields  sizeable  inertial  and  dissipation  ranges.  Data  for  our 
GOY-model  runs  are  given  in  table  1. 

Biferale  and  Kerr  [90]  have  pointed  out  that  the  GOY  shell  model  has  an  apparent 
asymmetry  between  the  odd  and  even  shells.  This  does  not  have  any  counterpart  in  NS 
dynamics.  To  overcome  this  they  have  proposed  two  velocity  fields  per  shell  that 
transport  positive-  and  negative-helicity  components,  respectively.  Studies  of  such  models 
are  in  their  infancy  and  we  refer  the  reader  to  ref.  [90]  for  details.  Even  more  realistic  are 
the  hierarchical  shell  models  studied  by  Eggers  and  Grossmann  [95]  and  Grossmann  and 
Lohse  [96].  Their  scheme,  also  referred  to  as  the  reduced  wavevector  approximation 
(REWA),  will  be  described  briefly  below. 

We  are  now  in  a  position  to  discuss  some  of  the  results  that  have  been  obtained  by 
direct  numerical  studies  of  the  NS  equation  and  the  GOY  model.  We  begin  with 
visualisations  of  intermittent  structures,  then  discuss  structure  functions,  ESS,  and  GESS, 
and  end  with  probability  distributions. 

3.2  The  visualisation  of  intermittent  structures 

Siggia  [52]  was  the  first  to  provide  numerical  evidence  that,  at  large  Re,  regions  of  large 
\u\  (where  the  vorticity  u  =  V  x  v  )  are  fairly  long-lived,  tube-like  structures.  The 
existence  of'these  filamentary,  high-vorticity  tubes  has  been  confirmed  by  subsequent 
high-resolution  runs  [53, 69,71,73].  As  the  value  of  |w|  in  an  iso-ju;  plot  is  decreased,  the 
vorticity  tubes  thicken  until  they  eventually  become  sheet-like.  This  is  illustrated  in 
figure  5,  which  has  been  obtained  from  our  relatively  low-resolution  pseudo-spectral 
simulation  that  uses  643  Fourier  modes.  In  figure  5  we  superimpose  the  velocity  field 
(white  arrows)  on  the  iso-|u>|  surfaces  and  in  their  vicinities;  the  circulation  of  the  velocity 
vectors  is  as  expected  around  regions  of  high  vorticity.  She  et  al  [53]  have  noted  that  the 
velocity  field  around  such  vortex  tubes  exhibits  swirling  motion  with  a  sizeable 
component  along  the  local  vortex-tube  axis,  which  might  lead  to  a  local  weakening  of 
nonlinear  effects  [2,93].  The  viewing  angles  in  these  photographs  have  been  chosen  to 
display  the  tubular  nature  of  the  filaments  clearly. 

The  existence  of  such  filamentary  structures  is  a  crucial  input  into  the  She-Leveque 
model  (§  2).  Since  this  model  relies  on  a  set  of  assumptions  for  the  small-scale  behaviour 
of  the  energy  dissipation  e,  it  is  interesting  to  check  whether  the  existence  of  vorticity 


Figure  5.  Iso-|u;|  surfaces  with  superimposed  velocity  vectors  (at  points  near  these 
surfaces)  obtained  from  an  instantaneous  snapshot  of  the  velocity  and  vorticity  fields 
from  our  run  NS4  (table  1).  The  iso-|w|  surfaces  go  through  points  with  fixed 
Wl/Hmax  ~  ^>  where  w|max  is  the  maximum  value  of  \u\;  the  velocity  vectors  are 
shown  by  white  arrows  whose  lengths  are  proportional  to  their  magnitudes  and  whose 
heads  are  too  small  to  be  seen.  (The  small  red  arrowhead  is  a  pointer  that  should  be 
ignored  here  and  in  all  subsequent  photographs.)  The  red  surface  (top  plate)  goes 
through  points  with  O  ~  0.31  and  the  orange  surface  (bottom  plate)  goes  through 
points  with  O  ~  0.12;  this  illustrates  the  thickening  of  the  vorticity  filaments  with 
decreasing  \u\.  These  are  perspective  views  with  the  simulation  box  shown  by  a  white 
boundary;  we  use  periodic  boundary  conditions. 
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Figure  6.  Superimposed  iso-|u;j  and  iso-e  surfaces  obtained  from  an  instantaneous 
snapshot  of  the  vorticity  and  dissipation  fields  from  our  run  NS4  (table  1).  The  cream- 
coloured  surfaces  go  through  points  with  fixed  H/Hmax  =  O  whereas  blue-green 
surfaces  go  through  points  with  fixed  e/emax  =  £  (the  subscript  max  denotes 
maximum  value).  Here  O  =  E  =  0.5.  The  top  figure  (a)  is  a  rotated  version  of  the 
bottom  one  (b).  The  iso-|u|  surfaces  clearly  show  that  high-vorticity  regions  are 
filamentary;  however,  the  iso-e  surfaces  seem  more  like  shredded  sheets  or  blobs. 
Also,  high-vorticity  and  high-dissipation  regions  do  not  coincide.  These  are 
perspective  views  with  the  simulation  box  shown  by  a  white  boundary;  we  use 
periodic  boundary  conditions. 
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Figure  7.  The  same  as  figure  6  but  with  O  =  £  ^  0.33,  so  the  vorticity  tubes  are 
thicker;  however,  the  iso-e  surfaces  still  seem  like  shredded  sheets  or  blobs.  The  top 
figure  (a)  is  a  rotated  version  of  the  bottom  one  (b). 


obtained  from  an  instantaneous  snapshot  of  the  real-space  velocity  field  in  our  numerical 
study  of  the  NS  equation  (run  NS4  in  table  1);  the  cu  and  e  fields  follow  from  this  velocity 
field.  In  figures  6  and  7  the  cream-coloured  iso-|w|  surfaces  show  well-formed  tubes  for 
the  |w  values  used.  The  blue-green  iso-dissipation  surfaces  show  clearly  that  regions  of 
large  e  are,  by  contrast,  like  small  shredded  sheets  or  blobs  (cf.  [52]),  which  have  no 
overlap  with  the  high-vorticity  tubes.  Nor  does  their  codimension  seem  to  be  the  same  as 
that  of  the  vortex  filaments.  (Cream-coloured  surfaces  go  through  points  with  fixed 
H/Hmax  —  ®  whereas  blue-green  surfaces  go  through  points  with  fixed  e/emax  =  £; 
and,  to  make  the  comparison  of  these  surfaces  as  meaningful  as  possible,  we  use  equal 
ratios,  i.e.,  O  =  £.)  The  precise  implications  of  this  for  the  derivation  of  the  SL  formula 
(§  2)  are  unclear.  If  its  success  in  predicting  £p  is  any  clue,  there  might  be  a  way  of 
deriving  it  without  using  the  precise  codimensions  of  high-dissipation,  as  opposed  to 
high-vorticity,  regions.  __  ^ 

It  is  debatable  whether  any  real-space  structures  can  be  obtained  reliably  from  the 
GOY  model,  which  is  defined  on  a  logarithmically  discretised  k  space.  Some  authors 
have  recently  proposed  [8]  a  procedure  for  generating  an  artificial  velocity  field  v(r,  t]  in 
three-dimensional  real  space  from  the  shell-model  velocity  fields.  The  first  step  is  to 
introduce  a  three-dimensional  wavevector  k  =  knen,  where  kn  is  the  wavenumber  of  the 
n-th  shell  and  the  en's  vectors  of  unit  norm.  Next  they-th  component  (j  =  x,y,z)  of  the 
real-space  velocity  field  is  constructed  through  some  kind  of  an  "inverse  Fourier 
transform" 


(r)C*-"+c.c.]1  (32) 

. 

where  the  coefficients  CJr  are  random  numbers  of  0(1)  and  c.c.  denotes  complex 
conjugate.  It  can  be  checked  that  the  resulting  -u/s  obey  the  incompressibility  condition 
(25),  since  the  coefficients  satisfy  ^]=\  C(nj)e(nj]  =  0,  for  all  n.  However,  it  remains  to  be 
checked  explicitly  whether  the  velocity  fields  calculated  by  using  such  a  prescription 
conform  with  those  seen  experimentally. 


3.3  Structure  functions,  ESS  and  GESS 

The  scaling  of  real-space  structure  functions  Sp(r)  has  been  studied  via  numerical 
simulations  of  the  NS  equation  by  several  groups  [71,73,76].  Agreement  with  K41  is 
good  for  p  <  3  and  E(k)  follows  the  -5/3  law  well  [73].  Some  numerical  studies  have 
found  that  the  RSH  (5)  holds  reasonably  well.  However,  the  most  recent  study  has 
cautioned  that  the  result  depends  on  which  surrogate  (see  §  1)  of  e  is  used.  Specifically,  if 
one  defines  eij  =  ^(dvj/dx,-)2,  then  the  RSH  holds  better  with  the  surrogate  e1  '  than  with 


e21,  in  agreement  with  recent  experiments. 
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As  regards  multiscaling,  data  for  £P  from  direct  numerical  studies  of  the  NS  equation 
[71,73]  show  significant  deviations  from  K41  forp  >  3;  however,  it  is  still  not  feasible 
computationally  to  attain  very  large  Re  A  (the  largest  so  far  [76]  seems  to  be  RCA  =216). 
Unfortunately  this  is  not  large  enough  to  settle  certain  issues  of  principle:  For  example, 
does  S(,p  =  p/3  —  Cp  — >  0  as  Re  — >  oo,  as  suggested  by  some  studies  [34-36]?  This  is 
best  done  by  using  simpler  models  like  the  GOY  shell  model  described  above;  of  course, 
one  must  then  contend  with  the  issue  of  the  relation  of  shell-model  and  NS  results,  which 
we  discuss  later. 

The  velocity  structure  functions  for  the  GOY  shell  model  are 

Sp(kn)  =  (\vHp).  (33) 

Since  the  kn's  in  the  GOY  shell  model  are  logarithmically  spaced,  the  GOY-model 
analogue  of  the  energy  spectrum  is  E(k]  =  \vn\2/kn  and  the  exponents  (,,,  are  given  by 

SP(kJ~tf>.  (34) 

*  It  is  natural  in  the  GOY  model  to  use  these  fc-space  structure  functions.  Since  we  want  to 

use  direct  numerical  simulations  of  the  NS  equation  and  the  GOY  model  in 
complementary  ways,  it  is  useful  to  study  &-space  structure  functions  in  the  NS  context. 
We  define  these  as 

Sp(k)  ~  <|v(k)|").  (35) 

We  use  the  convention  that  calligraphic  symbols  like  Sp  denote  real-space  structure 
functions  and  Roman  symbols  like  Sp  their  £-space  analogues  (not  Fourier  transforms).  In 
a  recent  study  we  have  provided  numerical  evidence  that  yields  [89] 

S,(*)~*-(c'+3p/2),  (36) 

for  k  in  the  inertial  range.  This  difference  between  GOY-model  and  NS  A:-space  structure 
jf  functions  arises  because  of  phase  space  reasons  (i.e.,  the  logarithmic  discretisation  of  k 

space  in  the  former  and  the  usual  discretisation  in  the  latter).  To  have  a  meaningful 
comparison  between  these  two  models,  we  also  define  GOY-model  analogues  of  the 
Taylor-microscale  and  the  root-mean-square  velocity  (see  above  for  the  3d  NS  equation): 

A  =  27T/M(£B  K|2)/(£^«H2)]1/2  and  v^  =  [(2£B  \vn\2)/(2*/h)]l/2,  which  can 
be  used  to  obtain  the  Taylor-microscale  Reynolds  number  Re,\  =  vm$\/v  ;  furthermore, 
we  use  (cf.,  [83])  E(k]  =  Sn^/kn.  The  time-scale  associated  with  the  smallest 
wavenumber  is  ~  k^ / \v\\\  when  averaged  over  the  steady  state  it  gives  the  large-scale 
eddy  turnover  time  re,  which  is  the  analogue  of  the  box-size  eddy-turnover  time  in  the  NS 
case.  For  our  GOY-model  runs  we  find  Re\  ~  i/~°-5,  as  expected  [102]  at  large  v~l .  We 
will  return  to  a  comparison  of  these  two  models  when  we  discuss  our  fc-space  versions  of 
ESS  and  GESS  [89]  in  inertial  and  dissipation  ranges  below. 

Our  GOY-model  runs  span  the  range  104<Re.\<3  x  106.  In  figure  8  we  show  the 
energy  spectrum  as  a  function  of  the  shell  number  for  our  runs  Gl-8.  (Care  must  be  taken 
to  ensure  that  the  coputational  runs  are  sufficiently  long  to  obtain  a  statistical  steady  state; 
see  table  1  for  our  run  parameters.)  Note  that  E(k]  is  plotted  on  a  logio  scale  and  the  shell 
number  is  proportional  to  the  logarithm  (base  2)  of  the  wavenumber.  The  slight  rise  in  the 
curves  near  n  =  1  is  a  vestige  of  the  forcing  in  the  first  shell.  The  straight-line  portion  of 
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Figure  8.  The  energy  spectrum  £(£„)  (plotted  on  a  log  base  10  scale)  as  a  function  of 
shell  number  (n),  for  the  runs  Gl-8  (table  1).  The  dashed  line  represents  the  /c~5/3 
form.  The  curves  for  large  values  of  v  (runs  G1-G3)  start  veering  off  from  the  —5/3 
form  near  the  onset  of  the  dissipation  range.  For  small  values  of  v  (runs  G6-8),  the 
dissipation  range  starts  beyond  n  =  22. 

the  plot  for  intermediate  k  marks  the  inertial  range  and  fits  the  expected  AT5/3  form  well. 
For  runs  Gl-3,  which  have  the  highest  viscosities  we  use,  E(k)  starts  falling  from  the 
fc~5/3  line  in  figure  8.  This  marks  the  beginning  of  the  dissipation  range.  Note  that,  for  the 
small-viscosity  runs  (G5-8),  only  the  beginning  of  this  crossover  is  visible  given  our 
wavenumber  range.  We  will  return  to  the  form  of  E(k)  in  the  dissipation  range  below. 

Many  authors  [82-84]  have  used  such  studies  to  show  that  GOY-model  structure 
functions  exhibit  significant  deviations  from  K41  for  p  >  3  and,  for  the  conventional 
choice  of  parameters  which  conserve  both  the  energy  and  the  helicity  in  the  inviscid, 
unforced  limit,  yield  exponents  (,p  in  good  agreement  with  experiments  and  the  SL 
formula.  (An  approximate  analytical  understanding  of  multifractality  in  the  GOY  shell 
model  may  be  obtained  by  a  closure  scheme  as  shown  by  Benzi,  Biferale  and  Parisi  [91].) 
These  studies  have  also  pointed  out  that  log-log  plots  of  Sp(kn]  versus  kn  show  an 
oscillatory  behaviour  that  is  a  GOY-model  artifact  related  to  an  underlying  3-cycle  in  the 
static  solution  [92].  In  figure  9  we  show  such  a  plot  for  different  values  ofp  from  our  run 
Gl.  The  increase  in  Sp  near  n=  I  arises  because  we  force  the  first  shell.  In  the  region 
4  <  n  <  14  the  plots  are  nearly  linear,  suggesting  the  algebraic  decay  of  Sp(kn}  in  the 
inertial  range.  The  dissipation  range  starts  around  n  ~  13  -  17  for  runs  G1-G3;  this  is 
marked  by  the  bend  in  the  structure  functions.  The  period-three  oscillations  mentioned 
earlier  show  clearly  in  the  inertial  range  of  figure  9.  We  have  obtained  C/s  by  fitting  the 
structure  functions  to  a  power-law  form  in  the  inertial  range.  As  can  be  seen  from 
figure  15,  the  fitting  errors  increase  with  increasing  p  because  the  period-three 
oscillations  are  stronger  in  high-order  structure  functions. 
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Figure  9.  A  plot  of  \ogS,,(kn)  versus  the  shell  number  n  for  the  run  Gl  and  for 
p  =  2, 4, 6, . . . ,  20  (from  top  to  bottom).  Note  the  period-three  oscillations  that  show 
clearly  in  high-order  structure  functions. 


Kadanoff  et  al  [84]  have  pointed  out  that  a  simple  way  of  removing  these  oscillations  is 
to  calculate  triple  moments  of  velocities;  the  most  effective  in  removing  these  oscillations 
is  the  structure  function 


(37) 


which  scales  like  Sp(kn]  in  the  inertial  range.  Here  G  denotes  the  imaginary  part  and  we 
use  8=1/2  and  A  =  2.  In  figure  10,  we  show  plots  of  log  Ep  (kn]  against  the  shell  number 
(n)  for  our  run  Gl  with  2  <  p  <  20;  note  that  the  period-three  oscillations  of  Sp(kn] 
(figure  9)  are  absent  and  the  inertial  scaling  region  is  a  few  octaves  larger  than  the  one  in 
figure  9.  A  comparison  of  the  exponents  (p  calculated  from  Sp(kn)  and  £„(&„)  is  shown  in 
figure  11.  The  latter  not  only  yields  estimates  with  smaller  error  bars  but  also  results 
much  closer  to  the  SL  prediction  (17).  Thus  we  use  £p(fcrt)  to  calculate  the  RBA 
dependence  of  £p  (see  below).  However,  for  consistency  we  use  Sp(kn}  when  we  compare 
our  GOY-model  results  with  those  of  our  direct  numerical  simulation  of  the  3d  NS 
equation. 

Before  presenting  our  study  of  6£p  we  note  that  a  similar  study  has  been  carried  out  by 
using  the  hierarchical  shell  model  studied  by  Eggers  and  Grossmann  [95]  and  Grossmann 
and  Lohse  [96].  Their  scheme  is  also  referred  to  as  the  reduced  wavevector  approxima- 
tion (REWA).  The  model  considers  the  velocity  field  associated  with  the  n-th  shell  to  be 
given  by  a  set  of  variables  vnj,  where  j  =  1, ... ,  Jn,  with  J  >  1.  Each  variable  vnj 
interacts  with  all  the  variables  in  the  n  ±  1  and  n  ±  2  shells.  The  resulting  set  of 
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Figure  10.  A  plot  of  log  £/,(&„)  (see  text)  versus  the  shell  number  n  for  the  run  Gl 
and  for  p  =  2, 4, 6, . . . ,  20  (from  top  to  bottom).  Note  that  the  curves  are  free  from  the 
oscillations  that  appear  in  the  structure-function  plots  (figure  9). 

differential  equations  has  many  more  nonlinear  terms  than  its  analogue  in  the  GOY 
model.  The  advantage  of  this  added  complexity  is  that  the  nonlinear  term  has  a  greater 
similarity  with  its  counterpart  in  the  NS  equation  (26).  The  REWA  model  has  many  more 
degrees  of  freedom  than  the  GOY  model,  so  its  numerical  solution  is  almost  as  difficult  as 
a  direct  solution  of  the  3d  NS  equation.  Of  course,  much  higher  values  of  Re  can  be 
achieved  for  the  REWA  models  than  for  the  3d  NS  equation.  We  refer  the  reader  to 
refs  [8,95,96]  for  details  of  this  model.  Grossmann,  Lohse,  L'vov  and  Procaccia  [35] 
have  used  the  REWA  model  to  cover  the  range  104  <  Re  <  107.  (In  these  simulations, 
RCA  ~  12-v/Re  as  shown  in  ref.  [96].)  From  their  plots  of  8£p  —  \£p  -  p/3\  versus  Re, 
these  authors  argue  that  the  theoretical  prediction  of  ref.  [36],  namely,  6£p  ~  Re~3/10, 
might  well  be  true  as  Re  — >  oo. 

We  have  tried  to  address  this  issue  for  the  GOY  shell  model.  Specifically,  we  have 
obtained  (p  and  hence  8£p  from  ^p(kn)  for  eight  different  Reynolds  numbers  (runs  Gl-8, 
table  1)  in  the  range  4  x  104  <  Re  A  3  x  106.  The  resulting  data  are  shown  in  the  plot  of 
6(,p  versus  Re^  in  figure  12;  the  SL  prediction  is  also  shown  for  some  values  of  p.  There 
seems  to  be  no  indication  of  a  decrease  in  8(,p  with  increasing  Re^;  if  anything,  there  is  a 
marginal  increase.  (We  have  checked  that  the  increase  in  8(p  at  the  highest  value  of  ReA 
(run  G8)  remains  on  choosing  a  smaller  time  step  and  larger  averaging  time  for  this  run.) 
This  result  is  in  contrast  with  the  conclusion  of  Grossmann  et  al  [96].  It  is  possible  that 
this  discrepancy  arises  because  of  some  feature  that  is  present  in  the  REWA  model  but  not 
in  the  GOY  shell  model.  Only  more  exhaustive  numerical  simulations  (ideally  of  the  3d 
NS  equation)  and  systematic  experimental  studies  will  be  able  to  decide  whether  the  trend 
shown  in  figure  12  actually  obtains  in  fluid  turbulence.  We  also  note  that,  on  the  basis  of 
their  static  analysis  of  the  GOY  model,  Schorghofer,  Kadanoff,  and  Lohse  [92]  have 
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Figure  11.  £p  versus  p  for  our  run  Gl  obtained  from  the  best  straight-line  fit  in  the 
inertial  range  by  using  (a)  Sp(kn}  and  (b)  T,p(kn}.  Error  bars  in  (a)  denote  the  fitting 
errors;  similar  error  bars  are  also  shown  in  (b)  but  are  not  apparent  since  they  are 
comparable  to  the  symbol  sizes.  The  K41  and  the  SL  predictions  are  also  shown. 


suggested  that  £p  should  have  an  oscillatory  dependence  on  log  v\  our  fully  dynamical 
calculation  (figure  12)  seems  to  be  in  agreement  with  this  suggestion. 

The  obvious  question  that  arises  is  whether  the  £p's  are  universal.  The  data  of  figure  12 
suggest  they  are  not.  This  has  also  been  found  by  Leveque  and  She  [97]  in  their  study  of 
the  GOY  model  in  the  presence  of  pure  hyperviscosity,  i.e.,  with  a  —vHkaH  dissipation 
and  an  not  necessarily  2:  their  Cp's  depend  on  a#.  Similar  results  have  been  obtained  by 
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Figure  12.  Log-log  plot  of  <5(p  versus  the  Taylor-microscale  Reynolds  number  ReA 
for  our  GOY-model  runs  (Gl-8)  with  p  =  6, 8, . . . ,  20  (from  bottom  to  top).  The  dotted 
(p  =  6)  and  dashed  (p  =  8)  lines  show  the  SL  results.  Error  bars  are  shown  but  are 
often  smaller  than  the  symbol  sizes. 


Borue  and  Orszag  [98]  in  a  direct  numerical  solution  of  the  NS  equation  with  pure 
hyperviscosity.  However,  this  nonuniversality  is  removed  [97]  if  one  uses  the  ratios  Cp/Cs- 
We  shall  return  to  this  point  in  our  discussion  of  fc-space  ESS  and  GESS  below. 

Before  presenting  our  numerical  studies  of  ESS  and  GESS,  we  wish  to  discuss  the  form 
of  E(k)  in  the  dissipation  range.  It  has  been  suggested  by  some  authors  [60, 99]  that  the 
energy  spectrum  in  the  far-dissipation  range  is  of  the  form 

r*  \  K    \    r^j    T  \  u   IK   il  PVnl /*(  K   i  K  t\      \  I      i  X  I 

JL-/  \  **•  y  J    \      /       Q  /  ^J\j-'  I         *•"  \  •*•  /  "vQ  /        3  V  / 

where  c  is  a  constant,  /  is  a  weak  function  of  fc/fed,  and  1  <  n  <  2.  In  their  direct 
numerical  simulation  of  the  NS  equation,  Chen  et  al  [59]  find  that  the  form 

is  consistent  with  their  data  with  a  ~  3.3  and  c  ~  7.1.  This  pseudo-spectral  simulation 
uses  2563  Fourier  modes  and  has  achieved  the  best  numerical  resolution  of  the  far 
dissipation  range  of  the  3d  NS  equation  to  date.  Our  low-Re^  studies  (table  1)  also  yield  a 
dissipation  range  in  which  E(k]  fits  such  a  form.  We  have  also  checked  explicitly  that  our 
data  for  63  in  the  GOY  model  can  be  fit  to  the  form 


k6  exp(— 


(40) 


in  the  dissipation  range  (run  Gl)  with  8  —  3.80  and  c/k&  =4.13  x  10~4.  (We  note  that 
such  fits  are  difficult  in  the  GOY  model  and  may  not  be  unique:  given  the  logarithmic 
sampling  of  k  space  ,  we  have  only  five  points  in  five  octaves  of  k.) 
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Figure  13.     Log-log  plots  for  Si2(k}  versus  S3(k)  for  our  runs  (a)  NS1  (the  inertial 
range),  (b)  NS1  (the  dissipation  range)  and  (c)  Gl.  The  inset  in  (c)  shows  a  magnified 

view  of  data  in  the  inertial  range.  The  straight  lines  represent  S\z  ~  S3"  .  Deviations 
from  this  line  show  clearly  for  data  in  the  dissipation  range  (large  k,  i.e.,  small  £3). 


In  recent  work  three  of  us  [89]  have  proposed  and  verified  a  fc-space  version  of  ESS 
and  GESS  in  the  inertial  range.  We  have  also  extended  the  applicability  of  ESS  and  GESS 
to  the  far  dissipation  range  to  uncover  intriguing  and  apparently  universal  crossover  from 
inertial  to  dissipation  ranges.  Our  &-space  ESS  postulate  is 


Sp  =  <|v(k)0 

Sp  =  (|v(k)0 


A 


(41) 


where  the  A/p  and  ADp  are,  respectively,  amplitudes  for  inertial  and  dissipation  ranges  and 
A~!  the  (molecular)  length  at  which  hydrodynamics  fails.  Note  that  our  postulate 
suggests  different  power-law  behaviours  in  inertial  and  dissipation  ranges;  the  real-space 
analogue  of  this  was  noted  by  Stolovitzky  and  Sreenivasan  [22],  who  used  experimental 
data,  soon  after  the  proposal  of  real-space  ESS  by  Benzi,  et  al  [32].  We  do  find  that  this 
way  of  presenting  the  data  extends  the  apparent  inertial  range  in  both  our  NS  and  GOY- 
model  studies,  so  the  expression  k— space  ESS  is  justified.  Representative  log-log  ESS 
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Figure  14.    A  plot  of  (,'  versus  p.  The  squares  indicate  data  from  our  run  NS4. 
The  line  is  £'  =  (2(p  +  3p)/ll  ,  with  CP  =  CpL  (see  text). 


plots  of  Sn(k)  versus  S3(k]  from  our  run  NS1  (table  1)  are  shown  in  figure  13:  here  panel 
(a)  covers  the  inertial  range  and  yields  the  inertial-range  power  £{2,  which  is  in  agreement 
with  the  SL  prediction;  panel  (b)  covers  the  dissipation-range  and  yields  a  power  a\2, 
which  deviates  noticeably  from  the  SL  prediction;  panel  (c)  illustrates  the  same  crossover 
from  inertial-  to  dissipation-range  asymptotic  behaviours  for  the  GOY  shell  model,  where 
we  can  resolve  much  larger  inertial  and  dissipation  ranges,  so  this  crossover  shows 
clearly.  Note  also  that  the  period-3  oscillations  in  figure  9  (structure  functions  for  the 
GOY  model)  are  removed  in  such  ESS  plots,  yielding  better  estimates  for  the  exponents. 
The  agreement  between  our  NS  and  GOY  results,  both  qualitative  and  quantitative  (see 
below),  indicates  that  this  crossover  is  real  and  not  a  numerical  artifact.  To  ensure  that  we 
have  good  statistics,  all  our  NS  runs  have  been  done  in  quadruple  precision  and  we  have 
checked  that  our  estimates  for  £p  and  ap  do  not  change  noticeably  when  we  double  our 
averaging  time  or  halve  our  integration  time  step.  Our  NS  runs  are  such  that  NS1,  NS2, 
and  NS3  resolve  enough  of  the  dissipation  range  (table  1)  to  yield  dissipation-range 
exponents  ap;  run  NS4  does  not  yield  many  points  in  the  dissipation  range,  so  we  do  not 
use  it  to  obtain  ap. 

In  (41)  the  inertial-range  exponent:  £'  =  £p,  for  the  GOY  model;  however,  we  find 
explicitly  (figure  14)  that,  for  the  3d  NS  case, 

(42) 

whence  we  get  (36),  i.e.,  Sp(k)  ~  k~^p+3PW ,  in  the  inertial  range.  To  our  knowledge  this 
result  for  Sp(k)  is  new.  Our  NS  runs,  though  restricted  to  relatively  low  Re  A  (;$  22), 
obtain  it  via  ESS  (41)  and  (3  =  1.  For  even  values  of  p  we  can  obtain  this  result  via 
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Figure  15.  A  plot  of  Cp  versus  p  for  the  runs  NS4  and  Gl .  The  K41  and  SL  lines  are 
also  shown.  For  NS4  we  obtain  £p  by  inverting  (47)  and  using  our  data  for  <^;  this 
inversion  also  magnifies  the  error  bars  (cf.  figure  14).The  estimates  for  these 
exponents  are  obtained  from  ESS  plots  like  figure  13  for  both  GOY  and  NS  runs. 
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Figure  16.  A  plot  of  ap  versus  p  for  the  runs  NS1-3  and  Gl.  The  K41  line  is  also 
shown.  The  estimates  for  ocp  are  obtained  from  the  dissipation-range  asymptotes  of 
ESS  plots  like  figure  13. 
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dimensional  analysis  if  we  Fourier  transform  the  real-space  structure  function  Sp  and 
make  one  numerically  plausible  assumption:  namely,  that  (vn(ki),  .  .  .  ,v^(kp))  is 
dominated  by  terms  in  which  the  km,  m  =  1,  .  .  .  ,p,  arguments  form  equal  and  opposite 
pairs  all  with  magnitude  &,  i.e.,  (vji(ki),  .  .  .  ,v;p(kp)}  ~  Sp(k)  [<5(ki  +  k2)  •  •  • 
6(kp_i  4-  kp)  +  permutations]. 

Both  (,p  (figure  15)  and  ap  (figure  16)  seem  universal,  to  the  extent  that,  given  our  error 
bars,  they  are  the  same  for  all  our  GOY-model  and  3d  NS  runs  (table  1).  £p  agrees 
reasonbly  well  with  the  She-Leveque  (SL)  formula  [51]  for  the  ranges  of  p  and  Re^  in 
figure  15;  and  ap  is  close  to,  but  systematically  less  than,  p/3.  Of  course,  figure  12 
indicates  that  the  GOY-model  exponents  have  a  mild  depedence  on  v\  however,  as  noted 
above,  this  nonuniversality  is  removed  in  our  ESS  and  GESS  plots  (see  below)  in  which 
only  ratios  of  the  exponents  C,p  appear. 

Given  our  fc-space  ESS  (41),  the  asymptotic  k  dependences  of  Sp(k)  in  inertial  and 
dissipation  ranges  follow  from  the  dependence  of  83  on  k.  We  find 


*a,  (43) 

S3  K  BDk6  exp(-c*/Jfcd)  ,     1  .Ska  <k  <  A,  (44) 

where  Bj  and  Bp  are,  respectively,  nonuniversal  amplitudes.  (Equation  (44)  holds  for  the 
3d  NS  equation  (see  above);  for  the  GOY  model  the  factor  9/2  is  absent.)  This  implies 
that,  in  the  far  dissipation  range, 

Sp  ~  kff"  exp(-cap/:/fcd),     kA  <  k  <  A,  (45) 

with  9P  =  ap8,  for  all  p.  This  form  is  not  easy  to  verify  numerically  for  large  p,  given  the 
rapid  decay  at  large  k.  To  the  best  of  our  knowledge  it  has  been  suggested  so  far  [59]  only 
for  S2. 

In  (44),  5,  c,  and  £d  are  not  universal.  Our  fe-space  ESS  shows  that  there  is  a  universal 
part  to  the  inertial-  to  dissipation-range  crossover.  We  now  show  that  this  universal 
crossover  can  be  extracted  best  by  a  &-space  version  of  GESS  that  three  of  us  have 
proposed  recently  [89].  This  holds  for  both  the  GOY  model  and  the  NS  equation.  (Real- 
space  GESS,  due  to  Benzi  et  al  [32],  has  been  discussed  in  §2.)  We  begin  by  defining 

G,=SP/(S3)P/3.  (46) 

Log-log  plots  of  Gp  versus  Gq  yield  curves  with  asymptotes  which  have  universal,  but 
different,  slopes  in  inertial  and  dissipation  ranges.  Figure  17  shows  a  representative  plot, 
with  both  GOY-model  and  NS  data,  for  p  =  6  and  q  =  9.  The  inertial-range  asymptote 
has  a  slope  p(p,  q)  given  by  (21);  it  is  easy  to  check  from  the  formulae  above  that  this  is 
the  same  as  the  slope  of  real-space  GESS  plots  [58]  in  the  inertial  range.  From  the  SL 
value  [51]  for  £,  (17)  we  can  obtain  the  SL  prediction  for  p(p,q);  the  resulting  inertial- 
range  asymptote  is  in  good  agreement  with  our  data  in  the  inertial-range  part  of  figure  17. 
From  our  £-space  ESS  and  GESS  ansatze  above,  it  also  follows  that  the  dissipation-range 
asymptote  has  a  slope 

w(p,  q)  =  [ap  -  P/3\/[ctg  -  g/3],  (47) 

where  ap  is  defined  in  (41)  and  shown  in  figure  16.  Note  that  GESS  plots  amplify 
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Figure  17.  Log-log  plots  for  (a)  Gg  versus  G9,  and  (b)  H9>6  versus  #6,9  (see  text)  for 
all  our  NS  and  GOY-model  runs.  The  data  collapse  in  (b)  yields  the  universal  scaling 
function  that  characterizes  the  crossover  from  the  inertial  to  the  dissipation  range,  for 
p  =  6  and  q  =  9  here. 

considerably  the  difference  between  inertial-  and  dissipation-range  asymptotes,  which  is 
not  very  prominent  in  ESS  plots;  this  amplification  can  be  estimated  analytically. 

In  GESS  plots  like  figure  17  the  slopes  of  these  inertial-  and  dissipation-range 
asymptotes  are  universal,  at  least  at  the  level  of  accuracy  of  figures  15  and  16;  but  the 
point  at  which  the  curve  in  a  GESS  plot  starts  deviating  from  the  inertial-range  asymptote 
depends  on  the  model  (GOY,  NS,  with  and  without  hyperviscosity,  etc.).  It  is  easily  seen, 
though,  that  the  following  transformation  yields  a  universal  crossover  scaling  function 
(this  curve  is,  of  course,  different  for  each  pair  of  values  of  p  and  q  because  of 
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multi scaling):  We  define 


(48) 

where  the  scale  factors  Dpq  =  Dqp  are  nonuniversal.  Now  plots  of  \og(Hpq)  versus 
\og(Hqp)  show  data  collapsing  onto  one  universal  curve  within  our  error  bars.  This 
is  illustrated  in  figure  17b  for  p  =  6  and  q  =  9;  we  emphasize  that  the  data  collapse 
occurs  for  all  GOY  and  3d  NS  runs  and  for  all  k  and  Rev  In  essence  this  transformation 
holds  one  of  the  GESS  plots  fixed  and  slides  those  of  other  models  till  the  asymptotes 
match. 

In  their  discussion  of  real-space  GESS,  Benzi  et  al  have  suggested  a  hierarchy  for  the 
Qp(r},  which  leads  to  the  SL  formula  if  one  uses  the  SL  boundary  conditions  mentioned 
above.  This  hierarchy  has  a  fc-space  analogue  [89],  namely 

r  i1'7 

=  [Gp/Gp_,fx  \\irn  Gp+i/Gp\      ,  (49) 

[p-KXD  J 

with  T3  =  2/3.  If  we  make  the  additional  assumption  Gp(k]  ~  k^p,  which  holds  in  the 
inertial  range,  we  get  a  difference  equation  for  j3p  identical  to  the  SL  one,  which,  when 
solved  with  the  boundary  conditions  /?o  =  /%  =  0  and  limp_>00(/?p+i  —  (3P)  =  2/9,  yields 
the  SL  formula  (17)  via  (7)  (our  /3P  =  ~Tp/3  there).  Elementary  manipulations  show 
that  our  &-space  GESS  gives  [Gp+,/Gp]  ~  [Gp/Gp_i]Tp,  with  Tp  =  (C^+i  -  (P  -  1/3)/ 
(C»  -  Cp-i  —  1/3)-  At  first  sight  this  seems  inconsistent  with  the  assumed  hierarchy  (49); 
however,  it  can  be  seen  to  be  consistent  if  Tp  =  7  —  2(1  —  7)/[9(£p  —  Cp-i  —  Cs/3)], 
which  turns  out  to  be  the  SL  difference  equation.  Thus  we  cannot  claim  to  have  checked 
either  of  the  assumed  hierarchies  (14)  or  (49)  directly.  Our  fc-space  GESS  can  certainly 
hold  with  CP  ^  C|L-  We  can  only  claim  that  the  hierarchy  might  hold  to  the  extent  that 
our  calculated  values  of  £p  agree  reasonably  with  those  predicted  by  the  SL  formula 
(figure  15). 

The  difference  between  inertial-  and  dissipation-range  asymptotic  behaviours  has  not 
been  noted  in  real-space  GESS  so  far.  As  we  have  mentioned  above,  experimental 
evidence  for  slope  differences  between  inertial:-  and  dissipation-range  asymptotes  in 
real-space  ESS  plots  was  given  by  Stolovitzky  and  Sreenivasan  [22].  They  postulated 
Sp  ~  S2f  in  the  dissipation  range  and  suggested 

(50) 

S9/2    r  ~>f  ^ 

Unfortunately,  we  have  not  been  able  to  find  a  simple,  direct  relation  between  their  real- 
space  exponents  a'p  and  our  fc-space  exponents  ap  (unlike  (42)  for  inertial-range 
exponents),  since  Sp  does  not  have  a  simple  power-law  dependence  on  k  in  the  dissipation 
range.  Furthermore,  in  this  range,  Gp  ~  k$>  exp(-c^/fcd),  so  there  is  no  analogue  of  the 
SL  formula  for  our  dissipation-range  exponents  ap.  Nonetheless  it  should  be  interesting 
to  study  the  apparently  universal  inertial-  to  dissipation-range  crossover  that  we  have 
elucidated  above  in  experimental  flows. 
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3.4  Probability  distributions 

Direct  numerical  simulations  have  obtained  data  for  various  probability  ditributions 
[64,73].  In  these  studies,  which  have  attained  R&\  —  150,  it  has  been  found  that  the 
distribution  of  velocity  components  is  close  to  Gaussian.  However,  the  distributions  of 
velocity  derivatives  such  as  dxvx  show  significant  deviations  from  Gaussian  behaviour, 
which  are  consistent  with  experimental  results.  This  deviation  is  even  more  visible  in  the 
distribution  of  derivatives  like  dyvx.  The  PDFs  of  such  velocity  derivatives  show  tails  that 
decay  even  more  slowly  than  tails  of  exponential  distributions. 

PDFs  of  velocity  differences  have  also  been  obtained  by  these  direct  numerical  studies 
[73].  The  distributions  found  are  similar  to  those  seen  in  experiments  [31,  62]  and  seem  to 
have  exponential  tails  for  small  separtions  r  ;  as  this  separtion  increases,  the  deviations 
from  Gaussian  character  decrease  till,  at  large  separtions  comparable  to  integral  scales, 
these  PDFs  approach  Gaussians. 

These  simulations  [73]  have  tried  to  check  the  log-normal  hypothesis  of  Kolmogorov 
[42]  for  the  distribution  of  the  energy  dissipation  rate  e.  Though  there  is  reasonable 
agreement  near  the  peak  of  the  distribution,  there  are  significant  deviations  from  this 
form  in  the  tails.  We  are  not  aware  of  any  direct  numerical  checks  of  the  log-Poisson 
class  of  distributions  that  follow  from  the  She-Leveque  assumptions  [55,  67].  Data  from 
our  Navier-Stokes  runs  would  not  be  good  enough  to  rule  out  other  possible  distributions. 
To  obtain  information  about  the  distribution  of  e  in  the  GOY  model,  one  must  transform 
to  real  space.  As  we  have  said  earlier,  the  logarithmic  discretisation  of  k  space  makes 
such  transformation  questionable  for  this  model,  so  we  restrict  ourselves  to  the 
distributions  of  Fourier  components  (see  below).  The  best  we  can  do  is  to  check  the 
analogue  of  She-Leveque  hierarchy  for  the  moments  of  e,  which  implies  a  distribution  of 
the  log-Poisson  class.  We  have  discussed  this  above  in  terms  of  Gp  (see  the  paragraph 
with  (49)). 

The  study  of  probability  distributions  in  shell  models  is  of  more  recent  origin. 
Non-Gaussian  behaviour  in  the  GOY  shell  model  has  been  studied  by  Biferale 
[100].  He  has  shown,  by  taking  a  Fourier  transform  of  vn,  that  PDFs  of  the  real- 
space  velocity  gradients  are  non-Gaussian  for  the  GOY  shell  model  and  not  far 
from  the  predictions  of  a  multifractal  theory.  Also,  the  deviations  from  Gaussian 
behaviour  increase  with  decreasing  spatial  separation,  in  qualitative  accord  with 
the  results  for  the  NS  equation  and  from  experiments.  One  might  question  whether 
Fourier  transforms  over  the  logarithmically  spaced  wavenumbers  in  the  GOY  model  can 
give  reasonable  real-space  information.  Thus,  in  our  study,  we  restrict  ourselves  to  k- 
space  quantities  and  look  for  possible  non-Gaussian  statistics.  The  REWA  models 
sample  fc-space  better  than  the  GOY  model.  These  have  also  been  used  to  obtain 
the  PDFs  of  velocity  differences  [101]  which  show  a  crossover  from  Gaussian 
behaviour,  at  the  largest  spatial  scales,  to  PDFs  with  increasingly  stretched  tails  at 
smaller  scales. 

In  analogy  with  the  experimental  studies  of  Katsuyarna  et  al  we  first  construct  &-space 
72/s  (cf.  (23))  for  the  GOY  shell  model: 

,     ,  =  2,3,4,...,  '  (51) 


2 
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Figure  18.  Plots  of  (a)  72?  and  (b)  T2p  versus  the  shell  number  n  for 
2p  =  4,6, ...,20,  for  our  run  Gl.  The  dashed  lines  indicate  72?  =  0.5  and 
r2p  =  0.5,  respectively.  One  can  see  that  the  plots  in  (b)  are  free  from  the  oscillations 
present  in  (a). 


which  can  be  rewritten  as 


(52) 


Note  that  the  midband  frequency  fc  in  (23)  has  been  replaced,  effectively,  by  the  shell 
number  n  here.  In  figure  18a  we  plot  -y2p  (calculated  via  eq.  (52))  as  a  function  of  n,  the 
shell  number,  for  run  Gl.  One  can  clearly  observe  the  deviation  of  72^  from  the  Gaussian 
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(c) 


Figure  19.  Inertial-range  behaviour  of  (a)  F4  (b)  FIO  and  (c) 
(figure  18).  Continuous  lines  represent  the  form  T2p  oc  \(2p  -  1)!! 
C's  calculated  from  the  SL  formula. 


for  run  Gl 
^  ,  with  the 


value  of  0.5  in  the  inertial  range  and  the  rapid  decay  of  72^,  in  the  dissipation  range 
(beyond  n  =  14).  However,  the  curves  show  oscillations  that  arises  because  of  the 
underlying  3-cycle  in  the  GOY  shell  model  (see  above).  To  eliminate  these  we  define  a 
new  function  ^  in  terms  of  Sp  ((37)): 


J2p 


=  2,3,4,.... 


(53) 


Figure  18b  shows  the  variation  of  F2P  (calculated  using  (53))  with  n  for  run  Gl.  As  one 
might  have  anticipated,  the  use  of  the  correlation  function  Ep  has  removed  the 
oscillations  so  the  deviations  from  Gaussian  behaviour  show  clearly. 

Since  we  know  that  Ep  ~ 
T2P  in  the  inertial  range,  i.e., 


in  the  inertial  range,  we  can  infer  from  (53)  the  form  of 


k< 


(54) 


To  specify  F2p  completely  in  the  inertial  range  we  must  also  provide  the  amplitudes  for 
the  power-law  dependence  of  Sp  on  k.  However,  this  is  not  necessary  for  the  purpose 
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Figure  20.    T2p  as  functions  of  the  shell  number  n  for  our  runs  Gl,  G3,  G5,  and  G7 

for  1p  =  4, 6, . . , . ,  20  (top  to  bottom).  Dashed  lines  indicate  T2p  =  0.5. 


of  figure  19,  where  we  show  comparisons  between  the  T2/s  shown  in  figure  18b  and 
the  form  suggested  in  (54)  (with  the  Cp's  calculated  from  the  SL  formula)  for  2p  =  4, 10 
and  16.  Clearly,  similar  power  laws  obtain  in  the  inertial  range.  We  note  in  passing  that 
the  slight  hump  before  the  dissipation  range  is  an  indication  of  the  bottleneck 
phenomenon  [61]. 

Figure  20  shows  F2p  as  a  function  of  the  shell  number  n  for  four  different  ReA  (our  runs 
Gl,  G3,  G5  and  G7).  The  general  conclusion  that  we  can  draw  from  these  graphs  is  that 
the  distribution  P(\vn\)  crosses  over  from  being  close  to  Gaussian  at  small  kn  to  ones  that 
fall  more  and  more  slowly  with  increasing  kn.  This  is  in  accord  with  the  data 
of  Katsuyama  et  al  (1994)  (note  our  wavenumber  kn  is  the  analogue  of  their  midband 
frequency  /c).  These  graphs  also  illustrate  that,  with  increasing  ReA,  the  extent  of  the 
inertial  range  increases  and  that  the  deviations  from  Gaussian  distributions  is  far  more  in 
the  dissipation  range  than  in  the  inertial  range.  Direct  plots  of  Iogr2p  versus  ReA  are  in 
general  agreement  with  these  statements,  but  are  not  very  smooth  (like  our  plots  of  8£p 
versus  ReA  (figure  12))  since  we  have  studied  only  eight  widely  separated  values  of  ReA. 
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4.  Concluding  remarks 

We  have  provided  a  brief  overview  of  experimental  and  numerical  studies  of  coherent 
structures,  structure  functions,  and  probability  distributions  in  turbulent  flows  that  are,  to 
a  good  approximation,  homogeneous  and  isotropic.  We  have  also  outlined  some  of  the 
phenomenological  and  stochastic  models  that  have  been  used  to  develop  an  under- 
standing of  these  quantities.  However,  as  is  well  known,  a  complete  theoretical 
understanding  of  homogeneous,  isotropic  turbulence  has  remained  an  elusive  goal,  in 
spite  of  the  substantial  progress  that  has  been  made  in  all  the  areas  described  above. 

Our  emphasis  here  has  been  on  using  direct  numerical  studies  of  the  deterministically 
forced  NS  equation  and  the  GOY  shell  model  in  conjunction  with  each  other.  In 
particular,  we  have  explored,  via  ESS  and  GESS,  an  apparently  universal  crossover  from 
inertial-  to  dissipation-range  behaviours  in  these  models.  Thus  our  study  adds  to  the 
growing  evidence  for  similar  multiscaling  in  NS  and  GOY  models.  It  is  tempting  to  say, 
therefore,  that  the  GOY  model  captures  some  of  the  universal  scaling  properties  of  fluid 
turbulence  because  all  known  conservation  laws  have  been  built  in  by  a  suitable  choice  of 
parameters  (see  above).  However,  we  must  temper  this  optimistic  view  for  there  are  many 
features,  which  seem  important  for  fluid  turbulence,  that  are  not  contained  in,  or  poorly 
represented  by,  the  GOY  model.  As  noted  by  Kadanoff  et  al  [84],  there  is  no  analogue  of 
sweeping  effects  in  the  GOY  model.  Furthermore,  since  the  GOY  model  has  only  scalar 
velocities  and  k's  which  are  logarithmically  spaced,  it  cannot  represent  well  the  high- 
vorticity  filamentary  structures,  which  are  believed  to  be  important  in  fully  developed 
turbulence.  Recall  that  the  arguments  which  lead  to  the  She-Leveque  formula  rely  on  the 
filamentary  nature  of  these  structures;  yet,  strangely  enough,  the  GOY-model  exponents 
£p  agree  reasonably  well  with  this  formula.  Are  all  these  apparently  important  effects, 
which  are  not  contained  in  the  GOY  shell  model,  irrelevant  in  some  way?  Unfortunately 
there  is  no  clear  answer  to  this  question  at  this  moment.  Further  studies  are  needed  to 
elucidate  the  similarities  and  differences  between  the  statistical  properties  of  the  solutions 
of  the  GOY  shell  model  and  the  3d  NS  equation.  However,  given  that  numerical  studies 
of  the  GOY  shell  model  are  clearly  far  easier  than  direct  numerical  simulations  of  the  3d 
NS  equation,  the  former  can  be  used  (with  caution)  as  preliminary  testing  grounds  for 
new  theoretical  ideas. 

Acknowledgements 

The  authors  thank  C  Jayaprakash,  R  Narasimha  and  S  Ramaswamy  for  discussions  and 
the  referee  for  helpful  suggestions.  They  also  thank  CSIR  and  BRNS,  India  for  support 
and  SERC,  Bangalore  for  computational  resources. 

References 

[1]  M  Van  Dyke,  An  album  of  fluid  motion  (The  Parabolic  Press,  Stanford,  California,  1982) 

[2]  K  R  Sreenivasan  and  R  A  Antonia,  Ann.  Rev.  Fluid  Mech.  29,  435  (1997) 

[3]  E  D  Siggia,  Ann.  Rev.  Fluid  Mech.  26,  137  (1994) 

[4]  M  Nelkin,  Adv.  Phys.  43,  143  (1994) 

[5]  V  L'vov  and  I  Procaccia,  Phys.  World  35,  (1996) 


[6]  W  D  McComb,  The  physics  of  fluid  turbulence,  (Uxtord  university  Fress,  uxrora, 

[7]  U  Frisch,  Turbulence:  the  legacy  of  A  N  Kolmogorov  (Cambridge  University  Press, 

Cambridge,  1995)  , 

[8]  T  Bohr,  M  H  Jensen,  G  Paladin,  A  Vulpiani,  Dynamical  systems  approach  to  turbulence,  to 

be  published  (Cambridge  University  Press,  Cambridge,  1997) 
[9]  G  K  Batchelor,  The  theory  of  homogeneous  turbulence  (Cambridge  University  Press, 

Cambridge,  1953) 
[10]  H  Tennekes  and  J  L  Lumley,  A  first  course  in  turbulence  (MIT  Press,  Cambridge, 

Massachusetts,  1972) 
[11]  A  S   Monin  and  A  M  Yaglom,  Statistical  fluid  mechanics   (MIT  Press,   Cambridge, 

Massachusetts,  1975) 

[12]  G  I  Taylor,  Proc.  R.  Soc.  (London)  A151,  421  (1935) 
[13]  G  S  Saddoughi  and  S  V  Veeravalli,  J.  Fluid  Mech.  268,  333  (1994) 
[14]  N  Goldenfeld,  Lectures  on  phase  transitions  and  the  renormalization  group  (Addison- 

Wesley,  New  York,  1992) 

[15]  To  the  best  of  our  knowledge,  a  direct  mapping  of  a  deterministic  partial  differential  equation 
(PDE)  with  spatiotemporal  chaos  onto  a  stochastic  PDE  has  been  carried  out  only  for  the 
Kuramoto-Sivashinsky  (KS)  equation.  This  mapping  uses  a  numerical  coarse-graining 
procedure  and  shows,  both  in  one  and  two  spatial  dimensions,  that  the  KS  equation  is  in  the 
universality  class  of  the  Kardar-Parisi-Zhang  (KPZ)  equation,  i.e.,  the  long-distance  and 
long-time  behaviours  of  their  correlations  functions  are  the  same.  (See:  S  Zaleski,  Physica 
D34,  427  (1989);  F  Hayot,  C  Jayaprakash  and  Ch  Josserand,  Phys.  Rev.  E47,  911  (1993); 
C  Jayaprakash,  F  Hayot  and  R  Pandit,  Phys.  Rev.  Lett.  71,  15  (1993).)  The  case  of  the 
de termini stically  forced  Navier-Stokes  equation  is  considerably  more  subtle;  however,  in  the 
absence  of  a  direct  mapping,  it  has  been  conjectured  that  the  appropriate  PDE  is  the  NS 
equation  with  an  additive,  Gaussian  white  noise  whose  variance  has  a  power-law  dependence 
on  the  wave-vector  [16,17,18] 

[16]  C  DeDominicis  and  P  C  Martin,  Phys.  Rev.  A19,  419  (1979) 

[17]  V  Yakhot  and  S  A  Orszag,  /.  Sci.  Comput.  1,  3  (1986);  Phys.  Rev.  Lett.  57,  1722  (1986) 
[18]  J  K  Bhattacharjee,  J.  Phys.  A21,  L  551  (1988);  Phys.  Rev.  A40,  6374  (1989);  Phys.  Fluids 

A3,  879  (1991) 

[19]  C-Y  Mou  and  P  B  Weichman,  Phys.  Rev.  Lett.  70,  1101  (1993) 
[20]  J  K  Bhattacharjee,  Pramana  -  J.  Phys.  48,  365  (1997) 

[21]  P  Bak,  C  Tang  and  K  Wiesenfeld,  Phys:  Rev.  Lett.  59,  381  (1987);  Phys.  Rev.  A38,  364  (1988) 
[22]  G  Stolovitzky  and  K  R  Sreenivasan,  Phys.  Rev.  E48,  R33  (1993) 
[23]  S  Douady,  Y  Couder  and  M  E  Brachet,  Phys.  Rev.  Lett.  67,  983  (1991) 
[24]  O  Cadot,  S  Douady  and  Y  Couder,  Phys.  Fluids  7,  630  (1995) 
[25]  E  Vlllermaux,  E  Sixou  and  Y  Gagne,  Phys.  Fluids  7,  2008,  (1995) 
[26]  L  F  Richardson,  Weather  prediction  by  numerical  process  (Cambridge  University  Press, 

Cambridge,  1922) 

[27]  A  N  Kolmogorov,  C.R.  Acad.  Sci.  USSR  30,  301  (1941) 
[28]  N  Cao,  S  Chen  and  K  R  Sreenivasan,  Phys.  Rev.  Lett.  77,  3799  (1996) 
[29]  H  L  Grant,  R  W  Stewart  and  A  Moilliet,  J.  Fluid  Mech.  12,  241  (1962) 
[30]  K  R  Sreenivasan,  Phys.  Fluids  7,  2778  (1995) 

[31]  F  Anselmet,  Y  Gagne,  E  J  Hopfinger  and  R  A  Antonia,  /.  Fluid  Mech.  140,  63  (1984) 
[32]  R  Benzi,  S  Ciliberto,  R  Trippiccione,  C  Baudet,  F  Massaioli  and  S  Succi,  Phys.  Rev.  E48, 

R29  (1993) 

[33]  J  Herweijer  and  W  van  de  Water,  Phys.  Rev.  Lett.  74,  4651  (1995) 
[34]  T  Katsuyama,  Y  Horiuchi  and  K  Nagata,  Phys.  Rev.  E49,  4052  (1994) 
[35]  S  Grossman,  D  Lohse,  V  L'vov  and  I  Procaccia,  Phys.  Rev.  Lett.  73,  432  (1994) 
[36]  V  S  L'vov  and  I  Procaccia,  Phys.  Rev.  Lett.  74,  2690  (1994) 
[37]  G  I  Barenblatt.and  N  Goldenfeld,  Phys.  Fluids  7,  3078  (1995) 
[38]  G  Zocchi,  J  Maurer,  P  Tabeling  and  H  Williame,  Phys.  Rev.  E50,  3693  (1994) 
[39]  B  Chabaud,  A  Naert,  J  Peinke,  F  Chilla,  B  Castaing  and  B  Hebral,  Phys.  Rev.  Lett.  73,  3227 
(1994) 

Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
362  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


Homogeneous  isotropic  turbulence 

[40]  V  Emsellem,  L  P  Kadanoff,  D  Lohse,  P  labeling  and  J  Wang,  chao-dyn/9604009,  Phys.  Rev. 

E  to  appear 

[41]  A  N  Kolmogorov,  J.  Fluid  Mech.  13,  82  (1962) 
[42]  A  A  Praskovsky,  Phys.  Fluids  A4,  2589  (1992) 
[43]  G  Stolovitzky,  P  Kailasnath  and  K  R  Sreenivasan,  /.  Fluid  Mech.  297,  275  (1995) 

G  Stolovitzky  and  K  R  Sreenivasan,  Rev.  Mod.  Phys.  66,  229  (1994) 
[44]  S  T  Thoroddsen  and  C  W  Van  Atta,   Phys.  Fluids  A4,  2592  (1992) 
[45]  Y  Gagne,  M  Marchand  and  B  Castaing,  J.  Phys.  4,  1  (1994) 

[46]  S  Chen,  G  D  Doolen,  R  H  Kraichnan  and  L  P  Wang,  Phys.  Rev.  Lett.  74,  1755  (1995) 
[47]  K  R  Sreenivasan.  and  P  Kailasnath,  Phys.  Fluids  AS,  512  (1993) 
[48]  U  Frisch,  P  L  Sulem  and  M  Nelkin,  /  Fluid  Mech.  87,  719  (1978) 
[49]  G  Parisi  and  U  Frisch,  in  Turbulence  and  predictability  in  geophysical  fluid  dynamics  edited 

by  M  Ghil,  R  Benzi  and  G  Parisi  (North-Holland,  Amsterdam,  1985)  pp.  84-87 
[50]  C  Meneveau  and  K  R  Sreenivasan,  J.  Fluid  Mech.  224,  429  (1991) 
[51]  Z  S  She  and  E  Leveque,  Phys.  Rev.  Lett.  72,  336  (1994) 
[52]  E  D  Siggia,  J.  Fluid  Mech.  107,  375  (1981) 

[53]  Z  S  She,  E  Jackson  and  S  A  Orszag,  Nature  (London)  344,  226  (1990) 
[54]  G  R  Chavarria,  C  Baudet  and  S  Ciliberto,  Phys.  Rev.  Lett.  74,  1986  (1995) 
[55]  B  Dubrulle,  Phys.  Rev.  Lett.  73,  959  (1994) 

[56]  D  Segel,  V  L'vov  and  I  Procaccia,   Phys.  Rev.  Lett.  76,  1828  (1996) 
[57]  C  Meneveau,  Phys.  Rev.  E54,  3657  (1996) 
[58]  R  Benzi,  L  Biferale,  S  Ciliberto,  M  Struglia  and  R  Tripiccione,  Europhys.  Lett.  32,  709 

(1995) 
[59]  S  Chen,  G  Doolen,  J  R  Herring,  R  H  Kraichnan,  S  A  Orszag  and  Z  S  She,  Phys.  Rev.  Lett.  70, 

3051  (1993) 

S  Chen,  G  D  Doolen,  R  H  Kraichnan  and  Z  S  She,  Phys.  Fluids  AS,  458  (1993) 
[60]  R  H  Kraichnan,  J.  Fluid  Mech.  5,  497  (1959) 

[61]  D  Lohse  and  A  Muller-Groeling,  Phys.  Rev.  Lett.  74,  1747  (1995);  Phys.  Rev.  E54,  (1996) 
[62]  A  Praskovsky  and  S  Oncley,  Phys.  Rev.  Lett.  73,  3399  (1994) 
[63]  R  Benzi,  L  Biferale,  G  Paladin,  A  Vulpiani  and  M  Vergassola,  Phys.  Rev.  Lett.  67,  2299 

(1991) 

[64]  Z  S  She,  Phys.  Rev.  Lett.  66,  600  (1991) 

[65]  P  Kailasnath,  K  R  Sreenivasan  and  G  Stolovitzky,  Phys.  Rev.  Lett.  68,  2766  (1992) 
[66]  G  Stolovitzky,  The  statistical  order  of  small  scale  turbulence,  Ph.D.  Thesis,  Yale  University, 

USA  (1994),  cited  in  ref.  [2] 

[67]  Z  S  She  and  E  C  Waymire,  Phys.  Rev.  Lett.  74,  262  (1995) 
[68]  S  A  Orszag  and  G  S  Patterson,  Phys.  Rev.  Lett.  28,  76  (1972) 
[69]  R  M  Kerr,  J.  Fluid  Mech.  153,  31  (1985) 
[70]  M  M  Rogers  and  P  Moin,  J.  Fluid  Mech.  176,  33  (1987) 
[71]  K  Yamamoto  and  I  Hosokawa,  J.  Phys.  Soc.  Jpn.  57,  1532  (1988) 
[72]  R  H  Kraichnan  and  R  Panda,  Phys.  Fluids  31,  (1988) 
[73]  M  Meneguzzi  and  A  Vincent,  in  Advances  in  turbulence  3  edited  by  A  V  Johansson  and 

P  H  Alfredsson  (Springer,  Berlin,  1991)  pp.  211-220;  A  Vincent  and  M  Meneguzzi,  J.  Fluid 

Mech.  258,  (1994) 

[74]  S  Kida  and  K  Ohkitani,  Phys.  Fluids  A4,  1018  (1992) 

[75]  J  Jimenez,  A  A  Wray,  P  G  Saffman  and  R  S  Rogallo,  /.  Fluid  Mech.  255,  65  (1993) 
[76]  N  Cao,  S  Chen,  and  Z-S  She,  Phys.  Rev.  Lett.  77,  3711  (1996) 
[77]  E  B  Gledzer,  Sov.  Phys.  Dokl  18,  216  (1973) 
[78]  K  Ohkitani  and  M  Yamada,  Prog.  Theor.  Phys.  81,  329  (1989) 
[79]  A  M  Obukhov,  Atmos.  Oceanic  Phys.    7,  41  (1971) 
[80]  V  N  Desnyansky  and  E  A  Novikov,  Atmos.  Oceanic  Phys.  10,  127  (1974) 
[81]  M  Yamada  and  K  Ohkitani,  J.  Phys.  Soc.  Jpn.  56,  4210  (1987) 
[82]  M  H  Jensen,  G  Paladin  and  A  Vulpiani,   Phys.  Rev.  A43,  798  (1991) 
[83]  D  Pisarenko,  L  Bieferale,  D  Courvoisier,  U  Frisch  and  M  Vergassola,  Phys.  Fluids  A5,  2533 

(1993) 


Sujan  K  Dhar  et  al 

[84]  L  Kadanoff,  D  Lohse,  J  Wang  and  R  Benzi,  Phys.  Fluids  7,  617  (1995) 
[85]  Sujan  K  Dhar,  Ph.D.  thesis,  Indian  Institute  of  Science,  Bangalore  (1996)  unpublished 
[86]  L  Kadanoff,  D  Lohse  and  N  Schorghofer,  Physica  D160,  165  (1997) 
[87]  L  Biferale,  A  Lambert,  R  Lima  and  G  Paladin,  Physica  D80,  105  (1995) 
[88]  Ohkitani  and  Yamada  [79]  have  shown  that  the  quadratic  invariant  Y^(-kn)a\vn\2  of  order  a 
is  conserved  in  the  inviscid,  unforced  GOY  shell  model  provided  one  chooses  8  =  1  —  A~Q. 
If  the  invariant  of  order  a.  is  conserved,  then  all  lower-order  invariants  are  also  conserved. 
The  choice  a  =  1,  i.e.,  8  =  1/2  for  A  =  2  thus  conserves  both  the  GOY-model  energy  and 
helicity  in  the  inviscid,  unforced  limit;  a  =  2,  i.e.,  S  =  5/4  also  conserves  the  GOY-model 
enstrophy  in  this  limit  also,  which  is  of  relevance  in  models  for  two-dimensional  turbulence 
[89]  S  K  Dhar,  A  Sain  and  R  Pandit,  Phys.  Rev.  Lett,    (to  appear)  (1997) 
[90]  L  Biferale  and  R  Kerr,  Phys.  Rev.  E52,  61133  (1995) 
[91]  R  Benzi,  L  Biferale  and  G  Parisi,  Physica  D65,  163  (1993) 
[92]  N  Schorghofer,  L  Kadanoff  and  D  Lohse,  Physica  D88,  40  (1995) 
[93]  Z  S  She,  E  Jackson  and  S  A  Orszag,  Proc.  R.  Soc.  London  A434,  101  (1991) 
[94]  M  E  Brachet,  D  I  Meiron,  S  A  Orszag,  B  G  Nickel,  R  H  Morf  and  U  Frsich,  J.  Fluid  Mech. 

130,411  (1983) 

[95]  J  Eggers  and  S  Grossmann,  Phys  Fluids  A3,  1958  (1991) 
[96]  S  Grossmann  and  D  Lohse,  Phys.  Rev.  ESO,  2784  (1994) 
[97]  E  Leveque  and  Z  S  She,  Phys.  Rev.  Lett.  75,  2690  (1995) 
[98]  V  Borue  and  S  A  Orszag,  Europhys.  Lett.  29,  6875  (1995) 
[99]  C  Foias,  0  Manley  and  L  Sirovich,  Phys.  Fluids  A2,  464  (1990) 
[100]  L  Biferale,  Phys.  Fluids  AS,  428  (1993) 
[101]  J  Eggers  and  S  Grossmann,  Phys.  Lett.  A156,  444  (1991) 
[102]  D  Lohse,  Phys.  Rev.  Lett.  73,  3223  (1994) 


physics  pp.  365-378 


The  randomly  stirred  fluid  -  Turbulence  as  a  problem  in 
statistical  mechanics 


J  K  BHATTACHARJEE 

Department  of  Theoretical  Physics,  Indian  Association  for  Cultivation  of  Science,  Calcutta  700032, 
India 

Abstract.  Properties  of  the  randomly  stirred  fluid  and  the  relevance  to  the  problem  of 
homogeneous  isotropic  turbulence  are  discussed. 

Keywords.     Homogeneous  isotropic  turbulence;  randomly  stirred  fluids. 
PACS  Nos    47.27;  05.70 

1.  Preliminaries 

This  article  deals  with  applications  of  the  methods  of  stastistical  mechanics  to  the 
problem  of  homogeneous,  isotropic  turbulence.  Turbulence  [1-7]  deals  with  a  wide  class 
of  flows  which  seem  to  possess  complex  and  seemingly  random  structure  at  some 
macroscopic  scale  of  dynamic  importance.  The  chief  physical  characteristic  is  a 
tremendous  enhancement  in  transport  properties — transport  of  momentum,  energy  and 
particles  is  orders  of  magnitude  higher  than  possible  by  molecular  process.  Equally 
important  is  the  sensitive  dependence  on  initial  conditions  [8]  of  a  turbulent  flow.  Two 
turbulent  flows  which  are  nearly  identical  at  a  given  time  do  not  remain  so  on  timescales 
of  dynamical  interest.  It  is  the  sensitivity  to  initial  conditions  which  makes  the  methods  of 
statistical  mechanics  applicable  to  the  problem  of  turbulence.  The  key  to  this  is  the  fact 
that  while  details  of  fully  developed  turbulence  are  sensitive  to  triggering  disturbances, 
average  properties  are  not.  Hence  statistically  averaged  quantities  are  the  dynamical 
variables  of  interest. 

The  governing  dynamical  equation  for  the  study  of  turbulence  is  the  Navier-Stokes 
equation  for  the  velocity  field  v(r,t), 

^+(v-V)v=-  —  +  zA72v  (1.1) 

where  P  is  the  pressure,  p  the  density  and  v  the  kinetic  viscosity.  For  an  incompressible 
fluid,  there  is  the  additional  condition  that 

V-v  =  0.  (1.2) 

All  the  information  regarding  the  turbulence  should  be  contained  in  (1.1),  when  it  is 
supplemented  with  the  boundary  conditions  and  initial  conditions.  We  now  demonstrate 
that  maintained  turbulent  motion  in  a  closed  system  cannot  be  described  by  (1.1).  To  do 
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boundaries,  then 

(1.3) 


and  since  the  rate  of  change  is  always  negative,  motion  has  to  cease  at  t  =  oo.  Thus,  it  is 
necessary  to  augment  the  right  hand  side  of  (1.1),  by  a  force  f,  which  feeds  energy  into 
the  system.  We  will  endow  this  force  f,  in  what  follows,  with  statistical  property. 

Even  if  the  flow  is  not  bounded,  it  is  natural  to  think  of  an  additional  force,  if  we  are 
worrying  about  the  dynamics  of  the  fluctuating  part  of  the  velocity  field.  To  see  this,  we 
decompose  the  total  velocity  field  as 

v(r,0=V(r)  +  u(r,0  (1.4) 

where  V  is  a  mean  flow  and  u  is  a  fluctuating  field  of  zero  mean.  In  other  words, 
(v(r,  t}}  =  V(r),  where  the  prescription  for  taking  averages  will  have  to  be  provided.  If 
we  insert  the  above  decomposition  into  the  Navier-Stokes  equation  for  incompressible 
flow,  then  we  arrive  at 

|i  =  "VV  +  ,VV  -  |  -  |  -  ±  (VlVj  +  UiUj  +  VlUj  +  Vjui}        (L5) 

(note  that  pressure  has  been  decomposed  as  P  +  /?). 
Statistical  average  yields 


and  thus 

where 

9 

The  fluctuating  velocity  field  thus  experiences  an  extra  force  coming  mainly  from  the 
mean  flow.  It  is  this  interaction  of  the  fluctuating  part  with  the  mean  flow  that  sustains  the 
turbulence. 

Now,  about  the  averaging  process.  The  available  averages  are  space  averages,  time 
averages  and  ensemble  averages.  Spatial  averages  make  sense  only  if  scales  over  which 
the  turbulence  is  homogeneous  or  approximately  so  are  much  larger  than  the  scale  of 
turbulent  fluctuations.  Similarly,  the  time  averaging  is  useful  if  the  time  over  which  the 
turbulence  is  statistically  stationary  is  longer  than  the  time  for  turbulent  fluctuations. 
Experiments  usually  deal  with  time  averages  obtained  by  using  a  local  probe.  The  third 
average  is  the  ensemble  average  where  the  average  is  taken  over  different  realizations  of 
the  turbulent  field.  This  ensemble  averaging  makes  sense  and  can  be  equal  to  the  time 
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turbulent  flow  strongly  mixing  and  ergodic.  However,  this  would  imply  the  existence  of  a 
stable  asymptotic  stationary  state.  It  is  likely  that  such  a  state  exists  but  its  existence  has 
not  been  proven. 

We  now  return  to  (1.7)  and  try  to  specify  the  properties  of  f  .  To  ignore  the  questions  of 
boundary  conditions,  mean  flow,  initial  conditions  etc.,  we  make  f  a  random  force.  We 
then  need  to  specify  the  correlations  of  f  .  To  do  this,  we  need  to  keep  in  mind  that  the  role 
of  /  is  to  supply  energy  at  large  length  scales  —  preferably  near  the  boundaries.  If  e  is  the 
rate  per  unit  time,  per  unit  mass  at  which  energy  is  injected  into  the  system  and  fco  is  a 
small  wavenumber  (ko  ~  O(L~})},  L  being  the  system  size),  then  a  possible  prescription 
for  the  correlation  is  (note,  dimension  of  e  is  L2/T3,  L.and  T  being  scales  for  length  and 
time) 

))  =  e6(k2  -  k2}6D(k  +  k'Wu,  +  J).  (1.9) 


In  the  above  Py(k)  is  the  projection  operator  which  ensures  that  V  •  f  =  0[Py-(&)  = 
Sy  —  (kikj/k2}},  D  is  the  dimensionality  of  space  and  the  factor  kD~2  is  required  to 
maintain  the  correct  dimensions.  We  now  use  the  approximation  that  instead  of  being 
strictly  zero  for  k  >  ko  as  required  by  the  <5-function  in  (1.9),  the  correlation  will  fall  off 
as  a  power  law  f(k)  expressed  by 

f(k}  =  k2~y  (i.io) 

to  write  the  correlation  of  (1.9)  as 

(1.11) 


with  y  taken  to  be  an  arbitrary  parameter.  We  thus  arrive  at  the  randomly  stirred  model  [9] 
of  De  Domincis  and  Martin  [10,  11].  The  question  arises  if  this  model  is  to  describe  fully 
developed  turbulence,  then  is  there  a  special  value  of  y. 

To  address  the  issue  of  what  value  of  y  is  relevant,  we  need  to  discuss  correlation 
functions,  energy  spectrum,  inertial  range  [12]  etc.  To  begin  with,  let  us  consider  the 
velocity  correlation  function  C//(fc,  <J)  defined  as 


)Pu(k)  (1.12) 

The  expectation  value  is  taken  over  the  probability  distribution  associated  with  the  noise 
in  the  randomly  stirred  model  introduced  above.  The  total  energy  in  the  turbulence  field  is 
given  by  /  d°k  du>C(k,  u}  and  the  energy  spectrum  E(k)  is  defined  as 

f  E(k)dk=  f  dDkdujC(k,u).  (1.13) 

A  dimensional  analysis  yields  the  scaling  behaviour  of  E(k)  in  what  is  known  as  the 
inertial  range.  The  inertial  range  encompasses  fc-values  which  are  neither  too  small,  nor 
too  big.  The  small  ^-values  correspond  to  the  length  scales  at  which  the  forcing  occurs 
and  energy  is  injected  into  the  system.  The  large  ^-values  correspond  to  small  length 
scales  where  the  molecular  viscosity  is  important  and  dissipates  the  injected  energy.  The 

Pramana  -  J.  Phys.,  Vol.  48,  No.  1,  January  1997  (Part  I) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  367 


/  K  Bhattacharjee 

result  is  a  maintained  stationary  state  where  the  energy  flows  from  the  large  length  scales 
to  the  short  length  scales  at  a  constant  rate,  which  is  the  e  introduced  before.  The  main 
assumption  is  that  in  the  inertial  range  E(k}  is  determined  by  e  and  k  and  does  not  depend 
upon  the  phenomena  at  long  or  short  length  scales.  This  leads  to 

E(k}  =  CK<?'\-5l\  (1.14) 

where  CK  is  a  universal  number  found  to  be  somewhere  between  1.5  and  2.0  in  many 
experiments  [13,  14].  If  we  use  dimensional  analysis  to  get  the  characteristic  decay  rate  F 
in  this  stationary  state,  we  find 

(1.15) 


where  FQ  is  another  universal  constant.  It  is  clear  from  (1.13)  and  (1.14),  that  in  three 
dimensions 

C(k]  =   I  C(k,u}du  ~  k~n/3  (1.16) 

In  general,  in  D-dimensions,  C(k]  ~  &-(D+(2/3)).  To  probe  the  energy  transfer  rate,  let  us 
write  the  Navier-Stokes  equation  ((1.7)),  in  momentum  space  as 


Ui(k]  +  v&ut  =  £%(k)u,(p)M/(k  -  P)  +//(k)  (1.17) 

p 

where 
and 


From  (1.17),  we  have 
dr^   >,„ 


If  we  integrate  over  all  k  in  (1.18),  then 

=  0 


-P) 
(1.18) 


p 

in  the  stationary  state.  We  define  the  quantity 

f* 


I  ^  E%(-k)<M'(k/)MXP)MKk  -  p)},  (1.19) 

Jo 


which  now  stands  for  the  rate  at  which  energy  is  flowing  from  wavenumbers  below  k  to 
wavenumbers  above  k.  Kolmogorov  picture  asserts  that  II  (k)  is  independent  of  k, 

We  now  turn  to  the  randomly  forced  model.  To  find  out  how  the  relaxation  rate  T(k) 
scales  with  k,  we  use  the  effective  viscosity  argument  of  Heisenberg.  In  the  inertial  range, 


where  the  non-linear  term  in  Navier-Stokes  equation  is  dominant,  it  is  this  non-linear 
term  which  gives  rise  to  an  effective  scale  dependent  viscosity.  Noting  that  the  velocity 
field  can  be  written  as 

//• 
d°p  I  dt'Gin(k,  t  —  t'}MnjiU;(p,  t'}u/(k  —  p,  t'}  (1.20) 

J 

the  averaged  effect  of  the  non-linear  term  is  clearly  expressible  as 

ddpdt'MiJi(\si)Mmn(p)us(}s}Gjm(p,  t  -  t'}(ur(k  -  p,  r)w,(-k  -  p,  t'}). 
The  Greens  function  G/-n(p,  t  —  t1}  can  be  written  as 


p 

rjn 


where  ]£(/?)  *s  tne  effective  relaxation  rate  or  the  self  energy  in  technical  terms.  The 
above  expression  then  acquires  the  form 

dDpdt'Mijl(]<.)Minrs(p}Pjm(p}Prl(k  -  p) 

x  e~  £>)('-')  C(k  -  P,  *  -  t')us(k,  t'} 

where  C(k  —  p,  t  —  r7)  is  the  time  Fourier  transform  of  C(k  —  p,  ui).  Now  C(q,  uj],  has  the 
form  q~D+4~y[uJ2  +  X^O?)]"1  and  thus  the  last  expression  reduces  to 


jTis(k,t-t'}us(k,t'}dt' 


where  the  normalized  trace  of  Tis(k,  t  —  t')  is  the  effective  relaxation  rate,  whose  Fourier 
transform  yields  the  frequency  dependent  rate  XX^W)  as> 


p,u}-uj'}  (1.21) 

Noting  that  JC(k,uj}duj  ~  [Y^(k)}~1  k~D+4~y  ,  it  is  easy  to  see  that  power  counting 
consistency  of  (1.21)  gives 

J>)oc£2-(y/3).  (1.22) 

Turning  to  (1.19)  using  (1.20)  and  following  an  identical  line  of  argument  that  led  to 
(1.22),  we  find  that  II(&)  ~  k4~y.  Kolmogorov  asserts  that  II(&)  =  e  and  hence  one 
requires  y  =  4  for  the  Kolmogorov  picture  of  turbulence  [15-18].  That  raises  the  central 
question  is  the  theory  well  behaved  at  y  =  4? 


2.  Technicalities 

Kolmogorov   scaling   and  the  randomly   stirred  Navier-Stokes   equation   would  be 
completely   compatible  if  (1.21)   would  yield   in   addition   to   the  power   counting 
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consistency  as  demonstrated  towards  the  end  of  the  last  section,  a  finite  number  for 
universal  amplitude  ratios.  With  vertex  corrections  not  yielding  any  new  power  laws 
because  of  Galilean  invariance,  the  problem  of  turbulence  would  reduce  to  calculating  the 
universal  numbers  in  a  loop  ordered  perturbation  scheme.  We  now  demonstrate  that  there 
are  technical  problems  with  this.  To  exhibit  this,  we  explore  the  integrand  on  the  rhs  of 
(1.21).  In  particular,  we  probe  the  situation  where  the  external  momentum  k  is  carried 
almost  entirely  by  the  propagator  G  and  the  momentum  associated  with  the  correlator  C 
is  very  small.  The  zero  frequency  self  energy  can  then  be  written  as  (suppressing 
projection  operators  and  indices). 

,-«/).  (2-1) 

Now,  G~1(k,u/}  =  —id  +  ^(^)w/)  —  ZX^)  as  we  cover  the  frequency  range  where  a/ 
matches  p2/3  and  hence  is  small  compared  to  £](&).  Equation  (3.1)  thus  becomes 
(for  y  =  4) 

£(*)< 


^cM 

\ —  / 

M>«    fp       1 


This  integral  diverges  due  to  the  zero-p  range  contribution  and  needs  to  be  cut  off  at  some 
low  momentum  JCQ.  The  scaling  solution  is  consequently  no  longer  valid. 

The  above  divergence  comes  from  the  very  strong  dynamical  coupling  between 
the  eddies  (fourier  modes)  with  short  wavelength  and  the  eddies  with  long 
wavelength.  This  effect  is  spurious.  The  expected  role  of  the  large  eddies  is  to 
simply  transport  the  much  smaller  ones  and  not  to  have  a  strong  dynamical  coupling. 
This  spurious  effect  was  attributed  to  the  use  of  the  Eulerian  picture  and  Kraichnan 
tried  to  remedy  it  by  going  into  the  Lagrange  description.  This  removes  the  problem 
in  principle  but  actual  calculations  are  virtually  impossible.  A  different  approach 
was  tried  by  Yakhot  and  Orszag  who  used  the  standard  renormalization  group  technique 
to  circumvent  the  problem.  The  idea  was  that  one  splits  the  velocity  field  into  two  parts  — 
one  with  high  momentum  Fourier  modes  and  the  other  with  low  momentum  Fourier 
modes,  integrates  out  the  high  momentum  components  and  studies  the  effect  of  that 
on  the  low  momentum  ones.  This  leads  to  a  new  viscosity  and  after  the  usual  rescalings 
of  space,  time  and  the  velocity  field  a  flow  equation  for  the  viscosity  emerges.  The 
fixed  point  of  the  flow  corresponds  to  a  scale  dependent  viscosity.  By  construction, 
this  yields  long  wavelength,  low  frequency  properties  as  stressed  by  Forster  et  al.  In  the 
limit  of  extremely  high  Reynolds  number,  one  is  not  particularly  worried  by  this 
limitation.  The  process  of  integrating  over  the  high  momentum  modes  bypasses  the 
difficulty  that  we  encountered  with  the  low-p  divergence.  We  sketch  the  steps  in  the 
following: 

(i)  Split  the  velocity  field  as 
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where 


and 


(ii)  Find  the  equations  satisfied  by  u>  and  wf . 
(iii)  Integrate  perturbatively  the  equation  for  uf 

(iv)  Insert  the  solution  for  uf  into  the  equation  for  uf  and  find  the  equation  of  motion  for 
uf.  The  linear  part  gives  Ai',  the  change  in  kinematic  viscosity. 

(v)  Rescale  space  and  length  and  the  velocity  field,  so  that  the  new  viscosity  is  given  by 

z/  =  vtf-i  ~  8b4-y~2zFQk-2  r     ^-~D  f  ^Mp/37(k) 

r        \)  *  ' 

x  Ma57(k— p)P^(k— p)P(Tp(p)G(k— p  —  w)|G(p,u;)|  p4~D~y.       (2.3) 
Requiring  that  the  two  terms  on  the  rhs  have  the  same  dimensions  ensures 

z  =  2-|  (2.4) 

and  hence 

/  7  \  j  — V/3  /O     CN 

Ml  If  \    — ^-    7 1  If     * I  I    x       i   I 

>S  \fv )    ~~~   ts*\i  \  J 

as  in  (1.22).  Evaluating  the  integral  in  (2.3)  in  the  limit  of  k  — »•  0  gives 

Dn  D-l 


where  SD  is  the  surface  area  of  a  D-dimensional  sphere.  The  fixed  point  condition  yields 
z/3       3     SD    D-l 


2y    27rD  D  +  2 


(2-7) 


From  (1.19),  one  can  determine  the  value  of  Dp/i^3  and  thus  DQ/V  which  sets  the  scale  for 
the  Kolmogorov  spectrum  is  obtained.  For  y  =  4  and  D  =  3,  one  does  find  a  value  very 
close  to  the  experimental  results  and  similar  success  in  calculating  other  universal 
numbers  certainly  indicates  that  this  is  a  successful  program. 

The  primary  difficulties  of  this  approach  were  pointed  out  by  De  Dominicis  and  Martin 
[10]  at  the  time  they  introduced  the  model  at  the  Kolmogorov  limit  i.e.  y  =  4,  there  are  an 
infinite  number  of  marginal  operators  and  the  self  consistent  perturbation  theory  has  an 
infrared  divergence  at  y  =  3.  In  the  last  few  years,  both  these  effects  have  been 
investigated  in  a  different  context — the  problem  of  growth  by  deposition  of  atoms  on  a 
substrate  [19-22].  The  existence  of  an  infinite  number  of  marginal  operators  and  infrared 
divergences  seem  to  change  the  roughening  and  the  dynamical  exponents  of  the  problem 
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[23-26],  Consequently,  in  spite  of  impressive  numerical  agreement  with  experiments,  the 
renormalization  group  approach  is  somewhat  suspect. 

We  now  note  that  the  infrared  divergence  that  started  the  problem  in  the  first  place  is 
caused  by  the  strong  dynamical  coupling  between  small  scale  (large  A;-Fourier 
components)  energy  dissipating  eddies  and  the  large  scale  (small  ^-Fourier  components) 
energy  containing  eddies.  For  the  validity  of  Kolmogorov  scaling,  the  small  eddies  should 
simply  be  advected  by  the  large  eddies  without  any  dynamical  coupling.  Hence 
Kolmogorov  scaling  requires  a  screening  [27,  28]  of  the  interaction  between  the  large  and 
small  Fourier  coefficients.  This  is  provided  by  viscoelasticity  [29].  Turning  to  (1.21),  we 
note  that  the  effective  self  energy  £(fc,u;)  is  strongly  frequency-dependent  for 
frequencies  greater  than  XX&,  0).  If  we  write  the  ^-dependence  of  the  relaxation  rate  as 


T(k)  =  Fofc"  (2.8) 

and  that  of  the  equal  time  correlation  function  as 

C(k)  =  cQk-m  (2.9) 

then  dynamic  scaling  yields 

£(*>0)  =  ro*"  (2-10a) 

and 

]T(fc,u;)  =  fc2Hw)(2-")/n    forw»r(fc).  (2.10b) 

The  corresponding  correlation  function  is  <? 


for  frequencies  u  such  that  U>TS  »  1,  where  rs  is  the  sweeping  time.  Light  scattering  from 
the  randomly  stirred  fluid  under  the  right  conditions  [28]  should  be  able  to  establish  the 
validity  of  (2.11). 

We  now  introduce  the  form  of  C(k,  uS)  given  above  and  using  a  Lorentzian  G(k,  uS)  i.e., 
,  arrive  at  (see  eq.  (2.1)) 


f   d°P 

I  2« 


If  we  now  explore  the  region  of  the  integrand  where  p  — »  0,  then 

c(p)  r  ™-  -       n~' 


P2[E(k)](n~2/n\ 
/(rf 
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The  infrared  divergence  now  occurs  for  m  +  n  >  D  +  2.  For  the  Kolmogorov  case  of 
m  =  11/3  and  n  =  2/3  in  D  =  3  one  is  safe. 

The  above  manipulations  bear  some  similarity  (in  the  final  results)  to  the  quasi- 
Lagrangian  [30]  approach  advocated  by  Belinicher  and  L'vov  [31,  32].  This  has  recently 
been  taken  up  again  by  L'vov  and  Procaccia  [33]  who  work  entirely  in  coordinate  space  to 
show  that  the  Kolmogorov  spectrum  does  lead  to  a  finite  theory  in  the  perturbative 
analysis  of  the  forced  Navier-Stokes  equation. 

In  all  our  considerations  so  for,  dimensionality  plays  a  very  minor  role.  The  exponent  z 
is  independent  of  dimension  while  the  exponent  n  carries  a  trivial  dependence  on 
dimensionality.  This  implies  that  one  may  possibly  consider  a  one  dimensional  randomly 
stirred  model  and  pick  up  the  features  that  we  have  talked  about  so  far.  In  one  dimension 
it  is  convenient  to  consider  Burger's  equation  which  is  our  Navier-Stokes  equation 
without  the  pressure  term.  For  the  incompressible  flows  that  we  were  dealing  with,  the 
pressure  term  simpy  modified  the  coupling  constant  associated  with  the  nonlinear  term 
and  hence  did  not  cause  any  qualitative  change.  Dropping  it  and  considering  the 
randomly  stirred  Burgens  equation  is  one  dimension  is  not  going  to  cause  any  qualitative 
changes  in  the  central  problem.  Thus 

u  +  u  —  u  =  vV2u+f  (2-14) 

dx 

with 

</(*,  /)/(*',  0)  =  D0k-l+y-46(x  -  j!)6(t  -  0  (2.15) 


is  expected  to  capture  the  essential  features  of  fully  developed  Kolmogorov  turbulence  for 
y  —  4,  there  is  an  additional  complication.  Without  the  drive/,  Burgers  equation  is  known 
to  have  coherent  structures  in  the  form  of  shock  waves  [34].  To  see  Kolmogorov  scaling, 
the  random  drive  has  to  overcome  the  shocks.  This  idea  that  (2.14)  contains  the  basic 
ingredients  for  understanding  fully  developed  turbulence  has  recently  been  supported 
very  stronglly  by  Cheklov  and  Yakhot  [35],  Polyakov  [36]  and  Gurarie  and  Migdal  [37]. 
The  thrust  of  our  discussion  so  far  has  been  that  Kolmogorov  scaling  is  exact.  Our 
calculations  have  been  in  the  inertial  range  where  the  effective  relaxation  rate  £}(&) 
dominates  the  molecular  relaxation  rate  vk2.  This  means  that  we  are  working  with 
k  <C  &D  where  kD  is  the  wavenumber  formed  from  the  rate  of  dissipation  e  and  the 
kinematic  viscosity  v.  Clearly 

(2.16) 

and  in  the  inertial  range  k^L  »  1,  where  L  is  the  system  size.  With  e  ~  v3/L  (v  is  a  mean 
velocity) 

kDL~(Re}3/4  (2.17) 

Where  Re  =  vL/v  is  the  Reynold's  number  and  the  self  consistent  approach  that  we  have 
adopted  (and  which  works!)  requires  Re  — >•  oo.  Thus,  in  the  infinite  Reynold's  number 
limit,  the  results  for  the  relexation  rate  and  the  two  point  function  (v(£,  w)v(A/,o/)} 
(alternatively  ([v(x  +  r,  t)  —  v(x,  /)]2})  seem  to  be  exact. 
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We  now  have  to  face  the  fact  that  a  whole  class  of  experiments  yield  results  for  higher 
order  structure  factors  [38^41]  that  are  at  variance  with  the  Kolmogorov  expectations.  If 
one  investigates  the  structure  factor  (\v(x  +  r,  t)  —  v(x,  ?)|  ),  then  the  exponent  specifying 
the  r-dependence  does  not  depend  linearly  on  p.  On  the  other  hand  assuming  that  once  the 
two  point  function  has  been  shown  to  be  finite,  there  would  be  no  new  divergences  in  the 
higher  order  correlations  and  hence  the  exponent  for  the  pth-order  structure  factor  should 
be  a  linear  function  of  p.  The  correlation  function  (e(x  +  r)e(x)}  in  particular  is  of  great 
interest  (e(x]  is  the  dissipation  rate  at  the  point  x,  averaged  over  a  local  area)  since  in  the 
Kolmogorov  picture  it  has  no  r-dependance.  Various  experiments  reveal 
(e(x  +  r)e(x}}  ~  r~^  with  //  a  small  number  (about  0.20).  This  phenomenon  is  known 
as  intermittency  and  jj,  is  often  called  the  intermittency  exponent.  We  will  return  to  this 
issue  in  §  3. 

The  Kolmogorov  picture  in  two  dimensions  is  special  because  there  are  two  cascades 
to  contend  with — energy  and  enstrophy.  Enstrophy  is  defined  as  (V  x  v)2.  The  direction 
of  the  cascades  can  be  determined  by  an  argument  due  to  Kraichnan  [42].  With  two 
conserved  quantities, 

v2MH2r-IV^Iv^l2  - 
v  \j(,j\ji  j(.  —  —  /     KvA/l    — 

2    k 
and 

N-- 

~2 


=   fw(k)dk 
J 


where  E(k]  and  W(k}  are  the  energy  and  enstrophy  spectrum  respectively.  The  canonical 
probability  distribution  for  fluctuating  E  and  N  is  clearly 


This  leads  to 

(\vk\2}  =  (P  +  ak2rl  (2.19) 

and  consequently 


and 

W(k)  = 


ak2 


The  enstrophy  spectrum  W(k]  ~  k  for  high  wave  number  and  hence  is  clearly 
concentrated  towards  high  wave  number  side.  A  spectrum  with  W(k)  ~  k*,  x  <  1,  will 
be  out  of  equilibrium  and  will  proceed  towards  equilibrium  by  cascading  enstrophy  from 
low  to  high  wave  numbers.  Energy  conservation  would  then  demand  an  inverse  cascade 
of  energy.  The  enstrophy  cascade  is  dissipated  by  molecular  viscosity  at  high  wave 
vectors,  while  the  inverse  energy  cascade  causes  a  condensation  phenomenon  at  low  wave 
numbers.  The  Kolmogorov  argument  holds  for  the  inverse  energy  cascade  and  one  ends 
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Randomly  stirred  fluid 

p  with  E(k)  ~  &~5/3,  while  for  the  enstrophy  cascade  one  has  to  set  up  the  dimensional 
rgument  once  more,  postulating  that  in  the  inertial  range  E(K]  is  determined  by  the  rate 
f  injection  of  enstrophy  (es  say  )  and  the  local  wave  number  k.  This  leads  to  E(k)  ~  fc~3. 
he  infrared  difficulties  that  we  talked  about  in  the  beginning  of  the  section  will  appear 
ere  as  well  and  a  screening  approximation  will  produce  a  self  energy  of  the  form  given 
i  (2.13).  For  the  energy  cascade  in  D  =  2,  m  =  8/3  and  n  =  2/3  and  the  inequality 
i  +  n  <  D  +  2  is  satisfied  which  makes  the  theory  finite.  For  the  enstrophy  cascade  on 
le  other  hand,  m  =  4  and  n  —  0  and  we  have  m  +  n  =  D  +  2.  That  means  a  genuine 
)garithimic  divergence  and  hence  for  the  enstrophy  cascade  [44-46] 


E(k)  =  c0(es)23(ln/c/fcor1.  (2.20) 


et  another  problem  where  there  exists  multiple  cascades  is  the  rnagnetohydrodynamic 
irbulence  [47-50].  The  conserved  quantities  in  the  inviscid  limit  are  the  kinetic  energy 
tid  the  magnetic  energy.  Consequently,  there  is  a  magnetic  energy  flux  and  a  kinetic 
ciergy  flux  and  one  can  determine  [51]  the  Kolmogorov  constants  etc.,  for  this  flow  in  a 
lanner  analogous  to  that  for  the  pure  fluid.  The  binary  liquid  is  another  example  of  a 
.tuation  with  different  fluxes  [52]  —  an  energy  flux  and  a  concentration  flux  and  a 
^normalization  group  program  for  that  has  been  carried  out  [53,  54]  recently. 

.  Subtleties 

'his  short  section  has  to  do  with  the  question  whether  Kolmogorov  scaling  is  exact  for 
le  forced  Navier-Stokes  equation  or  not.  To  set  the  stage,  we  begin  with  the  earliest 
iscussion  [54,  55]  of  this  topic.  Soon  after  the  publication  of  Kolmogorov's  work,  it  was 
ointed  out  by  Landau  that  the  theory  could  be  internally  inconsistent.  The  dissipation 
ite  e  is  found  from  Navier-Stokes  equation  to  be  expressible  as 


=  v  /  d"r(v  •  V2v)z  =  , 

J  t?2J         ^dx? 

leing  an  integral  over  a  fluctuating  quantity,  namely  the  velocity  field,  the  dissipation 
ite  e  is  expected  to  show  fluctuations  which  are  ignored  in  the  Kolmogorov  picture.  It 
'as  not  till  two  decades  had  passed  that  Kolmogorov  and  Obukhov  returned  to  this  issue 
nd  took  Landau's  objection  into  account  by  assuming  that  the  local  dissipation  rate  er  at 
le  point  r  (averaged  over  a  small  ball  around  r)  had  a  log  normal  distribution  i.e.,  lner 
ad  a  Gaussian  distribution. 
Kolmogorov  assumed  that  the  width  a  of  the  distribution  had  the  form 

_z      /n  i2\       f  A  +  96lnL/r.      far »  1  ,_  _,. 

o2  =  {[lner  -  m]z)  =  <  A,^QX,     '  ,'       *    ^  -  (3.2) 


Wiere  m  =  (lner)  and  6  is  an  universal  number.  The  constants  A  and  A'  are  determined  by 
ic  large  scale  structure  of  the  flow.  The  log  normal  distribution  of  er  requires  that  the 
robability  distribution  Pr(a]  for  er  =  a  is  given  by 

Pr(a]  = J_--On*-m)V2^ 

2 
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This  distribution  leads  to 

(€P}  =  eP^p2^  (3.4) 

and  subsequently 

/7\(«/2Mp-3) 

<|v(x  +  r,  0  -  v(x,  OP)  -  («y/3    -  (3-5) 


The  intermittency  exponent  defined  in  the  previous  section  is  obtained  from  p  =  6  and 
found  to  be  \L  =  98. 

Experimental  results  on  the  high  order  structure  factors  do  not  quite  agree  with  the  p- 
dependence  of  the  exponent  shown  in  (3.5),  but  they  certainly  follow  the  qualitative 
feature  that  the  exponents  are  not  a  linear  function  of  p.  The  deviation  from  linearity 
prompts  a  multifractal  description,  which  in  turn  inspires  a  dynamical  system  description 
of  turbulence  in  the  form  of  shell  models.  We  shall  not  discuss  these  issues  here  they  will 
be  addressed  in  detail  by  Pandit  et  al  [56]  in  this  volume.  We  will  confine  our  discussion 
to  the  randomly  stirred  model. 

The  discussion  of  the  previous  section  leads  us  to  believe  that  the  scaling  of  the  two 
point  function  is  exact  in  the  high  Reynold's  number  limit  (finite  Reynolds  number 
corrections  [57,58]  have  been  found  which  provide  correction-to-scaling  and  hence  an 
appearance  of  deviation  from  Kolmogorov  scaling).  However,  the  case  of  higher  order 
correlation  functions  had  not  been  discussed  till  almost  the  mid-nineties.  The  natural 
quantity  to  focus  on  first  is  the  exponent  fj,  and  a  lowest  order  calculation  of  the 
corresponding  correlation  function  showed  [59]  a  logarithmic  behaviour  i.e., 
(e(x-f-r)e(x)}  ~  6\n(L/r).  For  small  fj,,  the  r~^  behaviour  can  be  expanded  to  give 
//  ln(l/r)  and  6  can  be  identified  with  fi.  This  yields  a  reasonable  value  of  p.  The  work  of 
Lebedev  and  L'vov  [60]  and  subsequent  calculations  [61,  62]  examine  the  structure  of  the 
relevant  correlation  function  to  all  orders  and  conclude  that  a  resummation  can  lead  to  a 
power  law.  However,  they  do  not  calculate  the  exponent  explicitly. 

Recent  developments  [35-37]  very  strongly  support  the  idea  that  the  higher  order 
structure  factors  in  the  randomly  stirred  fluid  do  show  deviations  from  pure  Kolmogorov 
scaling.  These  have  to  do  with  the  one  dimensional  Burgers  equation-turbulence 
discussed  before.  Numerical  work  [35]  does  clearly  show  a  non-zero  p,  and  analytical 
work  from  two  different  directions  —  operator  products  expansion  [36]  and  WKB  type 
analysis  —  reveals  a  non-Gaussian  probability  distribution  [63]. 
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Chaos  in  the  hydrogen  atom  interacting  with  external  fields 
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Abstract.  In  this  review  we  discuss  the  chaotic  dynamics  (both  classical  and  quantal  aspects)  of  a 
simple  atomic  system,  namely  hydrogen  atom  interacting  with  time  independent  and  time 
dependent  external  fields.  These  include:  i)  static  electric  field,  ii)  static  magnetic  field,  iii) 
combined  electric  and  magnetic  fields,  in  parallel  and  perpendicular  configuration,  iv) 
instantaneous  and  generalized  van  der  Waals  field,  v)  mass  anisotropy  and  vi)  linearly  and 
circularly  polarized  microwave  fields. 

Keywords.  Classical  and  quantum  chaos;  hydrogen  atom  in  external  fields;  Stark  and  Zeeman 
effect;  van  der  Waals  field;  linearly  and  circularly  polarized  microwave;  ionization;  diffusion; 
Rydberg  states. 

PACS  Nos    05.45;  31.15;  32.30;  32.60;  32.80 


1.  Introduction 

Most  of  the  naturally  occurring  phenomena  belonging  to  various  disciplines  of  science 
are  generally  governed  by  nonlinear  differential  equations,  but  for  the  sake  of 
mathematical  tractability  and  simplicity  they  have  been  approximated  to  linear  forms. 
In  recent  years,  with  the  advent  of  powerful  computers  and  advanced  numerical 
techniques,  these  nonlinear  differential  equations  can  be  solved  to  a  greater  extent. 
Moreover  analysing  the  long  time  behaviour  of  such  systems  is  also  feasible  now. 

In  addition  to  the  above,  recently  many  advanced  analytical  techniques  have  been 
devised  to  solve  some  of  these  nonlinear  differential  equations.  In  fact  a  large  class  of 
nonlinear  partial  differential  equations  are  shown  to  exhibit  so-called  soliton  solutions 
[1].  Even  though  these  systems  are  governed  by  infinite  dimensional  equations,  their 
solutions  exhibit,  coherent  and  regular  structures.  On  the  other  extreme  there  are  systems 
governed  by  very  simple  nonlinear  (differential)  equations  whose  (numerical)  solutions 
exhibit  complicated  structures.  This  class  of  systems  whose  solutions  are  very  sensitive  to 
initial  conditions  are  generally  called  chaotic  systems  [2]. 

Generally  the  dynamics  of  macroscopic  systems  follows  the  principles  of  classical 
mechanics  while  microscopic  systems  (atomic  and  sub  atomic  systems)  obey  the  princi- 
ples of  quantum  mechanics.  The  definition  and  the  meaning  of  chaos  in  classical  systems 
is  well  understood  now.  In  fact  there  are  many  numerical  techniques  (both  qualitative  and 
quantitative)  to  characterize  chaos  in  classical  systems.  On  the  other  hand  the  underlying 
equation  of  motion  in  quantum  mechanics,  namely  the  Schrodinger  equation,  is  linear  in 
nature.  Moreover  the  finiteness  of  Planck's  constant  impose  problems  in  carrying  over  the 
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tools  used  for  defining  chaos  in  classical  mechanics.  Hence  characterizing  chaos  in 
quantum  systems  is  still  a  daunting  task.  However  significant  breakthroughs  have  been 
achieved  by  probing  the  appropriate  quantum  mechanical  quantities  (mainly  eigenvalues 
and  eigenfunctions).  In  fact  some  fingerprints  and  signatures  of  chaos  in  quantum 
systems  are  well  known  [3,4].  In  this  review  we  concentrate  on  the  manifestations  of 
chaos  in  atomic  systems  which  are  not  only  suitable  for  both  classical  and  quantal 
treatments  but  also  amenable  to  real  time  laboratory  experiments. 

By  classical  chaos  in  atomic  systems  we  mean  that  the  atomic  electron  moves  in  a 
seemingly  erratic  fashion  in  the  phase  space  (although  governed  by  deterministic 
equations)  and  as  a  result  of  it,  explores  most  of  the  available  phase  space  if  allowed  to 
evolve  for  sufficiently  long  time.  Even  if  the  electron  starts  its  journey  from  an 
infinitesimally  different  point  in  the  phase  space,  its  dynamics  will  follow  an  entirely 
different  path.  It  has  been  shown  that  the  paths  separate  exponentially  and  the  degree  of 
their  separation  is  characterized  by  the  so-called  Lyapunov  exponent  [2].  This 
exponential  deviation  of  nearby  trajectories  is  characteristic  of  many  nonlinear  and 
non-integrable  (i.e.  which  do  not  posses  sufficient  number  of  independent  constants  of 
motion)  systems. 

Energy  level  clustering  is  the  hallmark  of  quantal  systems  whose  classical  counterparts 
exhibit  regular  motion.  That  means  there  is  a  large  probability  for  small  spacing  and  the 
spacing  between  adjacent  levels  follow  the  Poisson  distribution.  Level  repulsion  is  the 
predominant  fingerprint  of  chaos  in  quantum  systems.  As  a  result  of  this,  the  spacing 
between  the  adjacent  energy  levels  follows  the  Wigner  distribution.  Moreover,  if  we 
probe  different  regions  of  the  parameter  space,  one  may  be  able  to  notice  many  avoided 
level  crossings.  In  fact  this  conjecture  has  been  successfully  verified  in  many  model 
calculations  [5].  In  this  review  we  plot  the  energy  levels  of  different  atomic  systems  for  a 
range  of  parametric  values. 

The  hydrogen  atom  is  the  simplest  atomic  system.  Pure  hydrogen  atom  is  of  not  much 
interest  as  both  its  classical  and  quantal  behaviour  are  well  known.  But  hydrogen  atom  in 
an  external  electromagnetic  field  turns  out  to  be  one  of  the  best  testing  grounds  for 
understanding  the  concepts  of  chaos  [2-4].  In  §2  of  this  review  we  consider  the  chaotic 
dynamics  of  hydrogen  atom  in  time  independent  external  fields  which  are  accessible  for 
laboratory  experiments  such  as  static  electric  or/and  magnetic  fields.  Section  3  deals 
with  the  chaotic  dynamics  of  hydrogen  atom  in  other  interesting  external  fields  such  as 
van  der  Waals  field  and  mass  anisotropy.  In  §  4  we  discuss  the  chaotic  dynamics  of  the 
hydrogen  atom  in  time  dependent  external  electromagnetic  fields.  In  particular  we 
concentrate  on  linearly  and  circularly  polarized  microwave  fields.  In  §  5,  we  enumerate 
our  conclusions. 


2.  Hydrogen  atom  in  static  electric  or/and  magnetic  fields 

We  assume  that  the  nucleus  has  an  infinite  mass  and  is  at  rest,  and  neglect  the  spin  of  the 
electron  for  simplicity.  Relativistic  effects  are  negligible  and  further  we  always  consider 
the  vanishing  magnetic  quantum  number  case  wherever  it  is  a  good  quantum  number.  All 
Hamiltonians  are  written  in  dimensionless  atomic  units  (me  =  \e\  =  \h\  =  1)  unless 
otherwise  stated. 
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Figure  1.  The  (regular)  trajectory  in  the  p-z  plane  of  a  hydrogen  atom  in  an  electric 
field  associated  with  the  Hamiltonian  #ef  for  F  =  9950  V/cm  and  n  =  10  manifold  of 
the  field  free  case. 

2.1  Hydrogen  atom  in  an  electric  field  (Stark  effect) 

To  begin  with,  let  us  consider  the  well  known  and  the  simplest  external  field  problem, 
namely  the  hydrogen  atom  in  a  static  electric  field.  The  Hamiltonian  of  the  hydrogen 
atom  in  an  uniform  static  electric  field  applied  parallel  to  the  z-axis  is  given  as  [6, 7] 

Here  r  and  p  are  the  position  and  momentum  coordinates  expressed  in  three 
dimensional  Cartesian  coordinate  system.  F  represents  the  strength  of  the  external 
electric  field  usually  measured  in  the  units  of  V/cm.  In  some  sense  this  is  one  of  the 
least  complicated  problems:  it  is  simple  because  it  is  separable  (and  hence  integrable) 
in  many  coordinate  systems.  That  means  one  can  find  the  required  number  of  independent 
constants  of  motion,  so  its  classical  behaviour  is  regular  for  arbitrary  field  strengths  F 
[3].  In  figure  1,  we  have  shown  a  representative  classical  trajectory  (in  cylindrical 
coordinates  which  are  defined  as  p  =  -\/jc2  +  v2, 0  =  tan"1  (y/x),  z  =  z)  for  a  (high)  field 
strength  of  F=9950  V/cm  and  an  energy  value  which  corresponds  to  n  =  10  manifold  of 
the  field  free  case.  From  figure  1  it  is  evident  that  the  electron  follows  a  well  defined 
regular  path. 

The  electric  field  problem  at  the  same  time  is  complicated  in  a  sense  that  any 
atomic  state  ionizes  spontaneously  in  the  presence  of  an  uniform  electric  field.  The 
ionization  process  exhibits  a  threshold  behaviour  which  is  related  to  the  critical  point 
(saddle  point)  of  the  underlying  potential  energy  surface.  The  energy  of  the  saddle 
point  is  given  as  Esp  =  E,  -  1\ff  where  £,  is  the  ionization  potential  of  the  unperturbed 
atom  (it  is  the  amount  of  energy  required  to  remove  an  electron  out  of  atom  —  we  choose 
Ei  =  0)  [8].  For  energy  less  than  ESp,  the  excited  electron  remains  essentially  bound 
and  the  Stark  states  are  quasi-stable  with  respect  to  the  ionization,  while  for  energy  higher 
than  Esp,  the  atom  ionizes.  At  this  stage,  the  quasi-stable  energy  levels  of  the  atomic 
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Figure  2.  The  regular  arrangement  of  resonances  (complex  eigenvalues)  of  a 
hydrogen  atom  in  an  electric  field  associated  with  the  Hamiltonian  7/ef  for 
F  =  9950V/cm  from  just  below  the  classical  ionization  limit  Esp  up  to  positive 
energies. 


states  are  superimposed  on  ionization  continua.  Now  the  atomic  states  are  associated  with 
complex  energy  eigenvalues  whose  real  part  Er  gives  the  energy  and  imaginary  part  £im 
gives  the  half-  width  of  the  decaying  states  [8,9]. 

In  figure  2  we  have  shown  the  resonances  (the  real  and  imaginary  parts  of  the  complex 
eigenvalues)  of  the  hydrogen  atom  in  an  electric  field  of  strength  F  =  9950  V/cm 
from  just  below  the  classical  ionization  limit  £sp  up  to  positive  energies.  Far  from 
being  randomly  scattered  around,  they  appear  in  lines  spreading  out  in  the  lower  half 
of  the  complex  energy  plane.  Each  such  string  is  made  up  of  resonances  of  one  n- 
manifold  [9]. 

Thus  what  we  notice  is  that  in  the  case  of  hydrogen  atom  in  an  uniform  static  electric 
field,  the  bounded  classical  trajectories  are  regular  while  the  quantum  resonances  (above 
the  classical  saddle  point)  also  follow  a  definite  regular  pattern  for  any  arbitrary  electric 
field  strength.  This  might  be  due  to  the  integrable  nature  of  the  problem. 


2.2  Hydrogen  atom  in  a  magnetic  field  (Zeeman  effect) 

Unlike  the  electric  fields,  magnetic  fields  do  not  ionize  atoms.  They  tend  to  compress  the 
charge  to  the  nucleus  and  increase  the  binding.  Magnetic  fields  have  played  useful  and 
important  role  in  the  history  of  atomic  physics.  In  fact  the  Zeeman  effect  provided  the 
first  direct  evidence  of  the  electromagnetic  origin  of  light  and  the  Stern-Gerlach 
experiment  proved  the  reality  of  spatial  quantization.  The  Hamiltonian  of  the  hydrogen 
atom  in  an  uniform  static  magnetic  field  can  be  written  as  [3,  .10] 


p 


B 


(2.2) 


where  Lz  is  the  z-component  of  the  angular  momentum  and  B  is  a  measure  of  the  strength 
of  the  magnetic  field  usually  measured  in  units  of  Tesla.  For  weak  magnetic  fields,  the 
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last  term  which  is  quadratic  in  B  (usually  called  the  diamagnetic  term),  is  negligible.  By 
treating  the  third  term  (the  paramagnetic  term)  as  a  perturbation,  we  obtain  the  familiar 
result  (available  in  many  text  books  on  quantum  mechanics)  [11] 

E(n,/,m)  =  -^2+|fl.  (2.3) 

Here  n,  /  and  m  are  respectively  the  principal,  orbital  and  magnetic  quantum  numbers. 
In  this  perturbative  regime,  all  the  three  quantum  numbers  n,/  and  m  are  good,  and 
therefore  the  linear  Zeeman  term,  for  a  fixed  magnetic  quantum  number  'm',  just  adds  a 
static  quantity,  a  constant  shift.  As  Lz  (which  is  quantized  as  Lz  =  mK)  is  a  constant  of 
motion,  the  classical  motion  remains  regular  in  this  regime. 

2.2.1  Quadratic  Zeeman  effect:  If  we  include  the  quadratic  term  in  magnetic  field  as 
well,  many  interesting  features  come  into  play  [12, 13].  In  fact  one  has  to  include  these 
quadratic  terms  when  the  strength  of  the  magnetic  field  is  high.  The  leading  terms  in  the 
average  value  of  (x2  +  v2}  are  of  the  order  of  B2n4,  while  for  the  Coulombic  potential  the 
gap  between  adjacent  n-manifolds  is  of  the  order  of  n~3.  Hence  we  see  that  the 
diamagnetic  term  has  greatest  effect  at  high  n  values.  In  fact  there  are  3  regimes  for  this 
system,  1)  low,  2)  high  and  3)  intermediate  fields  [14].  We  discuss  the  salient  features  of 
each  of  these  in  the  following  subsections. 

Low  field  regime.  In  this  regime  the  diamagnetic  term  is  treated  as  a  perturbation 
to  the  Coulombic  potential.  Here  /-mixing.  (/  is  not  a  good  quantum  number:  at 
moderate  field  strengths  different  /-states  within  a  given  n-manifold  mix)  occurs  but 
there  is  no  n-mixing.  As  a  result,  the  classical  motion  remains  predominantly 
regular  in  this  regime,  and  for  small  but  finite  field  strengths,  there  exist  an 
adiabatic  invariant  (discussed  later  in  detail  for  the  Stark-quadratic  Zeeman  problem) 
which  is  helpful  in  identifying  the  so  called  rotator  and  vibrator  states  [15].  Actually 
the  ratio  of  the  diamagnetic  and  Coulombic  terms,  namely  B2n7  mainly  determines 
the  behaviour  of  the  system.  It  has  been  shown  qualitatively  that  the  onset  of  n- 
mixing  occurs  when  B2n7  ~  y,  so  for  n  =  6  manifold  one  needs  to  have  a  magnetic 
field  of  strength  about  794  Tesla  while  for  n  =  80  manifold  only  0.09  Telsa  is  sufficient 
to  induce  n-mixing.  Therefore  it  is  clear  that  for  highly  excited  states  (the  Rydberg 
states),  a  small  magnetic  field  is  sufficient  to  induce  n-mixing;  energy  levels  belonging 
to  different  n-manifolds  may  interpenetrate  [12],  and  the  perturbative  treatment  is  not 
valid. 

High  field  regime.  For  very  high  magnetic  fields,  the  diamagnetic  potential  dominates 
and  the  Coulomb  potential  may  be  neglected.  This  is  the  Landau  regime  [11],  and  the 
Hamiltonian  simplifies  to 

Hhmf=^  +  ~(^+y2).  (2.4) 

If  we  neglect  the  free  motion  in  the  z  direction,  we  get  a  two  dimensional  isotropic 
harmonic  oscillator  with  oscillator  energy  nu  =  B,u  =  ^c  (uc  is  the  cyclotron 


frequency).  The  quantum  energy  levels  are  given  by 

ELM  =  (N  +  $B,  (2.5) 

where  N  =  0,1,2,...  labels  the  Landau  levels.  If  we  include  the  Coulombic  terms 
perturbatively,  then  one  can  show  that  [16]  the  quantum  energy  levels  are  expressed  as 

ENm=Em+NB,     £m  =  (|m[  +  l)|,  (2-6) 

where  'm'  is  the  magnetic  quantum  number.  The  classical  motion  is  regular  in  this 
regime. 

Quasi-Landau  resonances.  In  1969,  Garton  and  Tomkins  [17]  discovered  a  dramatic 
modulation  in  the  absorption  spectrum  of  barium  in  a  field  of  2.5  Tesla  (hereafter 
abbreviated  as  T).  The  modulation  pattern  now  called  as  quasi-Landau  resonances, 
extends  above  the  zero-field  ionization  limit  of  the  atom  and  has  a  period  close  to  1.5u;c 
(wc  is  the  cyclotron  frequency).  At  higher  energies,  the  period  approaches  wc,  as  might  be 
expected  for  a  free  electron. 

To  understand  these  periods,  we  neglect  the  z-motion  and  treat  the  problem  as  2 
dimensional.  Using  the  primitive  WKB  expression 

r  v/2E-2V(p)d/9  =  (n  +  i)7r,  (2.7) 

J  PI 

where  p  =  \A2  +  y2,  p\  and  pi  are  the  classical  turning  points  and  V  (p)  is  the  effective  2 
dimensional  potential  given  by 

VM  =  -i  +  §  +  fiz  +  yP2,  (2.8) 

p      2fr       2  o 

the  separation  between  the  levels  can  be  found  as  [18] 


/  _ 
V2E-2V(p)dp      .  (2.9) 

Integrating  this  equation  between  the  turning  points  at  E  =  0,  we  get  d£/dn  =  1.5wc. 
Numerical  evaluation  for  higher  energies  shows  that  the  spacing  slowly  diminishes. 
Several  researchers  have  studied  these  quasi-Landau  resonances  experimentally  and 
confirmed  these  findings  [19]. 

Intermediate  field  regime.  In  this  regime,  the  magnetic  field  strength  becomes 
comparable  to  the  Coulombic  field  strength.  The  problem  loses  both  its  spherical 
symmetry  (due  to  the  Coulombic  term)  and  cylindrical  symmetry  (due  to  the  quadratic 
Zeeman  term).  The  problem  is  not  only  non-separable  and  non-integrable,  but  reaches  a 
non-perturbative  regime.  The  problem  cannot  be  solved  by  any  analytical  method.  One 
needs  to  resort  to  numerical  methods. 

In  this  regime,  adjacent  n-manifolds  interpenetrate  [20],  The  levels  repel  strongly  and 
the  regularities  of  the  energy  spectrum  are  lost.  In  figure  3  we  show  a  chaotic  classical 
trajectory  (in  cylindrical  coordinates)  at  a  magnetic  field  strength  of  B  =  1  T  and  for  an 
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Figure  3.  The  (chaotic)  trajectory  in  the  p-z  plane  of  a  hydrogen  atom  in  a  magnetic 
fie  d  associated  wjth  the  Hamiltonian  tfmf  for  5  =  6T  and  n  =  40  manifold  of  the 
ircc  C3.sc. 


-35.0 


6.4          6.6 

B  (in  Tesla) 

Figure  4.    The  energy  levels  belonging  to  the  410th  to  438th  eigenstates  of  a 
B =  I™?™  m  "  magnetlC  fldd  aSSOdated  Wi*  ^  Hamiltonian  *U  for  the  range 


energy  value  that  corresponds  to  the  n  =  40  manifold  of  the  field  free  case.  Quantal 
spectra  for  magnetic  field  strengths  between  6  T  to  7  T  are  shown  in  figure  4  We  have 
included 129  energy  levels  between  the  energy  range  of  E  =  -45.0 cm- >  and  -35  Ocrn^ 
By  careful  visual  inspection,  one  can  notice  many  avoid  crossings  arising  due  to  the 
mixing  of  adjacent  states.  These  avoided  crossings  are  considered  to  be  the  fingerprints  of 
classical  chaotic  motion  [5].  In  fact  from  extensive  numerical  investigation  it  has  been 
found  that  as  the  strength  of  the  magnetic  field  is  increased  from  low  to  high  vies  Ae 
system  gradually  makes  a  transition  from  ordered  to  chaotic  motion"  [21]  In  fact 

avaiiabie 


2.3  Hydrogen  atom  in  a  parallel  electric  and  magnetic  field  (Stark-quadratic  Zeeman 
effect) 

After  investigating  the  quadratic  Zeeman  effect  (QZE)  in  more  details,  it  has  now  become 
desirable  to  find  other  paradigmatic  and  experimentally  realizable  systems  which  have  all 
the  desirable  properties  of  QZE,  but  contain  additional  simple  perturbations.  One  such  a 
system  can  be  produced  by  adding  an  electric  field.  The  resulting  Stark-quadratic  Zeeman 
effect  (SQZE)  is  attracting  the  interest  of  theorists  [25]  and  experimentalists  [26,27], 
Among  the  various  possible  relative  orientations  of  the  fields,  parallel  field  alignment 
alone  ensures  the  survival  of  the  magnetic  quantum  number  (m)  as  a  good  quantum 
number  throughout  the  chaotic  regime  and  hence  reduces  the  phase  space  dimension  of 
the  problem  to  a  manageable  size. 

The  Hamiltonian  of  a  hydrogen  atom  in  a  parallel  electric  and  magnetic  field 
configuration  is 


where  F  is  the  strength  of  the  electric  field  which  is  applied  along  the  z-axis  parallel  to 
the  magnetic  field.  For  the  parallel  field  problem  an  approximate  constant  of  motion  Ap 
can  be  found  for  the  weak  field  limit  (in  the  perturbative  regime)  [26,  27] 

Ap  =  4A2  -  5A\  +  10/5AZ,  (2.1  1) 

where  A  =  L  x  p  +  r/r  is  the  Runge-Lenz  vector  and  (3  =  l2F/5B2n4  measures  the 
relative  strength  of  the  electric  and  magnetic  fields.  In  fact  for  F  =  0  (QZE),  we  get  the 
approximate  constant  of  motion  for  the  magnetic  field  problem.  The  total  energy  within  a 
given  n-manifold  (neglecting  the  paramagnetic  term)  is  given  as 


By  quantizing  Ap  semiclassically,  3  different  types  of  states  can  be  identified.  Type  I 
and  n  states  arise  from  degenerate  odd  and  even  librational  diamagnetic  states  (they  lie  at 
Ap  =  25ft2  for  0  <  ft  <  ^).  Type  ffl  states  arise  from  the  evenly  separated  odd  and  even 
rotational  states  of  diamagnetic  manifold  (they  lie  at  Ap  =  10/3  -  1  for  5  <  f3  <  1).  These 
theoretical  predictions  have  been  checked  experimentally  for  hydrogen  and  lithium  atoms 
[26,27]. 

In  order  to  understand  these  regimes,  in  figure  5  we  have  plotted  all  the  (numerically 
computed)  energy  levels  of  n  =  30  manifold  of  the  hydrogen  atom  for  F  ranging  from  0 
to  40  V/cm,  at  magnetic  field  2.33  T.  Our  calculations  were  performed  at  unit  intervals  of 
F  and  hence  finer  details,  such-  as  avoided  crossings,  may  be  incorrect.  All  the  above 
mentioned  three  regions  are  clearly  marked  in  the  figure. 

As  far  as  non-perturbative  calculations  are  concerned,  Richter  et  al  [28]  carried  out 
numerical  quanta!  calculations  of  cases  where  (i)  the  magnetic  field  was  dominant,  (ii) 
the  electric  field  was  dominant  and  (iii)  both  electric  and  magnetic  fields  were  of 
comparable  magnitude.  Surprisingly  different  n-manifolds  overlapped  without  significant 
mutual  interaction.  The  avoided  crossings  of  states  belonging  to  adjacent  n-manifolds 
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Figure  5.  The  three  distinctive  regions  of  energy  levels  of  n  =  30  manifold  of  a 
hydrogen  atom  in  a  parallel  electric  and  magnetic  field  associated  with  the 
Hamiltonian  Hp  for  B  =  2.33  T  and  E  over  the  range  [0-40]  V/cm. 
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Figure  6.  The  patternless  arrangement  of  resonances  (complex  eigenvalues)  of  a 
hydrogen  atom  in  a  parallel  electric  and  magnetic  field  associated  with  the 
Hamiltonian  Hp  for  E  =  5 14200  V/cm  and  B  over  the  range  [0,  1504]  T.  Open 
ckcles  belong  to  B  =  0  case  while  +  signs  belong  to  B  ^Q  cases. 


were  very  small  and  were  beyond  graphical  resolution.  As  a  consequence  of  this  most  of 
the  classical  phase  space  regimes  remained  regular  (of  course,  all  the  calculations  have 
been  reported  around  n  =  24  manifold).  We  believe  that  even  though  the  hydrogen  atom 
in  parallel  fields  is  a  non-separable  problem,  identifying  chaotic  regimes  seems  to  be  a 
difficult  task  [28];  recently  the  bifurcations  of  periodic  orbits  of  this  system  have  been 
reported  [29, 30]  which  may  lead  to  the  understanding  of  manifestation  of  chaos  in  this 
system. 


When  the  energy  and  field  strengths  are  increased  to  far  beyond  the  Stark  saddle  point, 
one  can  observe  irregularity,  particularly  in  the  arrangement  of  resonances  in  the  complex 
energy  plane  [8,9].  In  figure  6  (ref.  [8,9])  the  resonances  for  a  fixed  electric  field 
F  =  514200  V/cm  and  magnetic  fields  B  varying  from  0  T  to  1504  T  are  shown. 
Resonances  for  B  =  0  case  are  indicated  as  open  circles,  and  for  B  ^  0  as  plus  signs.  The 
overall  picture  looks  quite  complicated  as  the  resonances  are  arranged  in  a  patternless 
fashion,  contrary  to  the  electric  field  case  (figure  2). 

2.4  Hydrogen  atom  in  crossed  electric  and  magnetic  fields 

Among  the  hydrogen  atom  in  static  (time  independent)  external  field  problems,  the  case 
of  crossed  electric  and  magnetic  fields  is  considered  to  be  the  most  difficult  (at  least  for 
theoretical  investigations)  [31].  Here  the  magnetic  field  is  usually  applied  along  the  z  axis 
while  the  static  electric  field  is  applied  along  the  x  axis.  The  Hamiltonian  can  be  cast  in 
the  form, 

Hc=P~-l-  +  ^Lz  +  ^(x^+y^-Fx.  (2.13) 

The  crucial  point  to  note  down  here  is  that  the  z  component  of  angular  momentum, 
namely  Lz  is  not  a  conserved  quantity,  so  the  problem  cannot  be  reduced  to  2  dimensions 
in  a  straightforward  way  as  in  the  case  of  parallel  electric  and  magnetic  fields.  Moreover, 
like  the  Stark  effect  case,  the  dynamics  is  scattering  above  the  (ionization)  threshold 
value,  and  quasi-bound  below  it. 

In  recent  experimental  investigations,  a  class  of  quasi-Landau  resonances  in  the  spectra 
of  rubidium  Rydberg  atoms  in  crossed  electric  and  magnetic  fields  have  been  observed 
[32].  These  set  of  resonances  are  associated  with  a  small  number  of  planar  periodic 
orbits,  so  it  is  believed  that  even  the  planar  limit,  i.e,  z  —  pz  =  0  of  the  above  problem 
may  reveal  interesting  dynamical  behaviour  [33]. 

Using  simple  critical  point  analysis,  it  is  very  easy  to  show  that  there  is  a  critical  point, 
independent  of  the  magnitude  of  the  magnetic  field  B,  at  E  =  — 2\/F  (for  the  planar  case) 
which  coincides  with  the  approximate  Stark  saddle  point  discussed  in  §2.  In  our 
investigations  we  stick  ourselves  to  the  planar  system. 

For  convenience  we  scale  the  position  and  momentum  variables,  r  — »•  fi~2/3r  and 
p  -s-  5!/3p.  The  advantages  of  using  such  scaling  relations  are  described  later  in  detail  in 
§  4.  In  terms  of  scaled  variables  the  above  Hamiltonian  becomes  [34] 

HI  =  EB~^  =  t-  - 1  +  ilz  + 1  tf  +  y2}  -  FB'^x.  (2.14) 

L        r       L  o 

In  figure  7  we  have  sliced  the  classical  phase  space  that  corresponds  to  Es  =  -1.0 
using  y  =  0  and  py  —  arbitrary  plane.  Whenever  the  classical  trajectory  pierce  through 
this  plane  we  record  the  x  and  px  values  and  obtain  the  so  called  Poincare  surface  of 
section  (PSS)  for  a  set  of  initial  conditions.  To  start  with,  we  choose  the  value  of/  well 
below  the  critical  point,  /  =  FB~4/3  =  0.2.  Each  set  of  initial  condition  leads  to  an 
invariant  curve  on  the  surface  of  section  which  indicates  that  the  motion  is  quasi  periodic 
in  the  actual  phase  space  and  hence  the  resulting  dynamics  is  largely  regular. 
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Figure  7.  The  Poincare  surface  of  section  for  various  initial  conditions  of  a 
hydrogen  atom  in  a  crossed  electric  and  magnetic  field  associated  with  the 
Hamiltonian  Hsc  for  the  scaled  energy  Es  =  -1.0  and  scaled  electric  field/  =  0.2. 
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Figure  8.  The  Poincare  surface  of  section  for  various  initial  conditions  of  a 
hydrogen  atom  in  a  crossed  electric  and  magnetic  field  associated  with  the 
Hamiltonian  Hsc  as  in  figure  7  but  for/  =  0.2499. 

However  if  we  increase  the  value  of/,  when/  approaches  the  critical  value,  we  observe 
chaotic  motion.  In  figure  8  we  show  a  representative  PSS  plot  for  /  =  0.2499  and 
Es  =  -1.0.  Some  initial  conditions  lead  to  invariant  curves  while  many  others  lead  to 
chaotic  motion:  a  sizable  fraction  of  the  phase  space  is  occupied  by  chaotic  trajectories. 

If  we  increase/  further  and  cross  the  critical  value,  the  system  enters  the  scattering 
regime  [33],  giving  a  'hole',  appearing  in  certain  regions  of  the  PSS  plot.  As/  increases, 
the  hole  region  expands  in  size.  The  origin  of  this  hole  is  attributed  to  the  velocity 
dependent,  Corriolis  like  force  in  the  Newton's  equations  of  motion  (transparent  when 
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written  down  in  semi-parabolic  coordinates),  arising  due  to  the  non  conservation  of  Lz  in 
the  planar  system  [33]. 

.  Thus,  unlike  the  parallel  field  problem,  as  the  electric  field  strength  is  increased  from 
far  below  critical  point  to  just  below  the  critical  point,  the  system  undergoes  a  clear 
transition  from  regular  to  chaotic  motion,  even  in  the  2  dimensional  planar  case. 

As  far  as  the  3  dimensional  case  is  concerned,  if  we  increase  the  electric  field  far  above 
the  critical  point,  resonances  (complex  eigenvalues)  start  to  appear.  If  we  increase  the 
magnetic  field  also  sufficiently,  a  large  number  of  resonances  begin  to  appear.  In  fact,  the 
nearest  neighbour  spacing  distribution  of  the  complex  eigenvalues  which  belong  to  the 
chaotic  scattering  region  follow  the  so-called  Ginibre  distribution  given  as  [35] 


At  the  same  time  the  classical  trajectory  calculations  reveal  fractal  structures  in  classical 
chaotic  ionization  in  these  regimes  [35]. 

3.  Hydrogen  atom  in  other  time  independent  external  fields 

So  far  we  have  discussed  the  chaotic  dynamics  of  hydrogen  atom  in  static  electric  or/and 
magnetic  fields.  Now  we  will  consider  some  other  interesting  time  independent  external 
fields  which  can  be  realized  physically. 

3.1  Hydrogen  atom  near  a  metal  surface 

The  force  between  an  atom  and  a  nearby  metal  surface  is  given  by  the  so-called 
(instantaneous)  van  der  Waals  force.  The  Hamiltonian  of  a  hydrogen  atom  near  a  metal 
surface  can  be  derived  using  the  method  of  images  as  [36] 

„  _P2    i     i  .  i 

~-- 


Here  d  is  the  atom-surface  distance.  When  the  atom  is  at  a  larger  distance  in  comparison 
to  its  size  (d2  >  r2),  the  above  Hamiltonian  simplifies  to  the  so  called  instantaneous  van 
der  Waals  Hamiltonian  H-wp  given  as 

(3'2) 


The  above  Hamiltonian  //ivp  is  particularly  useful  near  the  ground  sfate  when  the  size  of 
the  atom  is  quite  small,  whereas  the  Hamiltonian  Hm  is  quite  useful  when  dealing  with 
Rydberg  states  for  which  the  size  of  the  atom  is  relatively  big.1a$  figure  9  we  have  shown 
the  quantum  spectrum  of  a  hydrogen  atom  over  an  atom-surface  distance  range  of  100  nm 
to  300  nm.  The  calculations  were  performed  for  n  =  20  Rydberg  states  using  the 
Hamiltonian  Hm.  A  number  of  avoided  crossings  indicate  that  there  can  be  chaos  at  the 
classical  level  in  this  regime.  On  the  other  hand,  if  we  use  Hamiltonian  #ivp  and  also 
consider  low-lying  states,  no  such  avoided  crossings  appear  [37].  Moreover  in  both  the 


Chaos  in  the  hydrogen  atom 
0.0 

-10.0 
-KT     -20.0 

!jr    -3o.o 

I     -40.0 


u. 


-50.0 


-60.0 


-70.0. 

100.0          130.0 


«nge  of  KXK300  nm. 


160.0          190.0          226:0"  ^0.0™'  280.0  " 
d  (in  nm) 

-free  position  for  an  atom-surface  separation 


tates  and  at  larger 
mailer  atom-surlce 
are  yet  to  be 


hence  d°  not  i 

S  "  regular  near  *e  «rol"«i 
states  »d 
Classical 


•2  Hydrogen  au,m  in  a  generalized  van  der  Vfaals  potential 


amiltonian  given  as 


van  der  Waak  (GVP) 


*man,  (iii)  instantaneous  van 
-e  guide  problems. 


(3-3) 
quadratic 


Table  1.    Classical  and  quantal  results  of  hydrogen  atom  in  a  generalized 
van  der  Waals  potential  for  various  (3  values. 


Classical  phase  space 


Quantal  NNS  distribution 


0.25  Predominantly  chaotic 

0.5  Completely  regular 

A/O4  Both  regular  and  chaotic 

1.0  Fully  regular 

1.5  Both  regular  and  chaotic 

2.0  Completely  regular 

3.0  Predominantly  chaotic 
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Figure  10.    The  NNS  distribution  of  a  hydrogen  atom  in  a  generalized  van  der  Waals 
potential  associated  with  the  Hamiltonian  H^p  for  ft  =  \. 


of  a  two  coupled  sixth-power  anharmonic  oscillators  problem.  Then  by  using  the  Painleve 
singularity  structure  analysis  and  dynamical  symmetry  approach  it  is  possible  to  show 
that  the  Hamiltonian  Hgvp  is  integrable  for  (3  =  \,  1  and  2  cases  only.  Then  all  the 
recently  developed  (numerical)  techniques  to  deal  with  chaotic,  nonintegrable 
Hamiltonian  systems  can  be  used  to  show  that  there  is  a  choas-order-chaos-order- 
chaos-order-chaos  type  of  transition.  In  fact  it  is  possible  to  perform  Einstem-Brillouin- 
Keller  (EBK)  type  of  (semiclassical)  quantization  and  obtain  exact  analytical  expression 
for  the  energy  level  shifts  only  for  the  integrable  cases  namely  (3  =  $,  1  and  2.  Other  near 
integrable  cases  can  be  quantized  using  numerical  methods.  All  our  semiclassical 
predictions  were  on  par  with  our  classical  results  [40].  Some  of  the  classical  and  quantal 
results  have  been  summarized  in  table  1. 

As  already  stated,  the  nearest-neighbour  spacing  distribution  (NNS)  (the  probability  P 
for  finding  a  separation  s  of  neighbouring  energy  levels  in  the  spectrum)  of  a  quantal 
system  follow  the  Poisson  distribution  i.e,  P(s)  =  exp(-,s)  when  its  classical  counterpart 
exhibits  regular  behaviour.  Quantal  systems  follow  the  so-called  Wigner  distribution  (or 
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Figure  11.    The  NNS  distribution  of  a  hydrogen  atom  in  a  generalized  van  der  Waals 
potential  associated  with  the  Hamiltonian  //gvp  for  (3  =  3. 


Gaussian  Orthogonal  Ensemble  distribution  in  our  case  of  time  independent  system) 
when  the  corresponding  classical  system  exhibits  chaotic  behaviour.  The  quantum 
problem  associated  with  the  above  Hamiltonian  Hgvp  can  be  analysed  by  solving  the 
associated  Schrodinger  equation  numerically  in  terms  of  the  SO (4,2)  group  generators 
using  a  scaled  set  of  normalized  Coulombic  wavefunctions.  We  get  a  large  number  of 
converged  eigenvalues  and  they  can  be  used  for  the  level  statistics  analysis.  As  the 
parameter  (3  is  varied,  we  obtained  a  GOE-Poisson-Brody-Poisson-Brody-Poisson- 
GOE  type  of  transition  corresponding  to  classical  dynamics.  In  figures  (10)  and  (11)  we 
show  the  NNS  distribution  of  the  ger'^alized  van  der  Waals  Hamiltonian  for  (3  =  ^  and  3 
which  correspond  to  classically  regular  and  chaotic  situations. 

The  detailed  study  of  generalized  van  der  Waals  problem  implies  that  the  system 
makes  a  chaos-order-chaos-order-chaos-order-chaos  type  of  transition  when  the  /3 
parameter  is  varied  over  the  range  [0, 3].  In  fact  similar  conclusions  have  been  reported  in 
the  case  of  Paul  trap  Hamiltonian  problem  [39].  Some  of  the  ion  trap  experiments  have 
verified  these  results  [41]. 


3.3  Anisotropic  hydrogen  atom  problem 

So  far  we  have  discussed  systems  in  which  the  nonlinearity  was  brought  in  by  the 
interplay  between  the  Coulomb  field  and  the  external  electric  or/ and  magnetic  field(s). 
Now  we  consider  the  motion  of  a  charged  particle  with  an  anisotropic  mass  tensor  [42- 
44],  Let  the  mass  of  the  electron  along  x  and  y  directions  be  mp  while  that  along  the  z 
direction  be  m\\.  Then  the  mass  ratio  7  =  mp/m\\  is  used  to  define  the  so  called  mass 
anisotropy  as  (1  —  7).  In  reality  different  semiconducting  materials  have  different 
effective  mass  (7)  values.  For  example  using  cyclotron  resonance  work  it  has  been  found 
that  the  7  value  for  silicon  and  germanium  are  respectively  0.2079  and  0.05134  [45]. 
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Figure  12.    The  Poincare  surface  of  section  for  a  single  initial  condition  of  an 
anisotropic  hydrogen  atom  associated  with  the  Hamiltonian  #a  for  7  =  0.85. 
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Figure  13.  The  energy  levels  belonging  to  n  =  20  and  21  manifolds  of  the 
anisotropic  hydrogen  atom  associated  with  the  Hamiltonian  Ha  for  7  over  the  range 
[0.8, 1.0]. 


For  simplicity  we  consider  the  hydrogen  atom  and  introduce  the  mass  anisotropy.  The 
Hamiltonian  can  be  cast  in  the  form 

Px   +/>v  7      n  1  /o     A\ 

ij  —  H£___I  _i_  JL  n2 (3.4) 

"a  —          ~  i^  -.  Pi  •  V         / 


Here  7  =  1  corresponds  to  the  isotropic  situation.  We  gradually  decrease  7  from  unity 
and  study  the  dynamics:  for  7  =  1  (isotropic  case),  the  system  is  integrable  while  for 
7  <  1  the  system  abruptly  becomes  chaotic.  As  the  accessible  phase  space  in  the 
Cartesian  coordinate  space  is  quite  awkward  for  visual  inspection,  we  have  used  the  semi 
parabolic  coordinates  which  are  defined  as  p  —  A/Jt2  4-  y2  =  wv,  z  =  (u2  —  v2)/2.  In  these 


calculations  for  the  classical  trajectory  can  be  carried  out  in  a  nice  fashion.  The  Poincare 
surface  of  section  plot  (for  u  =  0  and  pu  >  0  plane)  in  the  v  versus  pv  plane  for  7  =  0.85 
and  an  energy  value  that  corresponds  to  n  =  20  manifold  of  the  isotropic  case  is  shown  in 
figure  12.  Even  a  single  trajectory  is  able  to  traverse  almost  the  entire  accessible  phase 
space  and  hence  the  v  —  pv  plane  is  filled  with  randomly  distributed  points,  implying  that 
the  system  becomes  completely  chaotic. 

In  figure  13  we  plot  the  quantal  eigenvalues  for  n  =  20  and  n  =  21  manifolds,  for  7  in 
the  range  [0.8, 1.0].  In  the  isotropic  case  namely  7  =  1  the  adjacent  manifolds  are  well 
separated.  As  the  anisotropy  is  gradually  introduced,  the  adjacent  n-manifolds 
interpenetrate,  which  might  be  due  to  the  existence  of  chaotic  motion  at  the 
corresponding  classical  system.  Unlike  the  other  systems  discussed  so  far,  here  the 
transition  from  regular  to  chaotic  motion  is  considered  to  be  abrupt  [43]  and  is  often 
referred  to  as  hard  chaos  [24]. 


4.  Hydrogen  atoms  in  strong  oscillating  electromagnetic  fields 

4.1  Early  experiments  and  theory 

The  hydrogen  atom  interacting  with  a  strong,  periodic  monochromatic  electromagnetic 
field  is  an  example  of  another  nonlinear  system,  which  in  a  certain  range  of  parameters 
may  display  signatures  of  classically  chaotic  behaviour. 

Classical  and  quantal  analytical  solutions  are  very  well  known  separately  for  both, 
namely  the  field-free  hydrogen  atom  and  the  free  electron  in  the  electromagnetic  field 
(see  e.g.  [11]).  Since  the  motion  of  the  electron  is  governed  by  the  internal  Coulombic 
and  external  laser  field,  it  is  expected  that  chaotic  dynamics  may  be  observed  when  both 
internal  and  external  fields  have  comparable  effect  on  the  electron.  Therefore  in  order  to 
create  'strong  fields'  and  introduce  conditions  beyond  the  perturbative  regime  for  the 
ground  state  or  low  lying  states  we  will  have  to  apply  ultrashort  intense  pulses  of  laser 
light,  whereas  for  high  Rydberg  levels  the  required  electric  fields  may  be  quite  small  in 
terms  of  the  absolute  value.  Nevertheless,  the  experiments  with  Rydberg  atoms  are  very 
difficult  due  to  the  presence  of  'stray'  electric  fields  which  may  easily  ionize  these  atoms. 
Stray  (unknown)  electric  fields  originate  from  electric  charges  present  in  the  experimental 
apparatus.  At  the  same  time,  the  experiments  with  ultra-short  pulses  are  by  no  means 
easier — it  is  extremely  difficult  to  obtain  a  good  repetition  rate  of  parameters  of  the  laser 
pulse  and  it  is  also  difficult  to  measure  their  peak  intensity  accurately  [46]. 

Rydberg  hydrogen  atoms  interacting  with  microwave  field  have  been  of  primary 
interest  for  over  20  years  [47].  Microwaves  have  a  dramatic  effect  on  excited  hydrogen 
because  the  splitting  between  levels  n  and  (n  •+- 1)  is  approximately  SE(n,n-}-l}  « 
6.6  x  106  n~3  GHz,  which  is  in  the  microwave  region  for  n  ~  50-100.  Thus  micro- 
waves cause  resonant  transitions  between  adjacent  levels  over  a  wide  range  of  Rydberg 
states.  In  a  pioneering  microwave  experiment  [47],  hydrogen  atoms  were  initially 
prepared  in  a  Rydberg  state  with  principal  quantum  number  no  «  66  and  then  were 
introduced  into  the  interaction  region  (microwave  cavity)  with  the  microwave  field  of 
frequency  i/  =  9.9  GHz.  Although  for  this  case  about  N/  =  80  photons  were  to  be 
absorbed  in  order  to  reach  the  ionization  limit  from  the  initial  state  in  the  perturbative 
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aiKaii  atoms 

Typically  in  the  ionization  experiments  with  Rydberg  atoms  one  measures  the 
electric  field  ionization  threshold,  F10%,  which  is  an  amplitude  of  the  electric  field 
for  which  10%  ionization  probability  occurs.  It  has  to  be  stressed  that  both  true 
ionization  and  the  excitation  to  above  a  certain  n-cutoff,  nc  (nc  >  no  is  determined 
experimentally)  contribute  to  the  experimental  value  of  the  ionization  field  threshold 
for  the  atom  prepared  initially  in  the  state  MO-  This  is  due  to  the  presence  of  the 
stray  electric  fields,  which  would  necessarily  cause  the  ionization.  Therefore 
experimentally  one  is  not  able  to  tell  the  difference  between  the  true  ionization  and 
the  excitation  higher  than  nc  [24].  In  order  to  explain  experimental  results  classical 
Monte-Carlo  type  theories  were  applied  to  the  problem  [51].  The  set  of  classical 
initial  conditions  were  chosen  so  as  to  mimic  the  quantum  wavefunction  probability 
distribution  and  the  ionization  probability  was  determined  as  a  ratio  of  the  number  of 
ionizing  initial  conditions  to  the  number  of  all  chosen  initial  conditions.  These 
simulations  have  shown  that  the  ionization  threshold  is  related  to  the  onset  of  chaotic 
motion  of  trajectories,  which  in  the  presence  of  critical  threshold  field  become  unbounded 
in  the  phase-space. 


4.2  Classical  model 

Let  us  write  the  Hamiltonian  of  the  hydrogen  atom  in  a  laser  field  of  an  arbitrary 
polarization  (in  the  dipole  gauge  approximation) 

(4.1) 


where  r  =  -\/x2  +  y2  +  z2  and  F,  w,  <p  are  respectively  the  electric  field  amplitude,  field 
frequency  and  initial  phase  of  the  field.  We  do  not  include  influence  of  the  magnetic  field 
because  the  associated  Lorentz  force  is  much  smaller  than  the  Coulombic  force  for  non- 
relativistic  field  intensities  (less  than  1018W/cm2 — [52])  as  the  Rydberg  electron 
velocity  v  <C  c.  Through  the  function /(f),  the  pulse  envelope,  the  turn-on  and  off  of  the 
pulse  of  length  T  over  finite  time,  r  may  be  included, 

Isin2(7rt/2r),  0  <  t  <  r 

1,  r<t<T~-r                              (42] 

cos2[7r(f  -  (T  -  r))/2r],  T  -  r  <  t  <  T                               v  '  ' 

0,  elsewhere. 

The  angle  x  £  [0»  n/4]  parametrizes  the  degree  of  ellipticity  of  the  field  polarization: 
X  =  0  for  linear  polarization  (LP)  and  %  =  7r/4  for  circular  polarization  (CP)  in 
particular.  Notice  that  in  the  case  of  u  — >  0  we  have  the  well  known  Stark  effect  described 
earlier  in  §  2. 
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r  ->    -r, 
P-A1/2?, 


F  ->  A2F,  (4.3) 

for  any  real  parameter  A.  Considering  the  hydrogen  atom  with  energy  En  =  —  l/(2n2)  in 
the  electric  field  of  amplitude  F  and  frequency  uj,  suggests  setting  A  =  n2,  and 
consequently  UJQ  =  n3w  and  FQ  =  n4F.  Thus  it  is  possible  to  study  only  the  classical 
dynamics  of  the  atom  with  initial  energy  E  =  -1/2  irrespective  of  the  initial  state  n,  in 
terms  of  the  scaled  variables:  E,  a;0  and  FQ.  Non-scaled  values  of  the  parameters  may  be 
reconstructed  from  the  scaled  variables  for  any  arbitrary  no-  The  scaled  frequency  UQ  can 
be  interpreted  as  non-scaled  frequency  expressed  in  units  of  the  Kepler  frequency 
u)K  =  (-2£n)~3y/2  of  the  electron  on  its  initial  orbit. 

However,  the  commutation  rule  is  not  preserved  under  the  scaling  transformation 
eq.  (4.3): 

scaling 

[*,/>,]  -  m  —  >  [x)A]  =  m  •  A-1/2  -  m/n.  (4.4) 

Nevertheless,  as  the  semiclassical  limit  of  n  —  »  oo  (effective  Planck  constant  H/n  —*  0) 
is  approached,  the  commutation  rule  [x,px]  —  >•  0  becomes  less  affected  by  the  scaling 
transformation.  Again,  we  see  the  reason  why  Rydberg  levels  are  of  great  interest.  The 
investigation  of  their  properties  helps  us  to  understand  the  relation  between  the  classical 
(H  =  0)  and  the  quantum  mechanics  (H  =  \  in  atomic  units)  and  the  way  of  reaching  the 
semiclassical  limit  (H  —  »  0)  in  particular.  For  the  low  frequency  limit  UJQ  <  0.2  the 
classical  scaling  is  visible  quite  clearly  in  experimental  (quantal)  results  for  the  ionization 
threshold  field  when  24  <  n  <  56  [50].  It  supports  the  validity  of  the  classical  approach 
to  the  problem,  which  in  many  respects  is  much  easier  than  the  quantum  case. 

As  in  the  case  of  static  magnetic  field,  we  transform  the  Hamiltonian  equation  (4.1) 
into  the  problem  of  two  coupled  oscillators  using  semiparabolic  coordinates  [21]  (2D)  or 
using  Kustaanheimo-Stiefel  (KS)  transformation  to  the  problem  of  four  coupled 
oscillators  with  a  common  frequency.  The  (KS)  transformation  [38,  53,  54]  exploits  the 
50(4)  symmetry  of  the  hydrogen  atom.  Then  a  new  Hamiltonian  is  used  to  write  a  set  of 
equations  of  motion,  which  form  a  set  of  coupled  differential  equations.  The 
transformation  to  the  oscillator  problems  is  advantageous,  because  the  equations  of 
motion  are  free  from  the  Coulomb  singularity  and  hence  can  be  integrated  (solved) 
numerically  to  a  very  high  accuracy.  Initial  conditions  are  chosen  according  to  the 
microcanonical  ensemble  [55].  This  ensemble  represents  a  subspace  of  constant  energy 
(corresponding  to  initially  field-free  atom)  of  the  phase-space  which  is  formed  by 
generalized  coordinates  and  canonically  conjugated  momenta.  Then  Monte-Carlo  type  of 
calculations  are  made  and  the  ionization  probability,  P,  is  obtained  as  a  ratio  of  the 
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number  of  initial  conditions  which  lead  to  the  ionization  to  the  total  number  of  initial 
conditions. 

As  it  was  mentioned  above,  classically,  the  ionization  occurs  due  to  the  break  up  of  the 
Kolmogorov-Arnold-Moser  (KAM)  tori  when  the  microwave  amplitude  is  large  enough. 
Hence  the  ionization  threshold  can  be  associated  with  the  onset  of  classical  chaos. 
Therefore,  in  the  presence  of  the  external  periodic  force  we  can  observe  escape  of  the 
electron  to  the  continuum  from  a  certain  phase-space  region  defined  by  the  initial 
conditions.  For  Hamiltonian  systems,  the  escape  may  be  significantly  slowed  down  due  to 
the  presence  of  the  remnants  of  KAM  tori  (Canton)  resulting  in  a  power-law  (algebraic) 
decay  S  ~  t~z  from  the  region  containing  KAM  stability  islands,  rather  than  exponential 
decay  S  ~  exp(~7f),  in  the  limit  of  large  time  t.  This  slow-down  may  be  greatly 
enhanced  in  the  quantum  description  because  of  the  coarse  graining  of  the  phase-space 
[56, 57]  due  to  the  finite  value  of  H.  To  get  more  insight  into  the  exponential  and  power- 
law  decay,  let  us  consider  a  mathematical  model  of  the  ionization  [58].  First,  let  us 
consider  the  canonical  Cantor  set  (the  "middle-third"  set).  We  take  interval  of  the  unit 
length.  In  the  first  step  we  reject  g=\  fraction  of  points  from  the  middle  of  the  interval, 
thus  creating  2  smaller  intervals  of  length  |  each.  In  following  steps,  repeated  ad 
infinitum,  we  just  apply  the  above  procedure  to  each  newly  created  interval  in  the 
preceding  step,  obtaining  finally  the  Cantor  set,  which  is  a  well  known  fractal  object.  The 
survival  probability  (of  not  being  deleted),  s,  of  the  point  from  the  original  unitary 
interval  after  N  +  1  steps  is  equal  to  the  Lebesgue  measure  of  the  remaining  set  of  points: 

s(N+l)  =  (l-g}s(N)  (4.5) 

and  consequently,  it  can  be  easily  verified  that  the  resulting  survival  probability  is  an 
exponential  function  of  the  step  N  (which  is  a  discretized  equivalent  of  time  t): 

s(#)=exp(-JV7),  (4.6) 

where  7  =  —  In  |1  —  g|  is  positive  (if  g  <  1).  Modifying  the  Cantor  set  construction,  and 
assuming  that  the  fraction  deleted,  gN,  is  not  constant,  but  decreases  from  one  step  to 
another  eg.:  g#  =  g/N,  we  can  write  the  survival  probability 

s(N+l}  =  (l-gN}S(N).  (4.7) 

For  large  N  one  can  check  that  a  power-law  dependence 

s(N)*N-8,  (4.8) 

fulfills  the  iterative  equation  (4.7) 

Indeed,  such  a  power-law  behaviour  of  the  survival  probability  s  has  been  observed  in 
physical  situations  approached  by  a  one-dimensional  (ID)  model  of  hydrogen  interacting 
with  the  LP  field,  included  as  a  periodic  train  of  very  short  perturbations  (the  so  called 
kicked  hydrogen  model)  [58].  This  behaviour  has  been  supported  by  a  ID  model  of 
hydrogen  with  a  monochromatic  type  of  excitation  [59].  Classical  simulations  indicate 
that  both  in  the  LP  [60]  and  the  CP  field  [61, 62]  problem,  this  sort  of  long  time  behaviour 
could  be  observed.  Quite  recently,  both  experimental  and  quantum  (numerical)  evidence 
has  been  provided  for  the  existence  of  the  power-law  (algebraic)  decay  over  several 
orders  of  magnitude  [62]. 
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In  the  quantum   approach  to  the  problem  we  have  to  solve  the  time-dependent 
Schrodinger  equation 

(4.9) 


. 

One  may  directly  integrate  this  time  dependent  equation.  It  is  however  only  possible  to  do 
so  for  pulses  (interaction  times)  no  longer  than  30-50  field  cycles  [63,  64],  because  of  the 
increasing  importance  of  interaction  of  the  wave  function  with  the  edges  (the  so  called 
absorbing  boundary  conditions)  of  the  integration  grid.  Alternatively  we  may  take  an 
advantage  of  periodic  time  dependence  in  the  Hamiltonian  equation  (4.1).  We  represent 
the  solution  of  the  Schrodinger  equation  in  the  Floquet-Fourier  expansion  as  [65] 


j 
We  may  express  each  component  \ijjj  (t))  in  terms  of  the  multiphoton  expansion 


We  substitute  this  to  the  Schrodinger  equation  and  the  resulting  Floquet  Hamiltonian  H? 
[65],  which  is  time-independent,  describes  "dressed-atom"  model.  Thus  instead  of 
discrete  states  we  have  resonances  (complex  energies).  The  time  evolution  operator  U  for 
Floquet  Hamiltonian  Hp  acting  in  the  'photon-space'  is  written  in  the  following  form  [65] 


One  can  see  that  the  time  evolution  is  dominated  by  poles  of  the  operator  (z  —  Hp)~l  near 
the  real  axis  in  the  complex  energy  plane.  These  complex  poles  (resonances)  may  be 
found  from  the  analytical  continuation  of  the  Floquet  Hamiltonian  #F(0),  which  is 
obtained  by  using  the  dilatation  transformation  [65,  66] 


re''e 


(4.13) 


The  eigenenergies  (quasi-energies)  Ej  of  the  Hp  are  found  usually  in  the  harmonic 
oscillator  basis  representation  [66]  and  if  r  <C  T  (a  flat  top  pulse  with  narrow  edges  of 
turn-on  and  off  time  —  a  rectangular  pulse  shape  approximation  (RPSA))  the  ionization 
probability  can  be  expressed  as  [62] 

r^,  (4.14) 

where  T  is  the  interaction  time  (length  of  the  pulse).  The  case  with  a  smooth  turn-on 
could  be  dealt  with  on  the  grounds  of  the  single  state  approximation  (SSA)  [67].  F/ 
[F/  =  -253(Ej)]  is  a  field-induced  width  of  theyth  Floquet  eigenstate,  which  accounts  for 
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$o  (describing  the  initial  Rydberg  state),  in  the  sum  eq.  (4.14),  then  we  may  well  apply 
the  SSA  in  which  only  this  Floquet  state  of  width  F  may  be  included  to  obtain  the  total 
ionization  probability 

?SSA  =  1  - exp(-  t  r[F0/(r)]dr).  (4.15) 

(.Jo  J 

In  the  following  subsections  we  will  consider  special  cases  of  the  hydrogen  atom 
interacting  .with  the  LP  and  CP  fields.  Then  we  will  show  some  typical  results  for  the  CP 
case  using  the  above  mentioned  classical  and  quantum  methods. 

4.4  LP  microwave  field  problem 

In  the  LP  case,  since  the  projection  of  the  angular  momentum  operator  onto  the 
polarization  axis  is  conserved,  we  may  consider  a  general  case  which  is  effectively  two- 
dimensional  (2D).  The  PSS  portrait  of  the  classical  phase-space  is  subdivided  into  regions 
from  which  a  classical  trajectory  cannot  escape  due  to  the  presence  of  invariant  KAM  tori 
[2].  If  the  perturbation  parameter  (electric  field)  exceeds  a  certain  critical  value,  FCT, 
KAM  tori  break  up,  resonances  begin  to  overlap  [68]  and  trajectories  may  enter  regions 
previously  non-accessible  to  them.  This  property  of  2D  phase-space  (and  ID  as  well)  may 
be  of  importance  if  one  is  interested  in  investigating  global  diffusion  as  a  signature  of  the 
global  chaotic  behaviour. 

Using  theoretical  and  experimental  methods  one  can  find  the  ionization  field  threshold 
in  a  vast  region  of  scaled  frequencies.  The  experiments  for  hydrogen  covered  the  scaled 
frequency  OJQ  region  from  0.021  to  2.8.  It  has  been  achieved  by  choosing  initial  states  HQ 
between  24  and  90  for  fixed  frequencies  from  the  interval  of  7.58-36.02  GHz  [24].  Thus 
for  cuo  ~  1  atoms  were  subject  to  pulses  of  length  300-400  external  field  oscillations. 
Results  of  the  experiments  allowed  to  determine  regimes  of  dynamical  behaviour  of  the 
system  and  discuss  mechanism  of  the  ionization.  We  refer  the  reader  to  ref.  [24]  for 
detailed  discussions  and  here  we  just  sketch  a  rough  description  of  main  frequency 
regions.  For  very  low  frequencies  (much  below  the  Kepler  frequency)  we  have  a  situation 
similar  to  the  static  homogeneous  field,  and  classically  ionization  occurs  as  an  escape 
above  the  potential  barrier.  The  motion  is  quasi-regular  and  quanta!  corrections  due  to  the 
tunneling  may  be  included  through  semiclassical  methods  [69].  In  the  high  frequency 
regime  (much  greater  than  the  Kepler  frequency  UJQ  >  1)  it  is  possible  to  obtain  an 
effective  classical  potential  with  respect  to  the  fast  field  oscillations  [54, 70].  In  the  high 
frequency  regime  quantum  results  indicate  much  higher  field  threshold  for  the  ionization 
than  the  classical  simulation,  which  is  referred  to  as  the  so-called  quantum  photonic 
localization  [71].  Classical  aspects  of  quantum  localization  in  microwave  ionization  has 
been  discussed  recently  [72]  and  it  has  been  found  that  part  of  the  discrepancies  with 
quantum  predictions  may  be  removed  if  the  Coulomb  singularity  is  'softened'  [72].  In  the 
intermediate  region  of  frequencies,  excitation  occurs  due  to  the  diffusive  gain  of  energy 
which  finally  leads  to  the  ionization  [24,71],  This  is  the  main  ionization  mechanism  in 
this  region,  but  some  local  maxima  on  the  field  threshold  dependence  on  the  frequency 
can  occur  due  to  the  stabilizing  influence  of  classical  resonances  [24]. 
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Chaos  in  the  hydrogen  atom 

In  the  theory  of  hydrogen  atoms  interacting  with  the  LP  field  the  classical  Kepler 
model  and  its  corresponding  quantum  counterpart  are  of  paramount  importance  [71,73]. 
The  derivation  of  the  Kepler  map  is  based  on  the  observation  that  if  the  field  frequency  is 
much  higher  than  the  Kepler  orbital  frequency,  the  strongest  interaction  between  the 
classical  electron  and  the  oscillating  field  occurs  when  the  electron  is  close  to  the  nucleus. 
Far  away  from  the  nucleus  the  electron  is  a  quasi-free  particle  and  negligible  work  is  done 
on  the  electron.  Because  the  interaction  between  nucleus  and  the  electron  in  the  field 
occurs  only  during  the  short  time,  when  the  electron  is  close  to  the  nucleus,  the  effect  of 
perturbation  could  be  approximated  by  an  instantaneous  impulse  (or  a  'kick')  which 
appears  once  every  orbital  period  of  motion.  Therefore  we  may  iteratively  write  an 
averaged  over  one  motion  period  change  of  the  electron  energy  and  its  phase  as 


<pt+i  =  Vi  +  27ro;/(-2a;M-+1)3/2,  (4.16) 

where  N  =  —  l/(2w2w)  is  the  electron  energy  in  units  of  the  photon  energy  (frequency).  N 
increases  or  decreases  depending  on  the  phase,  <f>,  which  in  turn  depends  on  the  period  of 
the  orbit  corresponding  to  a  certain  n  value,  T  —  27rn3  =  27r/(—  2uNi+i)  '2.  The 
parameter  k  =  0.822-TrF/w5/3  measures  the  strength  of  the  'kick'  for  each  orbit.  The 
long  time  evolution  (measured  in  Kepler  periods)  of  this  map  could  be  easily  found  for 
field  frequencies  UQ  >  1,  however,  the  applicability  of  this  method  may  be  restricted  for 
the  situation  where  the  significant  ionization  occurs. 

The  ID  Kepler  map  model  proved  to  be  extremely  successful  in  explaining  the  full  3D 
behaviour  in  experimental  situation  where  we  have  a  very  'elongated'  (/  <C  «o)  high 
Rydberg  states  in  the  LP  field.  The  results  of  the  quantum  model  and  comparison  with  the 
classical  predictions  for  the  high-frequency  region  (UQ  »  1)  showed  the  discrepancies  in 
predictions  of  the  ionization  threshold  which  'then  led  to  the  formulation  of  the  photonic 
localization  theory  [71]. 


4.5  CP  microwave  field  problem 

In  the  CP  case,  unlike  the  LP  situation,  the  magnetic  quantum  number  (projection  of  the 
angular  momentum  onto  the  quantization  axis)  is  not  conserved  and  there  is  no  other 
constant  of  motion  except  for  the  weak  field  limit  [74,75].  Thus  the  CP  case  needs  in 
general  a  full  3D  treatment  which  is  a  huge  task  for  quantum  calculations.  In  the  3D 
classical  model,  for  an  arbitrary  small  perturbation  it  is  possible  to  observe  the  Arnold 
diffusion  [2].  However,  it  is  possible  to  consider  a  simpler  2D  system,  for  which  KAM 
tori  still  divide  phase-space  into  secluded  regions.  The  application  of  the  2D  model  could 
be  especially  justified  if  the  initial  electron  orbit  plane  coincides  with  the  field 
polarization  plane  [76]. 

A  transformation  of  the  Hamiltonian  eq  (4.1)  (written  for  x  —  ^/4)  to  the  rotating 
frame  with  the  field  [76],  which  is  quantum  mechanically  equivalent  to  making  a  unitary 
transformation  Umt  =  exp^'u^L^)  [67],  yields  a  new  Hamiltonian 
*?         *y        9 

+  Pz  '  (4.17) 
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Figure  14.  Statistical  moments  mi  (open  symbols),  m-i  (filled  symbols)  as  a  function 
of  time.  The  pulse  has  length  of  T  =  500  field  cycles,  turn-on  and  turn-off  is  made 
over  r  =  250  field  cycles.  The  amplitude  envelop  of  the  pulse  f(t)  is  shown  with  a 
dashed  curve.  Circles  show  data  for  u  =  1.9  and  FO  =  0.07  and  diamonds  for  u  =  2.2 
and  FO  =  0.13.  The  field  values  are  taken  so  as  to  get  similar  ionization  probability,  P, 
after  the  pulse  ends  (P  K  0.04). 

Apart  from  the  missing  diamagnetic  term,  this  Hamiltonian  resembles  the  Hamiltonian 
equation  (2.13)  for  the  crossed  electric  and  magnetic  static  field  problem,  and  is  time- 
independent  if  the  change  of  the  pulse  envelope  is  negligible. 

Experiments  with  the  alkali  atoms  [50]  have  shown  that  the  intensity  for  the  CP  case 
needs  to  be  higher  than  for  the  LP  case  in  order  to  obtain  the  same  ionization  probability. 
Moreover,  for  hydrogen  in  a  Rydberg  state  the  ionization  threshold,  F^,  scales  with  the 
principal  quantum  number  n,  and  for  the  LP  case  the  threshold  scales  according  to 
Fth  =  l/(9n4),  whereas  for  the  CP  field  it  scales  as  F&  =  l/(6n4)  [49].  The  latter 
experimental  finding  has  been  supported  by  theoretical  calculations  [75].  Note  that  the 
scaling  of  the  field  amplitude  is  typical  for  the  classical  mechanics  (see  eq.  (4.3)). 
Intensive  studies  of  the  CP  case  have  been  recently  conducted  only  for  a  few  years  now, 
despite  the  fact  that  early  theoretical  works  [77]  started  to  emerge  before  the  experimental 
data  were  even  available  for  the  CP  case.  The  ionization  threshold  in  the  classical  model 
displays  an  asymmetry  for  UQ  =  0  [76]  (a  change  in  the  sign  of  u  is  equivalent  to  the 
m  -*  -m  change),  which  is  supported  by  results  of  the  quantum  model  [67].  Subsequent 
studies  provided  an  extension  of  the  classical  and  quantum  results  to  the  high  frequency 
region  [54, 78].  In  the  CP  problem  there  were  also  investigations  on  the  applicability  of 
the  Kepler  map  [79]  and  the  classical  resonance  overlap  criterion  [80]  to  the  ionization 
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Figure  15.  lonization  probability  P  as  a  function  of  maximal  amplitude  FQ  for  long 
interaction  times  in  the  2D  model;  T  =  500  -  open  symbols  and  T  =  1500  Kepler 
cycles  (cycles  of  electron  oscillations  on  its  orbit)  -  filled  symbols;  in  each  case  turn- 
on  time  r  =  50.  Triangles  show  data  for  u>o  =  1,  and  squares  for  u>o  =  1-2. 

phenomena.  The  influence  of  the  pulse  shape  and  initial  state  preparation  on  the 
ionization  mechanism  in  classical  [54,61,81,82]  and  quantum  models  [78]  were  also 
discussed.  Recently  some  stable  packets  were  observed  in  2D  and  3D  quantum  models 
for  the  CP  [83-85],  which  demonstrate,  stabilising  influence  of  the  classical  resonance 
island.  However,  it  is  worth  to  mention  that  such  non-dispersive  packets  have  been 
studied  theoretically  in  the  LP  problem  earlier  [13]. 

4.6  Classical  versus  quantum  model 

We  will  discuss  here  some  results  for  the  2D  hydrogen  atom  interacting  with  the  CP  field 
in  the  intermediate  region  of  frequencies  (UJQ  ~  1),  where  classical  diffusion  is  the  main 
ionization  mechanism.  We  will  consider  here  only  initial  conditions  which  correspond  to 
hydrogen  circular  states  (with  the  maximum  angular  quantum  number  /)•  The  2D  model 
well  approximates  full  3D  behaviour  in  particular  when  initially  \m\  —  I  (the  classical 
electron  orbit  coincides  with  the  CP  field  polarization  plane)  [76].  For  the  sake  of 
demonstration  of  diffusive  behaviour  we  may  consider  statistical  properties  of  the 
ensemble  of  initial  conditions.  Let  us  define  statistical  moments  [54,  86] 


m2(t)  =  <(n(0  -  <« 


(4.18) 
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Figure  16.  Shanon  entropy  and  maximal  overlap  onto  the  initial  state  coefficient,  c\ 
as  a  function  of  the  field  amplitude  FQ  for  the  fixed  WQ  =  1-  Initial  circular  state  of  the 
field-free  hydrogen  atom  «o  =  25  (field  free  energy  E  of  the  2D  field-free  hydrogen 
defined  as  E  =  -(l/(2(n0  -  I/2)  ))  -  see  e.g.  [67]).  Note  that  a  jump  in  value  of  the 
Shanon  entropy  S  —  —  X}/"""'  c./m(c./)>  where  jmax  number  of  overlap  matrix  elements 
Cj  in  descending  order  yielding  99.5%  of  the  total  population  —  '%jjm  c/  =  0.995. 

Here  n  corresponds  to  the  principal  quantum  number  and  is  calculated  from  the  energy  of 
the  electron  En(t)  —  -l/[2n2(f)].  We  assume  that  initially,  n(t  =  0)  =  8(n0  -  1).  In 
figure  14  we  have  shown  the  statistical  moments  mi  (open  symbols),  m-i  (filled  symbols) 
as  a  function  of  time.  The  pulse  has  length  of  T  =  500  field  cycles,  turn-on  and  turn-off  is 
made  over  r  =  250  field  cycles.  The  amplitude  envelope  of  the  pulse /(f)  is  shown  with  a 
dashed  curve.  Circles  show  data  for  cj  =  1.9  and  FQ  =  0.07  and  diamonds  for  u  =  2.2 
and  FQ  =  0.13.  The  field  values  are  taken  so  as  to  get  similar  ionization  probability,  P, 
after  the  pulse  ends  (P  w  0.04).  Note  that  during  the  interaction  with  the  CP  field  the 
average  excitation  m\,  as  well  as  its  standard  deviation  (related  to  square  root  of  m^) 
grow. 

Figure  15  shows  the  ionization  probability  versus  the  field  amplitude  for  frequency 
UQ  =  1  and  UQ  =  1.2  and  two  interaction  times  T  =  500  and  T  =  1500  field  cycles.  For 
the  frequency  u;0  =  1  the  curves  for  both  the  time  scales  exhibit  a  plateau  for  a  range  of 
values  of  the  field,  which  is  an  indication  of  a  stabilizing  effect  of  the  strong  1:1 
resonance.  Irrespective  of  the  interaction  time  we  observe  a  non-monotonic  dependence 
of  the  ionization  probability  on  the  field  amplitude,  demonstrating  the  trapping  influence 
of  the  classical  resonance,  and  indicates  the  possibility  of  non-hyperbolic  scattering  or 
power-law  dependence  of  the  survival  probability  on  the  interaction  time  [62],  Hence,  the 
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Figure  17.  The  10%  classical  and  quantum  amplitude  threshold  in  the  2D  model. 
The  length  of  the  pulse  T  =  500,  turn-on  (off)  time  r  =  25.  Quantum  results  obtained 
for  initial  circular  state  no  =  25  in  the  single  state  approximation  are  denoted  with 
triangles;  clasical  data  shown  with  open  dots. 


ionization  is  slowed  down  due  to  stabilizing  structures  of  phase-space.  Moreover,  there  is 
numerical  evidence  for  wave  packets  tracing  the  classical  dynamics  without  dispersion, 
which  are  localized  in  a  stable  1:1  classical  resonance  island  [83-85].  For  a  slightly 
greater  frequency  (UJQ  =  1.2)  we  observe  much  efficient  ionization  for  the  longer 
interaction  time,  which  again  supports  the  diffusive  ionization  mechanism. 

To  get  more  insight  in  the  corresponding  quantum  model  behaviour,  we  define  the 
Shanon  entropy  of  Floquet  levels,  which  corresponds  to  the  initial  population  distribution 
over  a  range  of  Floquet  states,  obtained  for  a  given  value  of  the  field  amplitude  FQ  and 
frequency  UQ  as 


./max 


(4.19) 


where  ;max  is  the  number  of  overlap  matrix  elements  c/  (calculated  as  an  overlap  of  the 
j-th  Floquet  eigenstate  onto  the  initial  state  $o)  in  descending  order,  which  yield  99.5%  of 
the  total  population  —  Jjj1""  cj  —  0-995.  In  figure  16  we  depict  the  Shanon  entropy  of  the 
overlap  matrices  for  cu0  =  1  (the  2D  model,  no  =  25)  in  the  RPSA  approximation 
equation  (4.14).  Note  a  sudden  jump  in  the  Shanon  entropy  around  the  frequency  value 
for  which  we  observe  a  growth  of  the  classical  ionization  probability  (see  figure  15). 
Dotted  line  with  squares  show  value  of  the  maximal,  c\,  overlap  coefficient  (in  the  figure 
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the  values  of  c\  are  multiplied  by  10  in  order  to  use  the  same  scale  as  for  the  Shanon 
entropy).  It  indicates  that  for  FO  >  0.04  more  Floquet  states  are  needed  in  the  expansion 
equation  (4.14)  and  the  initial  state  is  spread  over  more  Floquet  states  for  these  values  of 
the  field  amplitude. 

In  figure  17  classical  and  quantum  ionization  field  thresholds  are  compared  for  a  wider 
range  of  scaled  frequencies.  Note  that  for  the  higher  frequency  CJQ  >  3.2  the  classical 
curve  suddenly  grows  to  the  high  values,  which  indicates  stabilization  against  the 
ionization,  whereas  the  quantal  results  yield  a  much  lower  field  for  the  10%  ionization 
probability. 

The  model  of  diffusive  excitation  leading  to  ionization  has  been  used  in  the  interaction 
with  linearly  and  circulary  polarized  field.  Apart  from  the  critical  value  of  the  field,  Fcr 
(numerically  approximated  by  the  F10%  value),  it  is  interesting  to  study  a  diffusion  time 
or  more  strictly  speaking  an  escape  time  from  a  certain  phase-space  region  to  the  infinity 
[87].  Investigating  the  long  time  behaviour  may  provide  us  with  the  information  on 
properties  of  the  classical  phase-space.  It  is  therefore  arguable,  that  a  10  %  threshold  time 
(length  of  the  interaction/pulse),  T10%,  is  more  informative  than  the  field  threshold 
F10%  [78J.  In  particular,  the  latter  may  be  sometimes  ambiguous  due  to  the  non- 
monotonic behaviour  ('sub-thresholds')  [88]  of  ionization  probability  as  a  function  of  the 
field  amplitude. 

5.  Conclusions 

In  this  review  we  have  discussed  classical  and  quantum  signatures  of  the  chaotic 
behaviour  of  the  hydrogen  atom  interacting  with  various  kinds  of  physically  accessible 
static  and  time-dependent  external  fields.  In  the  case  of  time  independent  external  fields, 
we  explored  bounded  classical  phase  space  regions  using  phase  space  techniques  like 
trajectory  plot  and  PSS.  In  the  corresponding  quantal  case,  we  probed  the  energy- 
parameter  dependence.  For  certain  cases,  we  discussed  the  structure  of  quantal 
resonances  in  the  complex  energy  plane. 

As  far  as  the  time  dependent  external  field  problems  are  concerned,  we  have  stressed 
validity  of  the  classical  approach  for  Rydberg  initial  states.  This  is  supported  by  the 
scaling  properties  of  quantum  and  experimental  thresholds  with  the  principal  quantum 
number  for  low  frequency  regime,  as  well  as  a  qualitative  agreement  between  classical 
and  quantum  results  in  the  diffusive  frequency  regime.  In  the  latter  regime  the  onset  of 
classical  chaos  well  approximates  the  ionization  field  threshold.  We  gave  an  example  of 
the  quantum  signature  of  classical  chaos  in  this  regime  of  frequencies  and  initial 
conditions,  as  a  sudden  growth  of  the  Shanon  entropy  (defined  over  the  distribution  of  the 
initial  population  over  Floquet  states).  This  indicates  a  complicated  level  dynamics  and 
abundance  of  level  anticrossings  [89,90],  which  give  rise  to  redistribution  of  the  initial 
state  over  a  wide  range  of  Floquet  states.  This  means  that  the  application  of  the  single 
state  approximation  for  the  calculation  of  the  ionization  probability  becomes  more 
restricted  than  the  rectangular  pulse  shape  approximation. 

In  table  2,  we  have  summarized  both  classical  and  quantal  features  of  hydrogen  atoms 
interacting  with  external  fields.  From  this  table  it  is  evident  that  the  investigation  of  even 
a  rather  simple  system,  like  the  hydrogen  atom  interacting  with  external  fields  leads  to  a 
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Abstract.  The  collinear  atom-diatom  collision  system  provides  one  of  the  simplest  instances  of 
chaotic  or  irregular  scattering.  Classically,  irregular  scattering  is  manifest  in  the  sensitive 
dependence  of  post-collision  variables  on  initial  conditions,  and  quantally,  in  the  appearance  of  a 
dense  spectrum  of  dynamical  resonances.  We  examine  the  influence  of  kinematic  factors  on  such 
dynamical  resonances  in  collinear  (He,  JEJ")  collisions  by  computing  the  transition  state  spectra  for 
collinear  (He,HD+)  and  (He,DH+)  collisions  using  the  time-dependent  quantum  mechanical 
approach.  The  nearest  neighbor  spacing  distribution  P(s)  and  the  spectral  rigidity  AS  (L)  for  these 
resonances  suggest  that  the  dynamics  is  predominantly  irregular  for  collinear  (He,HD+)  and 
predominantly  regular  for  collinear  (He,  DH"1").  These  findings  are  reinforced  by  a  significantly 
larger  "correlation  hole"  in  ensemble  averaged  survival  probability  {(/D(/)))  values  for  collinear 
(He,HD+)  than  for  collinear  (He,DH+).  In  addition  we  have  also  examined  measures  of  classical 
chaos  through  the  dependence  of  the  final  vibrational  action,  n/,  on  the  initial  vibrational  phase  $,• 
of  the  diatom,  and  Poincare  surfaces-of-section.  They  show  that  (He,HD+)  collisions  are  partly 
chaotic  over  the  entire  energy  range  (0-2.78  eV)  while  (He,  DH"1")  collisions,  in  contrast,  are  highly 
regular  at  collision  energies  bejiow  the  classical  threshold  for  reaction.  Above  the  threshold,  the 
scattering  remains  regular  for  initial  vibrational  states  v  —  0  and  1  of  DH+. 

Keywords.    Dynamical  resonances;  irregular  scattering;  collinear  collisions. 
PACS  Nos    05.45;  34.10;  34.50 


1.  Introduction 

Dynamical  resonances  in  atom-molecule  collisions  [1]  have  been  of  great  interest  since 
they  give  rise  to  unusual  variations  in  the  scattering  amplitude  and  related  quantities. 
Reactive  scattering  resonances,  which  have  been  identified  in  a  number  of  systems,  one 
of  which  is  (He,!^)  in  collinear  as  well  as  noncollinear  configurations  [2],  are 
characterized  by  oscillations  in  the  reaction  probability  PR  as  a  function  of  the  collision 
energy  E.  Many  of  them  are  of  the  Feshbach  type  and  can  be  interpreted  in  terms  of 
bound  states  supported  by  vibrational  adiabatic  potentials  in  hyperspherical  coordinates 
[2(d),3].  A  closer  examination  of  PR(E)  curves  for  collinear  (He.H^),  for  example, 
reveals  that  there  are  additional  oscillations  [2(h)]  that  are  irregularly  spaced  as  a  function 
of  the  energy  and  which  can  not  be  assigned  easily.  The  connection  between  such 
(quanta!)  resonances  and  the  features  of  the  classical  dynamics  is  the  subject  of  this 
article. 
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reaction  on  the  McLaughlin-Thompson-Joseph-Sathyamurthy  (MTJS)  potential  energy 
surface  (PES)  [5]  revealed  that  plots  of  final  vibrational  action  («/)  of  the 
reactant/product  molecule  as  a  function  of  the  initial  vibrational  phase  (0;)  of  the 
reactant  molecule  consists  of  reactive  (R)  and  nonreactive  (MR)  bands  and  a  number 
of  "chattering"  or  irregular  trajectories  in  between.  Upon  magnification,  it  was 
found  that  the  primary  chattering  region  actually  consists  of  a  number  of  alternating 
R  and  NR  bands  with  secondary  chattering  regions  separating  them.  On  further 
magnification,  it  became  evident  that  there  is  a  self-similar  pattern  repeating 
itself  endlessly  on  increased  resolution:  n/(0,-)  is  a  fractal  [6]  curve.  Similar 
behavior  has  been  found  for  several  other  systems  like  collinear  (H,  Hk)  [7]  and  (Li,  FH) 
[8].  Trajectories  in  the  chattering  regions  tend  to  be  chaotic,  and  the  origin  of  classical 
chaos  in  collinear  (He,!!^)  system  has  been  examined  [9,10]  by  plotting  the  Poincare 
surface  of  section  (SOS)  for  a  number  of  trajectories  at  energies  below  the  reaction 
threshold. 

It  has  been  shown  from  time-dependent  quantal  wave  packet  (WP)  calculations  [2(h)] 
that  the  dynamics  in  collinear  (He,^)  collisions  is  "turbulent"  when  viewed  in 
terms  of  probability  density  plots  in  configuration  space.  The  transition  state  spectrum 
[11]  is  obtained  by  time  evolving  a  wave  packet  centered  in  the  interaction  region, 
followed  by  computation  of  the  Fourier  transform  of  the  autocorrelation  function.  Peaks 
in  this  spectrum  correspond  to  dynamical  resonances,  and  statistical  analysis  of  the 
resonance  energies,  through  the  nearest  neighbor  spacing  distribution  (NNSD)  or  the 
spectral  rigidity  revealed  an  intermediate  (partly  regular  and  partly  irregular)  behavior 
for  the  system  [12].  An  ensemble  averaged  survival  probability  {{P  (?)}),  computed  [12] 
by  Fourier  transforming  the  spectral  autocorrelation  function  to  time  domain  followed 
by  averaging  over  the  initial  states  and  Hamiltonian  revealed  the  existence  of  a 
"correlation  hole"  that  is  graphic  evidence  for  "irregularity"  or  quantum  chaos  [13-16]. 
A  one-to-one  correspondence  between  resonant  periodic  orbits  (RPOs)  and  many 
transition  state  resonance  eigenfunctions  for  the  system  also  has  been  established  [11, 
17].  Therefore,  on  the  whole,  it  could  be  concluded  that  there  is  a  strong  correlation 
between  classical  chaos,  quantum  chaos  and  dynamical  resonances  for  systems  like 
collinear  (He,  Hf). 

Since  dynamical  resonances  are  intimately  connected  with  the  nature  of  the  motion,  it 
may  be  anticipated  that  kinematic  factors  play  a  crucial  role  in  such  systems.  In  the 
present  study  we  report  on  the  spectral  characteristics  and  the  behavior  of  quasiclassical 
trajectories  for  the  two  widely  differing  dynamical  systems  which  arise  from  isotopic 
substitution  of  HeHj,  in  order  to  investigate  the  role  of  kinematic  factors  in  dynamical 
resonances.  Within  the  Born-Oppenheimer  approximation,  isotopic  substitution  has  no 
influence  on  the  PES.  Therefore,  any  difference  hi  dynamical  results  must  arise  from  the 
difference  in  masses  of  the  atoms  concerned. 

We  have  computed  the  /*(£)  curves  for  collinear  He  +  HD+  and  He  +  DH+  collisions, 
with  HD+  in  different  initial  vibrational  states,  v,  and  find  that  Pj(£)  in  either  case 
behaves  very  differently  [18]  (from  each  other  as  well  as  from  the  case  of  HeHj).  Our 
results  suggest  that  the  wild  oscillations  in  jP*(£)  may  not  necessarily  mean  that  the 
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Figure  1.  State-selected  reaction  probabilities  (solid  curve)  computed  from  time- 
dependent  quantal  wave  packet  calculations  and  reproduced  from  Ref.  [18]  for 
different  initial  vibrational  (v)  states  of  HD+  for  the  reaction  He  +  HD+(v)  — »  HeH+-(- 
D.  The  corresponding  quasiclassical  trajectory  results  are  included  as  dashed  curves. 
The  threshold  energies  of  various  vibrational  channels  of  the  product  HeH"1"  are 
marked  along  the  abscissa  of  panel  (a)  to  illustrate  the  occurrence  of  sharp  variations 
in  PJ?  near  each  channel  threshold. 


dynamics  is  chaotic  from  a  quantal  point  of  view.  Collinear  (He,  HD+)  would  correspond 
to  a  (H,  LH)  mass  combination,  while  (He,DH+)  would,  to  a  (H,HL)  mass  combination. 
Here  H  and  L  refer  to  heavy  and  light  atoms,  respectively.  Based  on  the  known 
P*  (£)  results  for  other  systems  of  (H,LH)  type,  it  is  expected  that  (He,HD+)  would 
have  an  oscillatory  P$(E)  as  a  light  atom  is  exchanged  between  two  heavy  atoms,  and 
that  (He,DH+)  would  have  a  relatively  structureless  P^(E)  since  the  leaving  atom  is 
light.  However,  we  find  that  P*(E)  for  collinear  (He,HD+)  has  a  staircase-like 
structure,  with  minor  oscillations  about  each  "step",  as  illustrated  in  figure  1  for 
different  initial  v  states  of  HD+.  The  sharp  variations  in  P%  (£)  (from  one  step  to 
another)  can  be  readily  identified  [18]  as  threshold  resonances  [19]  corresponding  to 
the  opening  up  of  either  reactant  or  product  vibrational  channels.  The  undulations 
of  smaller  magnitude  about  the  steps  are  due  to  the  weak  dynamical  resonances 
found  in  the  transition  state  spectrum.  The  P*(E)  curve  for  collinear  (He,DH+) 
collisions,  on  the  other  hand,  is  highly  oscillatory  as  shown  in  figure  2  and  such  an 
interpretation  would  not  suffice  to  explain  the  structure.  We  therefore  examine  the 
transition  state  spectra  of  both  systems  for  signatures  of  quantum  chaos,  and  find  that 
collinear  (He,  HD+)  collisions  are  predominantly  chaotic,  whereas  (He,  DH+)  collisions 
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Figure  2.  State-selected  quantal  reaction  probabilities  (solid  curve)  reproduced  from 
Ref.  [18]  for  the  reaction  He  +  DH+(v)  ->  HeD+  +  H.  The  corresponding  quasiclas- 
sical  trajectory  results  are  included  as  dashed  curves.  For  v  =  0  and  1,  the 
quasiclassical  trajectory  calculations  yield  P^  =  0  (the  statistical  uncertainty  level  is 
0.001). 


are  not.  From  a  classical  point  of  view,  however,  both  systems  show  characteristics  of 
chaos,  albeit  to  different  degrees. 


2.  Quantum  chaos 

We  obtain  the  transition  state  spectra  [18]  for  collinear  (He,HD+)  and  (He,DH+) 
collisions  by  evolving  a  wave  packet  tjj(t  =  0)  initially  centered  in  the  transition  state 
region  and  computing  the  autocorrelation  function  C(t)  —  (^>(0)|V>(V)},  and  its  Fourier 
transform.  There  are  a  large  number  of  quasibound  states,  i.e.  transition  state  resonances 
for  both  systems.  These  resonance  energies  {£,,  i=  1, 2 ..  .N}  provide  the  data  for  our 
subsequent  analysis.  Over  the  past  couple  of  decades,  considerable  evidence  has 
accumulated  to  support  the  hypothesis  that  the  statistical  properties  of  the  eigenvalues  of 
systems  that  have  chaotic  classical  limits  are  identical  to  those  of  ensembles  of  random 
matrices  [20]  of  the  appropriate  symmetry  or  universality  class.  Here  we  apply  such 
analysis  to  the  systems  at  hand. 

The  cumulative  level  density  N(E),  which  is  a  staircase  function,  is  first  unfolded 
in  order  to  make  the  average  density  of  energy  levels  uniform  over  the  entire  energy 
range.  There  are  no  clear-cut  methods  to  unfold  the  spectrum  of  resonance  states. 
Therefore  we  have  adopted  the  procedure  that  is  followed  for  nuclear  resonances  and 
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bound  states,  namely  that  the  unfolded  energy  levels  £/  are  calculated  from  the  energy 
levels  in  the  raw  spectrum  (£,•)  by  dividing  all  the  level  spacings  by  the  local  average 
spacing  [21,22] 


where  k  is  the  number  of  consecutive  spacings  used  to  calculate  the  local  average  spacing 
and  n  is  the  total  number  of  energy  levels. 

From  the  unfolded  spectrum,  the  nearest  neighbor  spacing  distribution,  P(s]  can  be 
obtained;  the  spacings  are  {si  =  Ei+\  —  E^i  =  1,2. .  .N  -  I}.  In  the  limiting  cases  of 
purely  chaotic  or  purely  regular  systems,  the  distributions  that  obtain  are  well  known:  the 
former  is  (to  a  high  degree  of  accuracy)  given  by  the  Wigner  surmise, 
P(s]  =  (?rj/2)  exp(-(7T1y2/4))  [23],  while  the  latter  is  the  Poisson  distribution 
P(s]  =  exp(-s)  [24]. 

The  spectral  rigidity  AS  (L),  computed  from  the  unfolded  spectrum  measures  the  least 
squares  deviation  of  the  staircase  function  (N(E))  from  the  best  straight  line  in  the  energy 
interval  of  length  L  [25]  and  it  gives  information  on  the  long-range  correlations — how 
one  state  influences  another  that  is  well  separated  from  it.  AS  (L)  is  computed  from  the 
unfolded  energy  values  lying  in  the  interval  [0, . . . ,  L]  as  [26] 


where  n  is  the  number  of  levels  within  the  energy  interval  L  and  the  ensemble  average  {} 
is  carried  out  by  shifting  the  box  of  length  L  over  the  entire  spectrum.  The  limiting 
behaviours  are  again  well-known,  and  are 


and 

A?°E(L)  «  \\ogL  -  0.00695  (4) 


respectively  for  the  regular  and  chaotic  cases  [20,23,27-29],  the  superscript  GOE 
referring  to  the  Gaussian  Orthogonal  Ensemble  of  random  matrices  [20,30].  Molecular 
systems  are  often  neither  purely  regular  nor  purely  chaotic,  in  which  case  the  mixed 
distribution  [31,32]  applies, 

A3(L;  q)  =  Af*"«((l  -  q)L]  +  Af^L),  (5) 

where  q  represents  the  fraction  of  phase  space  that  is  chaotic. 

For  HeHD+  the  NNSD  distribution,  shown  in  figure  3(a),  is  more  similar  to  the  chaotic 
case  than  the  regular,  indicative  of  level  repulsion.  This  is  confirmed  by  the  behaviour  of 
the  A3(L)  statistic,  shown  in  figure  3(b):  shown  along  with  the  data  (triangles)  for 
HeHD+  are  the  GOE  and  Poisson  values.  Except  at  small  L,  the  computed  points  for 
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Figure  3.  (a)  The  nearest  neighbor  spacing  distribution  P(s]  derived  from  the 
unfolded  eigenvalue  spectrum  for  collinear  HeHD+,  showing  short-range  correlations. 
Poisson  and  Wigner  indicate  the  distributions  in  the  regular  and  the  irregular  limits, 
respectively,  (b)  The  spectral  rigidity  &•$  (L)  indicating  the  long-range  correlations  as  a 
function  of  the  segment  length,  L,  in  the  above  spectrum  (triangular  points).  A3(L) 
computed  from  Eq.  5  for  mixed  behavior,  using  q  —  0.9  is  shown  by  the  dashed  curve. 
Poisson  and  GOE  indicate  the  regular  and  the  irregular  limits  respectively,  (c) 
Variation  of  ensemble  averaged  survival  probability  ((P(t)}}  with  time  t  (in  arbitrary 
units)  obtained  from  the  above  spectrum  (solid  curve).  The  RMT  ((P(t)))  values  for 
j3  =  0.9  are  included  as  the  dashed  curve,  for  comparison. 


HeHD+  are  closer  to  the  GOE  limit  than  to  the  Poisson,  thus  revealing  a  high  degree  of 
rigidity  in  the  spectrum,  more  so  than  the  case  of  collinear  (He,Hj).  Since  the  computed 
As(L)  distribution  does  not  fully  correspond  to  either  the  Poisson  or  the  GOE  value,  we 
have  fitted  it  to  the  intermediate  case  (Eq.  5),  and  obtain  the  dashed  line  in  figure  3(b), 
using  q  =  0.90. 

For  collinear  HeDH+  on  the  other  hand,  the  NNSD  in  figure  4(a)  is  more  ambiguous:  it 
is  difficult  to  decipher  whether  it  is  Poisson-like  or  Wigner-like.  The  A3(L)    values 
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Figure  4.  (a)  The  nearest  neighbor  spacing  distribution  derived  from  the  unfolded 
eigenvalue  spectrum  for  collinear  HeDH+.  (b)  The  spectral  rigidity  for  the  same 
spectrum  (triangular  points).  The  dashed  curve  corresponds  to  As(L)  values  obtained 
from  Eq.  5  for  q  =  0.2.  (c)  The  ensemble  averaged  survival  probability  with  time  t  (in 
arbitrary  units),  obtained  from  the  above  spectrum  (solid  curve).  The  RMT((P(f)}} 
values  for  /?  =  0.2  are  included  for  comparison  (the  dashed  curve). 

obtained  from  the  unfolded  spectrum,  shown  in  figure  4(b)  as  triangular  points,  are  close 
to  the  Poisson  distribution.  These  values  are  best  fitted  to  Eq.  5  (shown  as  the  dashed 
curve)  using  q  =  0.2,  which  indicates,  in  contrast  to  HeHD+,  a  high  degree  of  regular 
behavior. 

Further  analysis  of  the  dynamical  resonance  spectra  is  possible.  In  the  time-domain, 
analysis  of  the  transition  state  spectrum  proceeds  through  the  computation  of  the 
ensemble  averaged  survival  probability  ((P(t)}}  [13-16].  The  survival  probability, 
averaged  over  the  initial  states  (P(t)}  ,  is  computed  [16]  through 


N  +  2 


—   ]T    cos|27r(£n-£w) 

n,m;n>m 


(6) 
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and  further  averaging  over  the  Hamiltonian  is  done  by  averaging  over  n  levels  which  are 
divided  into  several  segments,  each  containing  N  levels,  to  obtain  ((P(t))).  (A£)  is  the 
average  level  spacing.  The  ((P(t)))  values  thus  obtained  from  the  unfolded  spectrum  of 
collinear  HeHD+  are  shown  by  the  solid  _curve  in  figure  3(c).  The  dip  below  the 
asymptotic  value  at  short  times  (f  «  27r7i/(A£}),  termed  a  "correlation  hole"  is  a  typical 
signature  of  irregularity  in  the  spectrum.  The  extent  of  irregularity  is  estimated  by 
comparing  the  ((P(t)}}  values  with  those  derived  from  random  matrix  theory  (RMT)  [20]. 
The  latter  are  obtained  from  [33] 

]},  (7) 


where  r  =  t/(2Tr(p}}(p)  is  the  average  level  density)  and  j3  =  1,  2  and  4  for  Gaussian 
orthogonal,  unitary  and  symplectic  ensembles,  respectively.  £2/3  (0  *s  me  two-level  form 
factor  obtained  by  Fourier  transformation  of  the  two-point  cluster  function,  Yip(u)  [20]. 
The  asterisk  implies  convolution  and  A#(/)  =N"1[sin(7rM)/7rf]2.  If  fraction  /3  of  the 
levels  obey  GOE  statistics  and  (1-/3)  obey  Poisson  statistics,  6^(0  (0  <  /3  <  1)  is  given 
by  [33]  ' 

MO  =  #2('/0)  +  [(1  -  jS)/rt]fi(0  (8) 


where  b2  indicates  the  GOE  value  that  is  obtained  from  the  two-level  cluster  function 
as  [20] 


-l)),     |r|>l.  (9) 

For  HeHD+,  the  ({P(t})}  values  obtained  from  Eq.  7  for  j3  =  0.9  (dashed  line)  are  found  to 
be  in  reasonable  agreement  with  those  derived  from  the  spectrum  through  Eq.  6  (the  solid 
line)  as  illustrated  in  figure  3(c).  We  conclude  that  approximately  90%  of  the  resonance 
spectrum  for  HeHD+  is  irregular  in  nature.  This  estimate  for  the  fractional  irregularity  of 
the  eigenvalue  spectrum  is  also  in  agreement  with  the  estimate  obtained  from  AS  (L)  (see 
above). 

For  HeDH+  on  the  other  hand,  the  «P(f))}  curve,  shown  in  figure  4(c)  reveals  only  a 
marginal  correlation  hole  (if  any)  implying  that  collinear  (He,DH+)  dynamics  is  highly 
regular.  In  addition,  the  above  ((P(t)))  values,  when  compared  to  those  obtained  from 
Eq.  7,  (the  dashed  line)  yield  a  value  of  /?  =  0.2,  which  is  consistent  with  the  fractional 
irregularity  of  the  spectrum  obtained  from  A3(L). 

3.  Classical  chaos 

In  order  to  see  how  the  classical  dynamics  of  the  two  systems  differ  we  have  computed 
quasiclassical  trajectories  for  both  over  a  wide  range  of  energies.  The  resulting  Pj(£) 
values  are  included  in  figures  1  and  2  in  the  form  of  dashed  lines.  As  is  often  the  case,  the 
classical  threshold  (Eg)  values  for  the  exchange  reaction  in  the  case  of 
He+HD+(v  =  0,  1)  are  lower  than  the  quantal  (E*™)  values  because  products  with 
nf  <  0  are  possible  in  classical  mechanics  [34]  and  are  indeed  found  to  be  formed.  For 
v  =  2,  Eg  =  £^m.  For  v  =  3,  E%  >  E%?  as  the  exchange  reaction  is  dynamically 
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Figure  5.  Action- angle  plots  for  a  family  of  1000  trajectories  for  various  initial 
vibrational  (v)  states  (indicated  in  each  panel)  of  HD+  in  collinear  He,HD+(v) 
collisions  a.t'E,ot  =  1.0  eV.  R  and  NR  stand  for  the  reactive  and  the  nonreactive  band 
of  trajectories,  respectively. 


forbidden  in  the  vicinity  of  Ejj".  For  He  +  DH+(v  =  0, 1),  classical  mechanics  predicts 
pR  •-  0  while  quantum  mechanics  yields  extremely  low  (but  nonzero)  reaction 
probabilities,  P*.  For  v  =  2  and  3,  E%  <  Ef™  as  was  discussed  above.  We  note, 
parenthetically,  that  it  is  clear  that  on  average,  classical  mechanics  gives  a  good 
description  of  collinear  He,HD+  and  He,DH+  dynamics.  The  only  major  discrepancy 
(over  and  above  the  threshold  effect)  between  the  classical  and  quantal  results  is  seen  to 
occur  for  He,DH+(v  =  2)  at  high  energies  and  its  origin  is  not  clear. 

Scattering  chaos  is  most  evident  [35]  when  the  final  quantum  number  (related  to  the 
final  classical  action)  n/  is  plotted  versus  the  initial  phase,  <&,  as  shown  in  figures  5  and  6 
for  collinear  (He,HD+)  and  (He,DH+)  collisions  for  different  v  states  of  HD+  at  a  total 
energy  (Etot)  of  1.0  eV.  It  is  clear  from  figures  5(a)  and  5(b)  that  there  are  R  and  NR 
bands  and  chattering  regions  in  between,  for  (He,HD+(v  =  0, 1)).  For  v  =  2,  the  R  band 
has  shrunk  in  size  and  the  chattering  region  has  become  wider,  when  compared  to  that  for 
v  =  0  and  1.  For  v  =  3,  the  R  band  has  disappeared  completely.  There  are  only  NR  bands 
and  chattering  regions  in  between.  It  can  be  seen  that  there  is  a  prominent  parabola  or 
icicle  at  the  centre  and  there  are  narrower  icicles  on  either  side  [36].  A  larger  number  of 
trajectories  computed  in  the  chattering  region  would  reveal  fractal  characteristics.  Since 
(He,DH+)  collisions  are  nonreactive  for  DH+(v  =  0, 1)  we  obtain  only  an  NR  band 
with  slight  distortions,  as  can  be  seen  in  figures  6(a)  and  6(b).  For  v  =  2  and  3  we  obtain 
an  NR  band  and  a  set  of  reactive  trajectories  whose  «/  values  vary  rapidly  with  0,-.  A 


Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  H) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


419 


Susanta  Mahapatra  et  al 


0.5TT         TT          1.57T       27T    I>U0         0.5TT        IT 


Figure  6.    Same  as  figure  5  for  He,DH+(v)  collisions. 
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Figure  7.  Action-angle  plot  at  increased  resolution  for  He,  DH+  (v  =  2)  collisions.  R 
and  NR  represent  the  reactive  and  the  nonreactive  trajectories,  respectively.  The  root 
trajectories  that  determine  the  scattering  amplitude  in  the  classical  S-matrix  theory 
correspond  to  the  intersection  of  the  fractal  curve  (R)  with  the  line  n/  =  0;  it  is  evident 
that  there  are  a  very  large  number  (essentially  infinite)  of  these. 


blow-up  of  the  chattering  region  containing  reactive  trajectories  is  shown  in  figure  1.  The 
reactive  and  the  nonreactive  trajectories  are  interspersed  along  the  fa  axis.  But  because  of 
the  differing  range  of  nf  values  for  the  two  types  of  collisions,  they  are  well  separated 
along  the  vertical  axis,  but  both  reactive  as  well  as  nonreactive  trajectories  have 
characteristic  icicle  structure.  From  the  point  of  view  of  semiclassical  S-matrix  theory 
[37],  the  horizontal  line  at  nf  —  0  implies  that  there  is  a  large  number  of  "root" 
trajectories  and  that  the  expression  for  the  reaction  probability  would  consist  of  a  large 
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Figure  8.  Poincare  surface  of  section  obtained  from  400  trajectories  superimposed 
on  the  zero  (0)  and  the  first  (1)  order  separatrices  at  Etl),  =  0.6434  eV  for  collinear  (a) 
He,HD+(v  =  0)  and  (b)  He,DH+(v  =  0)  collisions. 


number  of  interference  terms,  thus  explaining  the  highly  oscillatory  nature  of  P*  (£) 
curve  for  the  reaction. 

In  order  to  investigate  the  classical  chaos  in  collinear  (He,  HD+)  and  (He,  DH+) 
collisions  further,  we  show  in  figure  8(a)  plots  of  Poincare  SOS  for  400  trajectories  with 
HD+(v  =  0)  at  E'for=0.6434  eV,  along  with  the  zero  and  the  first  order  separatrices.  The 
zero  order  separatrix  is  obtained  by  solving  the  implicit  equation 


PL 

2M 


(10) 


where  M  is  the  atom-diatom  reduced  mass  and  V(x,y  =  ye)  is  the  potential  energy  of  the 
system,  with  the  diatom  in  its  equilibrium  configuration.  The  first  order  separatrix  is 
obtained  by  propagating  several  sets  of  (x,px)  values  on  the  zero  order  separatrix  forward 
in  time  by  setting  y  =  ye  and  py  =  \/2mEv,  where  m  is  the  diatomic  reduced  mass  and  Ev 
is  the  vibrational  energy  of  the  diatom,  equal  to  Etot,  until  it  crosses  the  SOS  again.  The 
first  order  separatrix,  when  superimposed  on  the  zero  order  separatrix,  reveals  a  turnstile 
or  broken  separatrix  [38].  The  latter  consists  of  flux-in  and  flux-out  regions.  Direct 
(regular)  trajectories  never  enter  the  intramolecular  bottleneck  (identified  with  the  zero 
order  separatrix)  whereas  indirect  (irregular)  trajectories  do,  invariably  entering  through 
the  flux-in  region  and  exiting  through  the  flux-out  region. 

The  results  in  figure  8(a)  are  very  similar  to  those  in  figure  3  of  Ref.  [10]  for  collinear 
(He,  Hj)  .  The  regular  trajectories  never  enter  the  intramolecular  bottleneck  and  they 


distribution  in  the  final  relative  translational  energy.  On  the  other  hand  the  irregular 
trajectories  enter  largely  through  the  broken  separatrix  and  they  undergo  multiple 
collisions  inside  the  strong  intramolecular  bottleneck  region  and  form  the  complementary 
dusty  pattern  between  the  baseball  bats. 

The  zero  and  the  first  order  separatrices,  along  with  the  Poincare  SOS  for  a  family  of 
regularly  sampled  400  trajectories  for  collinear  (He,  DH+(v  =  0))  at  Etot  =0.6434  eV  are 
shown  in  figure  8(b).  Again,  in  contrast  to  figure  8(a),  it  is  clear  that  all  the  trajectories 
are  directly  scattered.  They  never  enter  the  intramolecular  bottleneck,  emphasizing  the 
fact  that  collinear  (He,  DH+)  trajectories  are  predominantly  regular  at  energies  below  Eft. 
Even  for  energies  above  £$,  the  Poincare  SOS  plots  confirm  that  there  are  regular  and 
irregular  trajectories  for  collinear  He,HD+(v  =  0  -  3).  For  He,DH+(v  =  0,1)  the 
trajectories  are  nonreactive  but  entirely  regular.  For  v  =  2  and  3  however,  both  regular 
and  irregular  trajectories  occur. 


4.  Summary  and  conclusions 

Chaotic  scattering  is  sensitive  to  kinematic  effects,  and  with  this  in  mind,  in  the  present 
article  we  have  examined  the  transition  state  spectra  of  collinear  (He,HD+)  and 
(He,DH+)  systems  obtained  from  time-dependent  quantal  wave  packet  calculations. 
Signatures  of  quantum  chaos  become  evident  in  statistical  analyses  of  spacing 
distributions,  the  spectral  rigidity  and  ensemble  averaged  survival  probability.  In  this 
regard,  the  two  isotopic  variants  we  have  looked  appear  to  be  dynamically  very  different, 
although  the  interaction  potential  is  identical  in  both  the  cases. 

The  3-body  system  (He,  Hj  )  has  the  mass  combination  Heavy-Light-Light.  Changing 
the  terminal  atom  from  H  to  D,  namely  by  making  this  a  Heavy-Light-Heavy  system 
keeps  the  dynamics  irregular,  both  classically  as  well  as  quantum  mechanically.  Changing 
the  central  atom  to  D  gives  the  mass  combination  Heavy-Heavy-Light,  and  this 
dramatically  reduces  the  chaotic  behaviour:  the  HeDH+  spectrum  is  regular  despite  the 
wild  oscillatory  Pj(E)  curve.  Action-angle  plots  and  Poincare  SOS  plots  indicate  that 
(He,HD+)  dynamics  is  mixed  i.e.  partly  regular  and  partly  irregular,  over  the  entire 
energy  range  of  investigation.  Results  for  (He,DH+),  in  contrast,  suggest  that  the 
dynamics  is  nearly  completely  regular  for  v  =  0  and  1,  but  for  v  =  2  and  3,  it  assumes  a 
mixed  character. 

As  was  mentioned  in  the  Introduction,  classical  chaos  and  quantum  mechanical 
resonances  have  been  investigated  in  collinear  (H,  H2)[7],  (Li,  FH)  [8]  and  other  systems. 
Quantal  resonances  have  been  reported  for  three  dimensional  (He,  Hf)  -[2e-2g]  and 
(Li,FH)  [39]  collisions  as  well.  The  large  number  of  spectral  lines  observed  in  the 
infrared  predissociation  spectrum  of  Hj  [40]  have  been  traced  [41]  to  bound  states 
embedded  in  the  continuum  (read  transition  state  resonances)  of  the  H2-H+channel  and 
also  related  to  horse-shoe  periodic  orbits.  Indications  of  the  role  of  kinematic  factors  in 
the  system  came  from  the  D2H+  spectra  that  depended  on  the  formation  of  H+/D+. 

Attempts  have  been  made  [42]  in  the  past  to  verify  experimentally  the  existence  of 
resonances  in  (He.Hf)  collisions.  Following  our  suggestion  [2d]  that  observables  like 
state-to-state  differential  cross  sections  for  backward  scattering,  which  are  decided 
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largely  by  collinear  collisions  at  zero  impact  parameter,  might  be  amenable  to 
experimental  verification,  Pollard  et  al  [43]  measured  the  same.  Unfortunately,  their 
energy  resolution  (0.15  eV)  was  not  good  enough  to  detect  the  narrow  resonances  in  the 
system.  Since  PQ(E)  behaves  differently  for  HeH+  and  HeD+  formation  in  collinear 
collisions,  it  is  quite  likely  that  this  effect  would  persist  in  three  dimensions.  In  that  case, 
measurement  of  differential  cross  sections  for  the  two  different  channels  in  (He,HD+) 
collisions  might  reveal  some  of  the  interesting  features  reported  in  this  paper. 
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Abstract.  We  shall  discuss  the  role  of  chaotic  intrinsic  motion  in  dissipative  dynamics  of 
the  collective  coordinates  for  nuclear  systems.  Using  the  formalism  of  linear  response  theory,  it 
will  be  shown  that  the  dissipation  in  adiabatic  collective  motion  depends  on  the  degree  of  chaos 
in  the  intrinsic  dynamics  of  a  system.  This  gives  rise  to  a  shape  dependent  dissipation  rate  for 
collective  coordinates  when  the  intrinsic  motion  is  described  by  the  independent  particle  model  in  a 
nucleus.  The  shape  dependent  chaos  parameter  measuring  the  degree  of  chaos  in  the  intrinsic 
dynamics  of  the  nuclear  system  will  be  obtained  using  the  interpolating  Brody  distribution  of 
nearest  neighbour  spacings  in  the  single  particle  energy  spectrum.  A  similar  shape  dependence  is 
also  found  to  be  essential  for  phenomenological  dissipation  rates  used  in  fission  dynamics 
calculations. 
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1.  Introduction 

The  chaotic  aspects  of  nuclear  dynamics  have  invoked  considerable  interest  [1-11] 
during  the  last  decade.  The  nucleus  is  a  particularly  suitable  candidate  to  study  chaos 
because  of  its  rich  dynamics  displaying  both  single-particle  and  collective  features, 
interplay  between  microscopic  and  macroscopic  degrees  of  freedom  and  the  possibility  of 
perturbing  such  a  system  with  varying  degrees  of  violence  in  a  collision.  It  is  worthwhile 
to  clarify  at  this  point  that  though  the  nucleus  being  a  many  body  system  of  interacting 
fermions  is  expected  to  be  fully  chaotic,  it  is  always  not  necessarily  so.  It  may  be  recalled 
here  that  the  fluctuation  properties  of  low  energy  nuclear  spectra  follow  those  of 
eigenvalues  of  Gaussian  Orthonormal  Ensemble  (GOE)  of  random  matrices  [12,13], 
which  were  subsequently  found  [14]  to  characterize  the  fluctuation  statistics  of 
eigenvalues  of  systems  with  nonintegrable  classical  motion.  Fluctuation  analyses  of 
rotational  spectra  were  performed  [15]  to  gain  insight  into  the  structure  of  hot  rotating 
nuclei.  On  the  other  hand,  both  chaotic  and  regular  aspects  were  also  found  [16]  in 
different  regimes  of  nuclear  spectra.  Since  many  nuclear  properties  can  be  understood  in 
terms  of  a  relatively  few  degrees  of  freedom,  the  domain  of  the  nuclear  dynamics  relevent 
for  these  properties  can  display  chaotic  features  in  varying  degrees.  In  particular,  the  short 
range  nature  of  the  nucleon-nucleon  interaction  makes  the  mean  field  description  of  the 
nucleus  a  very  good  approximation.  In  its  simplest  form,  independent  particles  moving  in 
a  potential  well  is  known  [17]  to  explain  a  number  of  nuclear  bulk  properties  quite 
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successfully.  Thus  in  the  mean  field  regime,  the  nuclear  dynamics  will  be  regular  or 
chaotic  depending  on  the  nature  of  the  mean  field.  While  the  single  particle  motion  would 
be  completely  regular  in  a  spherical  potential,  it  would  be  more  and  more  chaotic  as  the 
potential  is  progressively  deformed  [18],  reaching  the  full  chaotic  limit  for  a  strongly 
deformed  nuclear  shape.  In  the  present  article,  we  shall  concern  ourselves  with  the  study 
of  chaos  in  the  nuclear  single  particle  dynamics  and  its  role  in  damping  of  nuclear 
collective  motions. 

Dissipation  occurs  in  nuclear  dynamics  when  the  kinetic  energy  associated  with  the 
collective  coordinates  are  irreversibly  transferred  to  the  intrinsic  (non-collective)  degrees 
of  freedom  of  a  nucleus.  Such  situations  arise  in  the  decay  of  the  giant  resonances,  fission 
and  in  collisions  between  two  heavy  nuclei.  Since  collisions  provide  the  opportunity  to 
study  the  dissipation  or  damping  phenomena  under  controlled  conditions,  the  early 
motivation  for  developing  theoretical  models  for  dissipation  in  nuclei  came  from  damped 
low  energy  heavy  ion  collisions  [19].  Gross  [20]  pioneered  the  concept  of  one-body 
mechanism  of  dissipation  in  which  the  nucleons  bouncing  off  the  moving  boundary  of  the 
time  dependent  average  one-body  potential  soak  energy  from  the  collective  motion 
resulting  in  a  damping  force  in  the  collective  degrees  of  freedom.  Blocki  and  his 
collaborators  [21]  subsequently  derived  a  simple  expression  namely  the  "wall  formula" 
for  one  body  dissipation.  The  wall  formula  was  also  obtained  from  a  formal  theory  of  one 
body  nuclear  dissipation  based  on  classical  linear  response  techniques  [22].  The  wall 
formula,  in  conjunction  with  the  "window  formula"  [21]  (to  account  for  the  role  of 
nucleon  exchange  through  the  neck  in  a  dinuclear  system),  was  found  to  be  quite 
successful  in  reproducing  a  large  volume  of  experimental  data  of  damped  heavy  ion 
collisions  [23],  fusion  [24]  and  fission  [21,25]. 

One  of  the  important  assumptions  of  the  wall  formula  concerns  the  randomization  of 
the  particle  motion.  It  is  assumed  [21]  that  the  successive  collisions  of  a  nucleon  with  the 
one  body  potential  give  rise  to  a  velocity  distribution  which  is  completely  random.  In 
other  words,  a  complete  mixing  in  the  classical  phase  space  of  the  particle  motion  is 
required.  This  is  normally  satisfied  for  one  body  potentials  whose  shapes  are  rather 
irregular.  It  was  early  realised  [21,22]  that  any  deviation  from  this  randomization 
assumption  would  give  rise  to  a  reduced  strength  of  the  wall  formula.  This  happens 
because  the  energy  transferred  to  a  particle  from  the  time-dependent  wall  could  be  partly 
reversible  if  the  motion  of  the  particle  is  not  completely  random.  In  the  present  paper,  we 
shall  re-examine  the  classical  wall  formula  in  order  to  distinguish  between  the  reversible 
and  irreversible  energy  transfers.  We  shall  identify  the  irreversible  energy  transfer  as  the 
true  one-body  dissipation.  We  shall  further  show  that  the  original  wall  formula  can  be 
modified  to  describe  the  irreversible  energy  transfer.  This  modification  would  be  obtained 
in  the  form  of  a  scaling  factor  which  would  account  for  the  degree  of  irregularity  of  the 
system.  The  modified  wall  formula  thus  obtained  would  be  applicable  to  systems  in 
which  the  particle  motions  are  not  fully  randomized.  In  order  to  achieve  the  above,  we 
shall  exploit  the  characteristic  features  of  the  phase  space  of  particle  motion. 

Recent  progresses  [26]  in  understanding  the  nature  of  dynamical  systems  have 
demonstrated  that  even  simple  systems  possess  a  rich  phase  space  structure  and  classical 
dynamics  can  be  idealized  for  the  two  extreme  cases  of  either  fully  regular  (integrable)  or 
fully  chaotic  (nonintegrable)  motions.  However,  most  of  the  systems  of  practical  interest 
fall  in  between  the  above  limiting  cases  and  one  has  to  often  deal  with  "mixed"  systems 


which  display  both  regular  and  chaotic  features.  Arvieu  et  al  [3]  considered  the  classical 
motion  of  a  particle  moving  in  a  deformed  potential  and  pointed  out  the  importance  of 
the  topology  of  phase  space  to  characterise  such  motions.  Heiss  et  al  [27]  studied 
single  particle  motion  in  axially  defonned  nuclei  both  classically  and  quantum 
mechanically.  Considering  shapes  with  quadrupole  and  higher  multipole  deformations, 
they  succeeded  in  explaining  the  distinguishing  features  of  shell  structure  in  prolate  and 
oblate  shapes  in  terms  of  the  underlying  classical  dynamics.  In  their  detailed  numerical 
study  of  classical  particles  in  vibrating  cavities  of  various  shapes,  Blocki  et  al  [9]  showed 
a  strong  correlation  between  chaos  in  classical  phase  space  and  the  efficiency  of  energy 
transfer  from  the  collective  to  the  intrinsic  motion.  In  particular,  it  was  demonstrated  in 
Ref.  [9]  that  while  the  energy  trasfer  is  much  smaller  than  the  wall  formula  limit  in  a 
cavity  undergoing  quadrupole  vibration,  it  reaches  the  wall  formula  limit  for  higher 
multipole  vibrations.  Similar  conclusions  were  also  reached  when  the  single  particle 
motion  was  treated  quantum  mechanically  [10].  As  mentioned  earlier,  these  essentially 
confirm  the  contention  that  the  wall  formula  is  valid  when  the  intrinsic  dynamics  is  fully 
chaotic. 

In  the  present  paper,  we  shall  review  some  of  our  works  [8, 18,28,29]  aimed  at  a 
modification  of  the  wall  formula  in  order  to  make  it  amenable  to  mixed  systems.  It  would 
be  argued  that  while  the  energy  transfer  to  the  particle  motion  described  by  the  regular 
part  of  the  classical  phase  space  is  reversible,  energy  flows  irreversibly  to  the  particle 
motion  which  belongs  to  the  chaotic  phase  space.  We  shall  subsequently  make  use  of  a 
measure  for  the  degree  of  chaos  or  nonintegrability  for  mixed  systems  which  is  defined  as 
the  relative  volume  of  the  phase  space  that  belongs  to  the  chaotic  trajectories.  We  shall 
incorporate  this  measure  of  nonintegrability  into  the  linear  response  formulation  of  the 
one  body  dissipation  [22].  We  would  obtain  a  "scaled  wall  formula"  which  would  be 
applicable  for  any  system  between  a  fully  regular  (no  dissipation)  and  a  fully  chaotic  one 
(original  wall  formula  dissipation).  Finally,  we  would  present  some  numerical  results  to 
illustrate  the  effect  of  deformation  on  one  body  dissipation.  We  would  further  point  out 
that  a  deformation  dependent  friction  is  found  to  be  essential  in  fission  dynamics 
calculations  [30, 31]  and  the  nature  of  this  deformation  dependence  is  very  similar  to  our 
present  theoretical  prediction. 

In  the  following  sections,  we  shall  first  briefly  review  the  classical  linear  response 
theory  for  one  body  dissipation,  developed  earlier  by  Koonin  and  Randrup  [22],  and  then 
describe  its  modification  [29]  to  obtain  the  scaled  wall  formula  in  §3.  In  §4,  the  scaling 
parameter  or  the  measure  of  chaos  will  be  numerically  obtained  for  different  defonned 
shapes.  Finally,  §  5  will  contain  a  summary  and  discussions  of  our  results. 


2.  One  body  dissipation  revisited 

We  shall  first  briefly  review  the  classical  linear  response  theory  of  one  body  dissipation 
developed  earlier  by  Koonin  and  Randrup  [22].  The  independent  particle  model  of  the 
nucleus  will  be  considered  here  in  which  the  motion  of  the  individual  nucleons  in  a  time- 
dependent  nuclear  single  particle  potential  is  governed  by  the  one  body  Hamiltonian 
H(r,  p;  t).  Here  r  and  p  denote  the  nucleon  position  and  momentum  respectively  whereas 
t  denotes  the  time.  The  Hamiltonian  H  may  be  split  into  a  time  independent  part  HQ  and  a 
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'/J 
with#i(r,p;r  =  0)  =  0. 

Under  the  linear  response  approximations  which  require  the  validity  of  a  perturbative 
treatment  and  the  assumption  that  the  relaxation  time  of  the  intrinsic  motion  is  short  in 
comparison  with  the  time  scale  for  collective  motion,  the  rate  of  energy  dissipation  from 
the  collective  motion  can  be  expressed  as, 


Q  =  -      dr 

J     (2TT)-    WO  / 

x  "W'P)  (2.2) 

This  equation  corresponds  to  a  physical  picture  in  which  a  particle  originating  from  a 
point  (r,  p)  in  phase  space  contributes  a  dissipation  rate  equal  to  the  product  of  the  initial 
impulse  received  H\  (t)  and  the  sum  of  all  impulses  received  subsequently  along  its  entire 
(unperturbed)  trajectory  (Ro,P0).  In  the  above  expression, /o  is  the  single  particle  phase 
space  distribution  function  governed  by  the  unperturbed  Hamiltonian  HO  and  the  factor 
dfo/dHo  ensures  that  for  a  Fermi-Dirac  distribution  only  the  particles  near  the  Fermi 
surface  contributes. 

The  assumption  of  a  adiabatic  collective  motion  required  to  arrive  at  Eq.  (2.2) 
was  critically  examined  in  ref.  [32]  where  the  wall  formula  was  obtained  as  the  low- 
frequency  limit  of  the  one-body  RPA  damping.  It  was  observed  that  the  relaxation 
of  the  ideal  adiabatic  requirement  to  realistic  collective  speeds  reduces  the  damping 
by  20%-30%  Therefore  the  above  eq.  (2.2)  shall  give  the  leading  contribution  to 
one-body  damping  even  when  the  collective  and  intrinsic  time  scales  become 
comparable. 

Considering  a  leptodermous  system  in  which  the  nuclear  potential  is  uniform 
throughout  the  volume  but  rises  steeply  at  the  surface,  the  time  integral  in  eq.  (2.2)  can  be 
written  as  a  sum  of  the  impulses  received  by  a  particle  during  its  successive  encounters 
with  the  nuclear  surface  along  its  unperturbed  trajectory.  Separating  the  contribution  of 
the  first  impulse  given  to  a  particle  near  its  point  of  origin  at  f7  =  0  (local  part)  from  those 
arising  out  of  the  successive  reflections  from  other  regions  of  the  nuclear  surface  (non- 
local part),  Koonin  and  Randrup  obtained  the  energy  damping  rate  as, 

Q  =  <2local  +  Snon-local  (2.3) 

where 

—     r  f    r  \ 

Qiocai  =  ~  I  u2(a]  (  I  d2^  Jd20  (2.4) 


and 


=  -~  I  u(a)  I  I  d2ttfl  cos2  Ba  ^  "0)  cos  9b  ]  d*a  (2.5) 
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Figure  1.     A  typical  trajectory  in  a  deformed  cavity. 


In  the  above  equations,  we  have  followed  the  notations  of  ref.  [22]  where  k  is  the 
nomentum  of  a  particle  as  it  leaves  the  surface  point  'a'  at  an  angle  fia,  u(a]  is  the 
lormal  component  of  the  surface  velocity  at  'a'  and  9b  is  the  angle  between  the  trajectory 
md  the  normal  at  a  point  '&'  on  the  surface  (figure  1).  The  summation  is  over  all  the 
lubsequent  reflecting  points  1V  along  the  trajectory  after  the  first  encounter  at  'a',  p  and  v 
ire  respectively  the  nuclear  mass  density  and  the  average  nucleon  speed  inside  the 
mcleus. 

For  regular  systems  such  as  a  spherical  cavity,  it  was  shown  in  ref.  [22]  that  the  non- 
ocal  contribution  exactly  cancels  the  local  part.  This  essentially  corresponds  to  an  energy 
ransfer  which  is  completely  reversible.  Thus  the  energy  gained  by  a  particle  from  the 
vail  is  eventually  fed  back  to  the  wall  when  the  particle  motion  is  regular.  The  regularity 
n  particle  motion  ensures  complete  reversibility  under  adiabatic  conditions.  An 
ntegrable  system  thus  becomes  completely  non-dissipative  in  this  picture.  On  the  other 
land,  the  non-local  part  becomes  zero  for  a  system  which  is  sufficiently  irregular.  This 
;an  also  be  seen  from  the  eq.  (2.5)  where  the  surface  velocity  components  at  successive 
•eflection  points  are  completely  random  for  a  cavity  of  highly  irregular  shape  and 
herefore  cancel  out  in  the  summation.  This  in  turn  corresponds  to  a  completely 
rreversible  energy  transfer.  The  dissipation  rate  is  thus  solely  governed  by  the  local  term 
x>r  an  irregular  system  which  now  reduces  to  the  wall  formula, 

fiwaii  =pv      u2(a)d2a  (2.6) 

It  is  to  be  noted  further  that  the  energy  damping  is  also  given  by  the  wall  formula 
irrespective  of  the  system  being  regular  or  chaotic  when  the  time  available  for  damping 
is  short  compared  to  the  interval  between  successive  collisions  of  a  particle  with  the 
ivall.  Such  situations  may  arise  when  other  competing  processes  reduce  the  interaction 
ime  during  which  the  one-body  damping  is  important.  If  this  interaction  time  is  too  short 
for  any  possible  transfer  of  particle  energy  to  the  wall  after  the  first  collision,  the  net 
;nergy  transfer  rate  would  be  given  by  the  local  term  or  equivalently,  the  original  wall 
formula. 


3.  One  foody  dissipation  in  mixed  systems 

We  have  considered  in  the  above  the  one-body  dissipation  rates  in  completely  regular  or 
fully  chaotic  systems.  However,  most  of  the  physical  systems  of  interest  do  not  belong  to 
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either  of  the  two  limits  of  full  regularity  or  complete  chaos.  The  dynamics  of  such 
systems  display  both  the  characteristic  features  of  regularity  (tori)  and  chaos  (fog)  in  the 
classical  phase  space.  To  obtain  the  dissipation  rate  in  such  mixed  systems,  we  shall 
proceed  as  follows. 

We  shall  first  distinguish  between  the  regular  and  chaotic  trajectories  which  contribute 
to  the  Eqs.  (2.4)  and  (2.5)  above.  Originating  from  a  given  point  on  the  cavity  wall  and 
moving  in  a  given  direction,  a  regular  trajectory  closes  smoothly  in  phase  space.  On  the 
other  hand,  another  trajectory  leaving  the  same  point  but  in  a  different  direction  could  be 
a  chaotic  one  which  does  not  close  in  the  phase  space.  Let  £lia  be  the  solid  angle  which 
contains  all  the  chaotic  trajectories  originating  from  the  surface  point  'a'  while  Slia  is  the 
corresponding  solid  angle  for  the  regular  trajectories  (£l\a  +  tlza  =  4?r).  It  may  be  noted 
here  that  each  of  these  solid  angles  could  be  composed  of  disjoint  components.  The 
Eq.  (2.3)  then  can  be  written  as, 


The  last  two  terms  on  the  right  hand  side  of  this  equation  represent  the  local  and  non- 
local contributions  to  the  dissipation  due  to  the  trajectories  which  are  regular.  We  shall 
assume  at  this  point  that  the  net  dissipation  due  to  the  regular  trajectories  will  be  zero, 
similar  to  the  spherical  system  whose  all  the  trajectories  are  regular  [22].  Though  the  non- 
dissipative  nature  of  a  regular  system  has  been  demonstrated  only  for  a  spherical  system 
and  a  general  proof  for  any  regular  system  is  not  available  to  the  best  of  our  knowledge, 
the  latter  can  be  well  anticipated  on  physical  grounds  and  has  been  numerically 
confirmed  [9]  for  several  integrable  systems.  The  zero  dissipation  should  therefore  be  a 
generic  property  of  the  phase  space  of  integrable  systems  and  we  expect  it  to  hold  for  that 
part  of  the  phase  space  of  a  mixed  system  which  is  similar  to  that  of  an  integrable  system. 

We  shall  next  consider  the  second  term  on  the  right  side  of  the  Eq.  (3.1)  which 
represents  the  non-local  contribution  of  the  chaotic  trajectories.  We  immediately  note  that 
this  term  should  also  vanish  due  to  the  random  nature  of  the  surface  velocity  components 
at  successive  reflecting  points  as  has  been  mentioned  in  the  earlier  section.  We  shall 
therefore  be  left  with  only  the  local  term  arising  from  the  chaotic  trajectories  and  obtain, 


Q  =  pv      Ii(d)u2(a)£a  (3.2) 

where 


and  can  be  defined  as  the  fraction  of  the  trajectories  leaving  a  surface  point  'a'  which  are 
chaotic.  Since  we  do  not  expect  any  correlation  between  /i(a)  and  u2(a),  the  above 
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equation  can  be  further  reduced  to, 

Q  =  /Sfiwaii  (3.3) 

where 


and  S  is  the  total  surface  area.  Thus  p,  is  the  average  fraction  of  the  trajectories  which  are 
chaotic  when  the  sampling  is  done  uniformly  over  the  surface. 

We  have  thus  arrived  at  a  scaled  version  of  the  wall  formula  where  the  dissipation  rate 
depends  on  the  degree  of  chaos  in  the  single  particle  motion.  For  a  completely  irregular 
system,  ft  is  1  and  we  recover  the  full  wall  formula  whereas  for  a  system  with  integrable 
single  particle  dynamics,  there  would  be  no  chaotic  trajectories  (p,  =  0)  and  hence  no 
dissipation.  For  mixed  systems,  it  will  be  necesary  to  calculate  p,  separately  to  obtain  the 
dissipation. 

It  is  of  interest  to  note  here  that  a  quantity  (chaoticity  /chaos  parameter)  defined  in  a 
similar  fashion  to  p  is  often  used  in  the  literature  [33,  34]  to  express  the  degree  of 
irregularity  in  the  dynamics  of  a  system.  This  parameter,  to  be  denoted  henceforth  as  /u,  is 
defined  as  the  fraction  of  the  phase  space  occupied  by  the  chaotic  trajectories.  For  a  three- 
dimensional  billiard,  this  essentially  reduces  to  a  uniform  volume  sampling  to  find  the 
fraction  of  the  trajectories  which  are  chaotic.  In  what  follows,  we  shall  approximate  /Z  by 
\JL  to  simplify  our  calculation.  In  order  to  justify  this  approximation,  we  first  note  that  both 
the  volume  and  surface  samplings  give  the  same  limiting  values  of  0  and  1  respectively 
for  fully  regular  or  completely  chaotic  systems.  Also,  both  the  volume  and  surface 
averages  would  be  equal  for  systems  in  which  the  chaotic  trajectories  are  uniformly 
distributed.  Moreover,  as  a  regular  cavity  is  deformed  progressivly  towards  more  irregular 
shapes,  both  /j,  and  /I  also  change  monotonically  from  0  to  1.  We  therefore  do  not 
anticipate  any  drastic  difference  between  these  two  measures  for  mixed  systems. 
However,  it  would  still  be  of  interest  to  make  a  precise  comparison  between  JJL  and  p, 
which  we  plan  to  do  in  future. 

4.  Chaos  parameter  from  energy  spectrum 

We  have  so  far  considered  the  chaos  parameter  IJL  (or  £L)  as  a  classically  defined  quantity 
which  can  be  calculated  by  sampling  trajectories  in  the  classical  phase  space.  However 
for  the  present  purpose,  we  shall  obtain  the  chaos  parameter  "from  a  measure  of  the 
fluctuations  of  the  single  particle  energy  spectrum  of  the  corresponding  quantum  system. 
This  will  be  essentially  done  for  two  reasons.  Firstly,  the  measure  of  chaos  from  quantum 
spectra  is  known  {35]  to  be  equivalent  to  the  measure  from  classical  phase  space  and 
secondly,  the  ready  accessability  of  the  quantum  calculation  to  us.  Thus  the  quantum 
calculation  would  give  us  a  measure  of  chaos  in  the  corresponding  classical  system.  It 
may  be  noted  that  though  a  quantal  formulation  of  dissipation  has  been  obtained  [36] 
earlier,  the  generic  features  of  chaos  in  the  quantum  spectra  has  not  been  incorporated  in 
such  a  formulation  as  yet. 

For  a  regular  or  classically  integrable  system,  it  has  been  proved  [37]  that  in  the 
semiclassical  limit,  successive  energy  levels  of  the  corresponding  quantum  hamiltonian 
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arrive  randomly.  This  results  in  a  Poisson  distribution  P(s)  for  the  spacings  (j)  between 
the  nearest  neighbours  in  the  energy  spectra.  For  generic  irregular  systems  with  time- 
reversal  symmetry,  it  has  been  conjectured  [14]  that  spectral  fluctuations  are  universally 
reproduced  by  the  Gaussian  Orthonormal  Ensemble  (GOE).  For  mixed  systems,  it  has 
been  argued  [35]  that  in  the  semiclassical  limit  a  spectrum  should  consist  of  regular  and 
irregular  parts  that  are  associated  with  the  classical  regular  and  irregular  regions  in  phase 
space.  This  was  subsequently  confirmed  by  Bohigas  et  al  [38]  in  their  detailed  work  on 
manifestation  of  classical  phase  structure  in  quantum  systems.  Assuming  that  irregular 
and  regular  regions  yield  energy  level  sequences  with  GOE  and  Poisson  spacing 
distributions,  respectively,  and  that  the  whole  spectrum  is  generated  by  a  statistically 
independent  superposition  of  those  sequences  with  respective  weight  factors  n  and 
(1  -  j^),  Berry  and  Robnik  [39]  obtained  a  semiclassical  formula  for  P(s)  that  smoothly 
interpolated  between  the  Poisson  (^  =  0)  and  GOE  (IJL  =  1)  distributions.  One  can  thus 
extract  a  value  of  fj,  by  a  least-square  fit  of  this  parametrised  P(s)  to  the  level-spacing 
distribution  of  a  given  spectrum.  However,  it  has  been  subsequently  pointed  out  [40]  that 
the  Berry-Robnik  distribution  converges  extremely  slowly  to  the  semiclassical  limit.  It 
was  further  shown  [40]  that  the  phenomenological  Brody  distribution  [41] 

PBrody(/M)=^e-^+1  (  (4.1) 

with  a  =  (^+l}b  and  b  =  [T(l  +  j^-)]1+Ai  is  more  appropriate  in  the  near  semiclassical 
regime. 

Before  we  close  this  section,  it  is  worthwhile  to  point  out  that  several  authors  have 
discussed  [42,43]  other  approaches  to  obtain  smooth  interpolation  between  Poisson  and 
GOE  distributions.  French  et  al  [42]  considered  increasing  symmetry  breaking  in  a  given 
Hamiltonian  as  the  driving  mechanism  to  push  the  spacing  distribution  from  Poisson  to 
GOE.  On  the  other  hand,  Lenz  et  al  [43]  proposed  a  random  matrix  ensemble  displaying 
transition  from  Poisson  to  GOE  spectra.  However,  an  interpolation  from  Poisson  to  GOE 
based  on  classical  phase  space  arguments  is  more  appropriate  for  the  present  purpose.  We 
shall  therefore  use  Brody  parametrisation  in  the  present  work. 


4.1  Chaos  parameter  for  a  two  centre  shell  model 

We  have  earlier  studied  the  spectral  statistics  of  single  particle  motion  in  two  centre  shell 
model  potentials  [8,28].  The  total  Hamiltonian  of  a  single  particle  in  the  combined  field 
of  two  axially  symmetric  harmonic  oscillators  centred  at  zi  <  0  and  zi  >  0  (the  position 
of  the  barrier  is  at  the  origin)  joined  smoothly  by  a  neck  (figure  2)  and  with  a  spin-orbit 
potential  is  given  as: 


V+VSO  (4.2) 

where  T  is  the  kinetic  energy,  V  denotes  the  sum  of  the  oscillator  and  neck  potentials  and 
Vso  is  the  spin-orbit  interaction.  The  details  of  the  potentials  are  given  in  ref.  [28].  The 
above  potential  is  essentially  characterised  by  the  neck  parameter  e  =  U2/Ui  and  the 
separation  between  the  two  centres  R  =  ZI  —  ZI. 

A  large  number  of  levels  (w  150)  were  calculated  in  Ref.  [28]  and  the  best-fit  Brody 
parameter  was  extracted  over  a  range  of  values  of  the  separation  R.  The  system 
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Figure  2.  The  two  centre  oscillator  potential  along  the  z-axis.  The  barrier  heights 
without  (dotted  line)  and  with  (full  line)  the  neck  potentials  are  U\  and  U2, 
respectively. 
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Figure  3.  The  variation  of  the  chaos  parameter  /j  With  separation  R  for  two  centre 
shell  potential.  Dots  (•)  denote  the  calculated  values  and  the  lines  are  to  guide  the  eye. 
Lower  part  shows  the  evolution  of  the  profile  of  the  equipotential  surface.  The  neck 
parameter  is  held  fixed  at  €  =  0.8  for  the  cases  above. 

considered  was  of  mass  number  A  =  260  with  an  asymptotic  mass  asymmetry  of  1.6. 
Figure  3  shows  the  variation  of  the  chaos  parameter  with  R.  It  is  observed  in  this  figure 
that  the  system  is  almost  regular  for  near  spherical  shapes  (R  «  0)  as  expected.  With 
increasing  R,  the  single  particle  motion  becomes  increasingly  chaotic  as  the  system 
evolves  towards  a  dinuclear  complex.  At  still  larger  separations,  the  neck  eventually 
snaps  and  two  independent  spherical  nuclei  emerge  each  supporting  a  regular  single 
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Figure  4.  The  variation  of  the  chaos  parameter  /z  with  the  neck  parameter  e  for  two 
centre  shell  potential.  Dots  (•)  denote  the  calculated  values  and  the  lines  are  to  guide 
the  eye.  Lower  part  shows  the  evolution  of  the  profile  of  the  equipotential  surface.  The 
separation  is  held  fixed  at  R  =  10  fm  for  the  cases  above. 

particle  motion.  In  fission  or  collisions  between  two  heavy  nuclei,  the  nuclear  shape 
evolves  in  a  manner  similar  to  that  depicted  in  figure  3.  Therefore,  the  one-body 
dissipation  that  comes  into  play  in  the  dynamics  of  such  processes  has  to  be  scaled  by  the 
chaos  parameter  for  the  appropriate  shapes.  In  particular,  we  note  that  the  system  is  fully 
chaotic  at  R  «  10  fm  which  corresponds  to  the  touching  configuration  of  two  nuclei  of 
masses  100  and  160.  Since  in  damped  nuclear  collisions,  most  of  the  damping  occurs 
around  this  configuration,  a  scaling  factor  close  to  1  is  appropriate  for  the  one-body 
dissipation  in  collisions.  This,  in  turn,  justifies  the  use  of  the  original  wall  formula  of 
Blocki  et  al  in  collision  dynamics  which,  in  fact,  has  been  quite  successful  in  reproducing 
[24]  a  large  volume  of  experimental  data. 

On  the  other  hand,  energy  dissipation  takes  place  in  fission  when  the  nucleus  evolves 
from  a  compact  shape  to  the  scission  configuration.  It  has  been  observed  from 
phenomenological  studies  [30,31]  of  fission  dynamics  that  a  coordinate  dependence  of 
friction  is  essential  for  simultaneous  reproduction  of  both  the  fission  probability  and  the 
pre-scission  neutron  multiplicity.  A  reduced  friction  at  compact  shapes  followed  by  a 
gradual  increase  upto  the  scission  point  was  used  successfully  in  the  above  studies.  This 
shape  dependence  is  exactly  similar  to  our  findings  as  discussed  earlier.  Therefore,  the 
shape  dependence  of  nuclear  friction  arising  out  of  the  chaoticity  of  the  single  particle 
states  offers  a  possible  explanation  of  the  phenomenological  [30,  31]  shape  dependence. 

After  studying  the  separation  R  dependence  of  the  single  particle  chaos,  it  is  of  interest 
to  study  how  it  depends  on  other  shape  parameters.  In  particular,  we  would  be  interested 
to  find  the  dependence  of  the  chaos  parameter  on  the  neck  parameter  e  since  the  negative 
curvature  in  the  potential  profile  in  the  neck  region  provides  a  defocussing  effect  on  the 
single  particle  motion.  With  this  view,  the  chaos  parameter  is  obtained  for  a  number  of 
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Figure  5.  The  variation  of  the  chaos  parameter  ju,  with  deformation  0:2  for 
quadrupole  cavities.  Dots  (•)  denote  the  calculated  values  and  the  lines  are  to  guide 
the  eye.  Lower  part  shows  the  evolution  of  the  cavity  shape  with  increasing 
deformation. 


values  of  the  neck  parameter  (for  a  given  separation)  and  the  results  are  given  in  figure  4. 
It  shows  that  the  single  particle  motion  is  fully  chaotic  over  a  broad  range  of  the  neck 
potential.  This  implies  that  the  choice  of  the  neck  parameter  is  not  very  critical  so  far  as 
its  role  in  inducing  chaos  is  concerned. 


4.2  Chaos  parameter  for  a  deformed  cavity 

We  shall  now  consider  single  particle  motion  in  an  axially  symmetric  cavity  with 
quadrupole  deformation.  The  main  motivation  for  considering  this  system  is  twofold. 
Firstly,  we  would  like  to  study  systems  for  which  larger  number  of  levels  can  be 
numerically  obtained  and  secondly,  the  relevence  of  quadrupole  deformed  shapes  for  the 
study  of  giant  resonances  in  nuclei  [29].  Specifically,  we  shall  consider  a  cavity  whose 
surface  is  deformed  according  to  the  Legendre  polynomial  of  second  order, 


(4.3) 


where  RQ  is  the  radius  of  a  sphere  having  the  same  volume  as  the  deformed  cavity,  A  is  a 
volume  preserving  factor  and  0.2  is  the  strength  of  the  quadrupole  deformation.  The  single 
particle  eigenenergies  were  obtained  in  ref.  [18]  by  numerically  solving  the  Schrodinger 
equation.  The  nearest  neighbour  level-spacing  distributions  were  obtained  from  these 
eigenenergy  sequences.  The  spacing  distributions  were  subsequently  fitted  by  the  Brody 
distribution  to  yield  the  chaos  parameter  ju  for  cavities  of  various  deformations. 

We  shall  presently  extend  the  above  study  to  a  wider  range  of  deformation.  A  better 
description  of  the  variation  of  the  chaos  parameter  with  deformation  will  be  obtained  here 
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compared  to  the  two-centre  potential  system.  This  is  essentially  because  the  chaos 

parameter  is  obtained  here  at  smaller  intervals  of  the  deformation  and  larger  number  of 

levels  (w  1000)  are  employed  compared  to  the  two-centre  potential  calculation.  The  . 

variation  of  the  chaos  parameter  with  deformation  is  shown  in  figure  5.  The  nature  of  the  l"  * 

variation  is  similar  to  the  two-centre  potential  system,  namely  a  regular  to  chaos 

transition  is  observed  as  the  system  is  progressively  deformed.  It  is  of  interest  to  note  here 

that  in  typical  giant  quadrupole  oscillations  of  nuclei,  the  amplitude  of  the  deformation 

lies  within  0.1  and  one  finds  in  figure  5  that  the  chaos  parameter  is  small  (/u  <  0.3)  in  this 

range.  The  one-body  mechanism  for  the  damping  of  giant  resonances  is,  therefore,  a  very 

weak  mechanism  [29]. 

5.  Summary  and  discussion 

In  the  preceding  sections,  we  have  considered  the  wall  formula  for  one  body  dissipation 
by  relaxing  the  'full  randomization'  assumption  in  order  to  make  it  applicable  to  systems 
in  which  the  mixing  in  phase  space  is  only  partial.  In  particular,  we  investigated  the  effect 
of  irregularity  in  the  shape  of  the  one  body  potential  on  the  dissipation  mechanism. 
Closely  following  the  theoretical  framework  of  one  body  dissipation  developed  earlier  by 
Koonin  and  Randrup  [22],  we  arrived  at  a  dissipation  rate  which  is  a  scaled  version  of  the 
wall  formula.  We  could  show  that  the  scaling  factor  is  determined  by  a  measure  of  chaos 
in  the  single  particle  motion. 

Subsequently  we  calculated  this  measure  for  a  two-centre  potential  system  and  a 
quadrupole  deformed  cavity.  In  both  the  cases,  an  order  to  chaos  transition  in  the  single 
particle  motion  was  observed  with  increasing  deformation  of  the  system.  This  also 
corresponds  to  a  gradual  increase  of  the  one-body  dissipation  as  the  system  is 
progressivly  deformed.  Since  the  one-body  dissipation  is  expected  to  be  the  dominant 
mechanism  in  slow  collective  processes  such  as  low  energy  nucleus-nucleus  collisions 
and  fission,  it  is  of  interest  to  note  that  a  shape  dependent  friction  is  also  found  to  be 
essential  for  simultaneous  reproduction  of  both  the  fission  probabilty  and  the  pre-scission 
neutron  multiplicity  in  fission  dynamics  calculations  [30,  31].  It  was  found  in  these  works 
that  a  phenomenological  friction  which  is  very  weak  for  compact  configurations  and 
increases  after  the  neck  is  formed  in  a  fissioning  nucleus  can  describe  a  large  number  of 
experimental  data.  Interestingly,  this  corresponds  exactly  to  our  own  conclusions 
regarding  the  shape  dependence  of  nuclear  friction.  It  is  thus  quite  likely  that  the  original 
wall  formula  determined  from  a  series  of  simplifying  assumptions  may  require  some 
readjustments  to  account  for  the  relaxtion  of  these  assumptions  towards  more  realistic 
situations.  The  present  article  describes  some  of  the  efforts  in  this  direction  and  it  is 
expected  that  the  effects  of  shape-dependence  would  serve  as  an  essential  input  in 
evolving  a  better  model  of  one-body  dissipation. 
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Chaos  and  localization  in  coupled  quartic  oscillators 
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Abstract.  We  discuss  some  of  the  models  for  eigenfunction  localization  in  Hamiltonian  systems. 
In  particular,  we  review  some  of  our  work  on  classical  parametric  scaling  of  orbits  and 
identification  of  localized  states  in  a  two-dimensional  quartic  oscillator  system  which  is  deep  in  the 
classically  chaotic  region.  We  show  that  visual  methods  are  a  necessary  complement  to  quantitative 
methods  based  on  information  entropies.  Our  preliminary  results  indicate  that  the  periodic  orbit 
stability  determines  the  degree  of  localization  in  a  class  of  states,  even  when  the  stable  regions  are 
of  negligible  measure. 

Keywords.     Quantum  chaos;  wave  functions;  localizations. 
PACS  Nos    05.45;  03.65 

1.  Introduction 

The  study  of  quantization  of  chaotic  systems  has  been  an  active  field  of  research  for  very 
many  years  now,  and  has  seen  substantial  developments  in  the  theoretical  and 
experimental  techniques.  The  celebrated  trace  formula  of  Gutzwiller  [1]  connects  the 
quantum  spectra  semiclassically  to  unstable  periodic  orbits  embedded  in  a  sea  of  chaos. 
Random  matrix  theories  (RMT)  have  been  found  to  apply  to  remarkably  low-dimensional 
but  chaotic  systems,  while  their  putative  range  was  many  body  physics.  Experiments  have 
been  done  in  many  areas  from  atomic,  molecular  systems  to  mesoscopic  systems  for 
which  the  concepts  of  'quantum  chaos'  are  essential.  As  is  often  pointed  out,  while  we 
have  a  reasonable  grip  over  the  eigenvalues  of  a  quantum  spectra,  via  the  Gutzwiller 
formula,  eigenfunctions  are  still  a  region  of  mystery.  There  have  been  many  computations 
of  eigenfunctions,  and  even  some  experimental  realizations  have  been  made,  but  their  real 
nature  remains  obscure. 

We  deal  mainly  with  eigenfunctions  and  seek  to  understand  the  most  remarkable 
of  them,  which  we  believe  will  also  have  the  largest  experimental  impact.  Towards 
this  end  we  study  smooth  Hamiltonian  systems  in  two  degrees  of  freedom,  and  deal 
with  homogeneous  potentials.  We  are  able  to  compute  eigenfunctions  which  are  in 
the  highly  excited  regions  of  the  spectrum  and  for  parameter  ranges  which  take  us  deep 
into  classical  chaos.  We  note  that  in  contrast  to  previous  work  the  eigenfunctions 
computed  really  probe  the  semiclassical  regime.  We  have  noticed  the  predominance 
of  states  which  show  density  enhancements  near  unstable  periodic  orbits  called 
'scars',  but  also  have  studied  the  random  matrix  properties  of  these  states,  and  we 
find  that  paradoxically  even  some  strongly  scarred  states  can  show  random  matrix 
like  distributions.  There  are  however  embedded  in  this  'chaotic'  or  irregular  spectrum  a 


localized. 

Percivai's  early  conjecture  [2]  that  there  are  two  types  of  states  in  a  typical  spectrum, 
the  regular  and  the  irregular,  has  been  amply  verified  by  later  computations.  However  we 
note  that  the  series  we  study  here  is  not  associated  with  a  classically  regular  region  of 
phase  space  although  they  have  all  the  hallmarks  of  regular  states.  We  associate  simple 
periodic  orbits  to  these  states  and  study  the  effect  of  classical  stability  oscillations  in  the 
regime  of  extreme  chaos.  We  focus  attention  in  this  paper  on  the  study  of  these 
eigenfunctions,  and  since  they  are  strongly  localized,  we  wish  to  place  them  in  the 
context  of  the  general  localization  lore. 

One  of  the  distinguishing  features  of  quantum  mechanics  is  the  non-spreading  of  the 
eigenfunctions  in  the  whole  of  the  classically  accessible  configuration  or  momentum 
space,  succinctly  called  localization.  Essentially,  this  is  the  tendency  of  the  quantum 
eigenfunctions  to  accumulate  only  on  certain  subregions  of  the  classically  allowed  region. 
In  recent  years,  many  studies  on  eigenfunctions,  particularly  of  classically  chaotic 
systems,  have  focussed  on  this  particular  aspect  of  quantum  mechanics  [1,5,11]. 
Quantum  localization  is  a  generic  phenomenon  in  many  areas  of  physics.  In  condensed 
matter  physics,  Anderson's  model  [3]  is  one  of  the  earliest  attempts  to  explain 
localization  for  the  motion  of  an  electron  in  a  random  alloy,  in  which  potential  well 
depths  at  each  lattice  site  are  random.  In  this  case  it  has  been  shown  that  the 
wavefunctions  in  configuration  space  are  exponentially  localized.  In  quantum  optics, 
Kuklinski  [4]  has  shown  that  Anderson  type  localization  also  occurs  in  a  photon  number 
space  if  sharp  pulses  of  coherent  light  interact  with  an  optical  cavity  with  non-linear 
susceptibility.  It  has  been  suggested  that  this  localization  can  be  used  to  generate  an 
electromagnetic  field  in  a  pure  Fock  state.  Atomic  and  molecular  systems  provide  many 
instances  of  localization  phenomena.  Using  Husimi  distributions  Muller  and  Wintgen  [6], 
have  studied  the  localization  in  the  case  of  the  hydrogen  atom  in  a  uniform  magnetic  field 
and  connected  most  of  the  eigenstates  to  the  classical  periodic  orbits  and  invariant 
manifolds.  Microwave  ionization  of  hydrogen  atom  from  its  highly  excited  states  has 
been  the  subject  of  active  investigation,  both  theoretically  and  experimentally. 
Theoretical  studies  on  an  one-dimensional  model  predicted  [7]  that  for  hydrogen  atoms 
in  microwave  fields,  as  a  result  of  localization,  there  would  be  an  increase  in  quantum 
ionization  threshold  over  and  above  the  classical  threshold,  beyond  a  critical  value  of  the 
microwave  field.  This  has  been  experimentally  verified  [8]  .  More  recently,  Wilkinson 
et  al  [9]  using  tunnel-current  spectroscopy  in  semiconductor  heterostructures  were  able  to 
experimentally  observe  the  effect  of  localized  states. 


2.  Localization  in  chaotic  quantum  systems 

Localization  has  been  used  to  denote  phenomena  in  a  wide  class  of  problems,  and  there  is 
probably  no  consensus  on  what  exactly  this  term  means.  Even  in  the  context  of 
quantization  of  chaotic  systems,  localization  could  mean  the  scarring  of  eigenfunctions 
due  to  simple  orbits,  or  something  even  more  stringent.  There  is  perturbative  localization 
[10],  dynamical  localization  [11],  Anderson  localization  [3],  scarring  localization  [33], 
and  a  bewildering  array  of  mechanisms  to  go  with  them.  We  will  focus  on  localization 
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which  is  close  to  scarring  localization,  however,  as  will  be  apparent  this  is  a  strong  form 
of  scarring  localization,  that  one  may  attribute  also  to  adiabatic  mechanisms,  and  is 
exponential  in  some  sense. 

Below  we  study  localization  in  model  quantum  systems  whose  classical  limit  is 
chaotic.  Study  of  localization  is  important  since  it  is  a  pure  quantum  phenomenon  whose 
mechanism  and  systematics  are  yet  to  be  understood  in  complete  detail.  Secondly,  from 
the  point  of  view  of  quantum  chaos,  studies  on  deviations  of  theoretical  predictions  for 
energy  level  statistics  in  the  presence  of  localized  states  have  been  done  [12]. 

Based  on  the  semiclassical  eigenfunction  hypothesis  and  Shnirelmann's  theorem  [13], 
Berry  [14]  and  Voros  [15]  argued  that  a  typical  eigenfunction  in  the  irregular  quantum 
regime  is  ergodic  over  the  entire  configuration  space.  At  first  sight,  it  would  appear  that 
since  the  underlying  classical  dynamics  of  the  chaotic  systems  is  ergodic,  any  wavepacket 
launched  on  any  of  the  infinite  periodic  orbits  will  spread  over  all  the  accessible  space 
and  hence  would  not  support  localized  eigenfunction  structures.  First  results  contrary  to 
the  above  hypothesis  came  from  the  early  work  of  McDonald  and  Kaufmann  [16],  from 
the  studies  on  eigenfunction  structures  of  stadium  billiards.  Heller  [17],  using 
semiclassical  wave  propagation  techniques,  argued  that  scars  can  occur  near  the  least 
unstable  periodic  orbits.  In  chaotic  systems,  all  periodic  orbits  are  unstable  and  they 
contribute  significantly  to  density  enhancements  in  the  vicinity  of  the  orbit  [18].  In  this 
study  it  was  also  shown  that  wavefunctions  averaged  over  small  energy  intervals  can  be 
related  to  a  sum  over  periodic  trajectories  in  the  semiclassical  limit. 

Other  model  systems,  like  nonlinearly  coupled  quartic  oscillators,  kicked  rotor  and  an 
atom  in  a  magnetic  field  were  studied  and  they  also  showed  sharp  localization.  The 
kicked  rotor  has  been  extensively  investigated  both  classically  and  quantum  mechanically 
[19,  33].  The  Hamiltonian  of  the  periodically  kicked  rotor  is  given  by 


where  V(d]  =  cos(0),  /  is  the  moment  of  inertia,  T  is  the  kicking  period  and  classically  fj, 
is  the  only  effective  parameter  in  the  model  and  it  represents  the  kick  strength. 
Classically,  for  large  values  of  kick  strength,  i.e  JJL  >  1.0  the  dynamics  is  chaotic.  The 
quantum  mechanical  problem  is  solved  by  constructing  the  Floquet  operator  and  solving 
the  resulting  eigenvalue  problem  in  the  momentum  representation.  After  some  mathe- 
matical transformations,  the  Schrodinger  equation  for  the  kicked  rotor  in  momentum 
representation  looks  similar  to  the  one  for  Anderson  model  for  the  motion  of  an  electron 
in  a  crystalline  lattice  of  random  potential  wells.  Thus,  by  mapping  the  kicked  rotor 
problem  to  the  Anderson's  problem,  Grempel  et  al  [20]  attributed  localization  in  the 
kicked  rotor  to  destructive  quantum  phase  interference  since  the  same  mechanism  is 
responsible  for  localization  in  Anderson's  model.  However,  for  all  their  avowed 
similarity,  Anderson's  model  and  the  kicked  rotor  are  not  the  same.  The  potential  in 
kicked  rotor  is  not  strictly  random,  it  is  at  best  pseudo  random  and  the  quasi-energy  of  the 
rotor  becomes  a  parameter  of  the  potential. 

Rydberg  atoms  in  strong  magnetic  fields  have  emerged  as  one  of  the  testing  grounds 
for  quantum  chaos.  The  interaction  for  hydrogenic  atom  in  strong  magnetic  field  contains 
a  quadratic  Zeeman  term  which  makes  the  problem  non-trivial.  We  consider  a  particular 
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magnetic  quantum  number  manifold,  say  m  =  0,  and  assume  the  magnetic  field  B  to  be 
directed  along  the  z  axis.  In  such  a  case,  the  Hamiltonian  written  in  cylindrical 
coordinates  is, 


where  0  motion  has  been  separated  and  uc  =  eB/Mc  is  the  cyclotron  frequency. 
Rewriting  it  in  semi-parabolic  coordinates  [21]  we  get, 

-  4e(«2  +  *2)  +  8^V  +  A  (3) 


where  the  scaled  energy  e  =  £7  2/3  is  the  parameter  and  7  =  B/BQ  with  BQ  =  2.35  x 

105  T.  It  is  to  be  noted  that  in  this  form  this  system  resembles  the  Hamiltonian  for  a 

particle  in  a  two-dimensional  non-linear  potential  with  e  as  the  parameter.  Many  authors  j 

[21]  have  attempted  this  problem  theoretically  as  well  as  experimentally.  Rau  [10]  has  ! 

suggested  that  localization  in  this  problem  can  arise  from  the  perturbative  breaking  of  ^;, 

degeneracy.  The  integrable  part  of  the  atomic  many  body  system,  at  high  excitation,  ^ 

displays  a  very  high  degree  of  degeneracy.  For  example,  in  the  integrable  quantum 

hydrogen  atom  problem,  states  with  the  same  principal  quantum  number  n,  but  with 

different  angular  momentum  quantum  numbers  /,  are  degenerate  and  form  a  manifold  or  a 

subspace  in  the  full  Hilbert  space.  Perturbations  to  these  integrable  systems  lifts  these 

degeneracies  and  is  accompanied  by  mixing  of  base  states  of  these  manifolds.  Since 

angular  momentum  states  mix  and  diffuse,  the  uncertainty  principle  restricts  wavefunc- 

tions  to  be  localized  spatially  because  the  angle  and  angular  momentum  are  conjugate 

variables.  Furthermore  it  has  been  shown  that  this  perturbative  localization  is  common  to 

the  asymmetric  rotor  and  to  a  wide  class  of  few  body  problems  in  atomic  and  nuclear 

physics. 

3.  The  coupled  quartic  oscillator 

3.1  Classical  aspects 

One  of  the  most  widely  studied  models  of  quantum  chaos  is  the  non-linearly  coupled 
quartic  oscillator  (henceforth  referred  to  as  quartic  oscillator)  [11,  22,  43].  We  study  the 
Hamiltonian  given  by 

H(px,Py,x,y,OL}  =  p2x  +p2  +  x4  +y4  +  co2)?2,  (4) 

where  a  is  a  parameter.  This  Hamiltonian  is  of  interest  in  SU(2)  Yang-Mills  field 
theory  in  the  limiting  case  when  a  =  oo.  Since  the  system  is  homogeneous  the 
classical  motion  scales  with  energy.  This  implies  that  from  the  knowledge  of 
classical  dynamics  at  one  energy,  dynamics  at  any  other  energy  can  be  obtained 
[23].  Though  the  classical  dynamics  of  this  system  is  reported  extensively  [43,  29], 
we  recollect  some  of  its  salient  features.  The  potential  is  bounded  for  all  values  of 
a  greater  than  -2,  except  at  a  =  oo.  In  this  limit  when  a  =  oo,  though  classically 
the  potential  is  open,  the  quantum  energy  spectrum  is  discrete.  The  system  is  integrable 
as  well  as  separable  for  a  =  0,2, 6.  The  Poincare  section  shows  predominance  of 
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irregular  trajectories  as  a  increases  beyond  6.  Though  the  regular  region  shrinks  with 
an  increase  in  a,  due  to  bounded  stability  oscillations  of  the  channel  periodic  orbit, 
there  are  certain  windows  of  a  values  for  which  an  infinitesimally  small  regular  region 
around  the  orbit  exists,  even  for  very  high  values  of  a.  The  duality  in  the  parameter 
space  enables  us  to  map  the  interval  [0,  2]  onto  [6,  2]  and  the  interval  [—2,  0]  onto 
[oo,  6]  and 

f  .  . 

' 


(a  +  2)1/6  (a'+2)1/6 

where 


Geometrically,  this  transformation  given  by 


corresponds  to  rotation  of  the  potential  by  Tr/4  and  a  scaling. 

We  have  recently  shown  that  the  Hamiltonian  in  (4)  also  locally  scales  in  the  vicinity  of 
the  channel  periodic  orbit  with  respect  to  the  parameter  a  [26].  The  stability  of  the  simple 
channel  periodic  orbit  (x,y  =  Q,px,py  =  0),  as  measured  by  TrJ(a),  where  J(a)  is  the 
monodromy  matrix,  displays  bounded  oscillations  as  a  function  of  a.  The  formula  due  to 
Yoshida  [24]  for  the  trace  of  the  monodromy  matrix,  applied  to  the  case  of  the  quartic 
oscillator  in  (4)  gives 

Tr  J(a)  =  2^2  cos  Q  vT+4a)  .  (8) 

The  orbits  are  stable  if  |  Tr  J(a}\  <  2  and  are  unstable  if  |  Tr  J(a)|  >  2.  By  inverting  this 
formula  using  these  stability  conditions  we  find  that  at  values  of  a  =  n(n  +  1),  where  n  is 
an  integer,  the  channel  orbit  changes  stability  through  a  pitchfork  bifurcation.  Figure  1 
shows  the  stability  curves  for  the  channel  orbit  and  the  45°  diagonal  orbit.  The  channel 
orbit  corresponds  to  positive  a  range  and  the  diagonal  orbit  to  negative  a  values,  as  per 
the  transformation  in  (6).  It  is  useful  to  note  that  the  Hamiltonian  bifurcation  theory 
applied  to  two-dimensional  conservative  systems,  allows  for  five  and  only  five  types  of 
bifurcations  [25]. 

We  have  numerically  demonstrated  that  if  the  stability  at  two  different  values  a  and  a' 
are  equal,  i.e  Tr  /(a)  =  Tr  J(a'),  then  the  Poincare  section  locally  scales  around  the 
vicinity  of  the  channel  orbit.  If  the  system  possesses  reflection  symmetries  about  a 
particular  axes,  say  the  x  axis,  then  the  Poincare  section  simplifies  to  the  condition  x  =  0 
with  either  sign  ofpx.  This  constitutes  a  half-Poincare  map  [25]  and  has  all  the  properties 
of  the  conventional  Poincare  map.  Let 

q'i  =f(<b«P2\  <*),     p'2  =  g(qi,pi\  a)  (9) 

be  the  half-Poincare  map,  with  a'  >  a.  Due  to  the  reflection  symmetries  of  the  oscillator, 
f(l2,p-i]  a)  =  —f(—qi,  —P2\  OL),  and  similarly  for  g.  Since  the  Poincare  section  scales,  the 
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Figure  1.    The  stability  oscillations  of  the  channel  orbit  for  the  Hamiltonian  (4)  as  a 
function  of  a. 


above  functions  also  will  scale  as  follows, 


-71 


A  ~72 


(10) 


where  71  and  72  are  scaling  exponents  with  values  close  to  0.625  and  0.325  respectively. 
In  fact,  we  have  presented  evidence  that  is  more  general  and  covers  all  the  two- 
dimensional  homogeneous  Hamiltonian  systems  whose  Poincare  sections  around  channel 
orbits  scale  with  a  [26].  We  have  demonstrated  that  for  the  class  of  Hamiltonians, 


#2n  =  \p(  +  \p\  +  i  (frfi  +  h£)  4-  \  (fig-2 

the  scaling  exponents  depend  only  on  the  homogeneity  of  the  potential  as  follows, 
2n+l  2n-l 


(11) 


4n 


4n 
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a*  =  \P\  +\P*+TH  (f3iq?  +  hffi  +7i 

the  channel  orbit  is  always  marginally  stable  independent  of  a.  The  stable  region  around 
this  orbit  scales  with  a  and  the  scaling  exponents  are  given  by 


In  §7  we  will  see  that  stability  oscillations  of  the  channel  periodic  orbit  has  implications 
for  wavefunction  localization. 

3.2  Quantum  aspects 

Quantum  mechanically  we  solve  the  Schrodinger  equation, 


(15) 

using  symmetrized  linear  combination  of  one-dimensional  quartic  oscillator  eigenfunc- 
tions  given  by 

where  A/"  is  the  normalization  constant  depending  on  n  and  m,  both  even  integers  and  the 
index  j  represents  a  unique  pair  (n,  w)  of  these  integers.  The  functions  (f>n(x)  are  obtained 
numerically  by  accurately  solving  a  one-dimensional  Schrodinger  equation  for  bound 
states  of  quartic  oscillator  (see  [27]  and  references  therein).  Using  160  to  175  one- 
dimensional  quartic  oscillator  basis  states,  we  numerically  diagonalized  a  large 
Hamiltonian  matrix  of  orders  ranging  from  about  12000  to  15000  on  a  IBM  RISC 
6000/580  machine  to  obtain  about  2000  converged  eigenvalues  and  eigenfunctions.  In  all 
our  calculations,  positive  values  of  a  in  the  range  [6,  oo]  were  used  since  more 
eigenvalues  converged  in  this  case  than  for  the  corresponding  range  [0,  —2].  A  detailed 
account  of  numerical  techniques  for  large  matrix  diagonalization  and  its  convergence 
problems  will  be  presented  elsewhere. 

This  potential  belongs  to  the  C^  symmetry  group,  namely  the  group  of  all  the 
transformations  of  a  square.  This  group  has  four  one-dimensional  irreducible  representa- 
tions, but  for  all  our  calculations  reported  in  this  paper  we  confined  ourselves  to  A\ 
representation,  which  is  symmetric  under  reflections  on  the  axes  and  diagonals.  In  fact,  the 
basis  function  given  in  (16)  transforms  according  to  this  representation.  By  choosing  such 
appropriately  symmetrized  basis  functions  the  Hamiltonian  matrix  is  block  diagonal. 

Since  the  system  in  (4)  classically  scales,  the  quantum  spectra  at  different  values  of 
Planck's  constant  are  related  by 

E(H]  a)  =  C4E(/z/C3;  a),  (17) 

where  C  is  a  constant.  The  duality  in  parameter  space  gives  us  one  more  relationship 
between  quantum  energy  levels  between  different  values  of  a  related  by  (6): 

£(»;<*)    _    E(h-a') 

1  I  f\  1  lf\   *  \          / 
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4.  Localized  states  in  quartic  oscillator 

In  the  integrable  quartic  oscillator  all  the  eigenfunctions  are  localized  on  the  invariant 
tori.  Here  we  address  the  question  of  systematics  and  morphology  of  highly  excited  states 
when  the  classical  system  is  largely  chaotic.  Structurally,  most  of  the  localized  states  of 
the  quartic  oscillator  display  one  or  more  of  the  following  three  kinds  of  features,  namely, 
(i)  localization  along  the  channel  periodic  orbits,  (ii)  localization  along  the  orbit  45°  to 
the  channel  states,  (iii)  localization  along  a  circle,  called  ring  states.  However,  there  are 
other  scarred  states  with  interesting  structures  which  do  not  fall  in  any  of  the  above 
categories.  At  the  moment,  many  of  these  scarred  states  can  only  be  visually  identified 
since  there  are  no  quantitative  measures  to  identify  them.  One  of  the  straightforward 
methods  of  identifying  a  scarred  state  is  to  look  at  it  visually.  To  this  end,  we  define  a 
configuration  space  density 

p£(jt,y)  =  |^(:c,y)|2.  (19) 

By  plotting  \3f(x,  y)|2  as  an  intensity  plot,  we  identify  the  scarred  states.  In  one  of  the  first 
extensive  studies  on  the  eigenfunctions  of  the  quartic  oscillator,  Eckhardt  et  al  [28]  have 
reported  eigenfunctions  in  configuration  space  of  sixty  states  starting  from  the  ground 
state,  which  shows  all  the  familiar  localized  structures  mentioned  above.  They  have 
identified  the  classically  unstable  periodic  orbits  that  produce  these  localized  structures. 
Probing  much  deeper  in  the  semiclassical  regime,  even  as  high  as  up  to  2000  states  that 
we  computed  for  various  a  values  from  60  to  96,  we  found  that  the  localized  states 
scarred  by  the  channel  orbits  continue  to  exist.  These  special  states  scarred  by  channel 
orbits  form  a  dominant  series  in  the  semiclassical  region,  which  implies  that  it  is  most 
easily  amenable  to  semiclassical  treatment  as  will  be  seen  later.  We  showed  the  existence 
of  another  class  of  scarred  states  called  shadow  states,  which  are  weakly  scarred  by  the 
channel  orbit  in  addition  to  some  other  nearby  orbits  and  hence  show  weak  localization 
features  in  comparison  with  its  dominant  counterpart.  These  states  occur  near  the  channel 
localized  states  and  form  another  series  hi  the  spectrum  which  seems  to  be  absent  at  the 
lower  end  of  the  spectrum.  Another  peculiar  set  of  states  were  those  which  localized  on  a 
point  and  consequently  intensity  in  the  rest  of  the  configuration  space  was  weak.  Some 
intensity  plots  of  channel  localized  and  shadow  states  and  of  four  weak  states  have  been 
published  [29]. 

There  are  two  important  points  to  be  noted.  Firstly,  many  Hamiltonian  systems  in  the 
form  of  (1 1)  allow  for  these  types  of  localized  states.  For  instance,  the  potential  given  by 
V(x,y)  =  x2  +  y2  +  /fix2?2,  where  /3  is  a  parameter,  also  shows  these  familiar  localized 
structures  [30],  Secondly  since  the  quantitative  measures  to  identify  the  scarred  states  are 
lacking,  the  visualization  of  eigenfunctions,  at  the  moment,  seems  to  be  the  viable 
alternative.  In  fact,  as  we  will  see  later,  the  visual  techniques  complement  the  numerical 
methods,  even  in  such  cases  where  certain  measures  do  identify  particular  kind  of  scarred 
states. 

5.  Quantitative  methods  for  localized  states 

The  states  scarred  by  channel  orbits  form  a  small  fraction  of  the  total  number  of 
eigenstates  we  computed.  There  are  about  60-70  channel  localized  states  in  the  first  2000 
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states,  many  others  exhibit  various  other  types  of  structures  and  cannot  be  easily 
categorized.  Hence  to  go  beyond  morphological  features  and  take  a  complete  view  of  all 
the  states,  quantitative  methods  are  needed.  The  visual  means  are  more  computer  inten- 
sive and  for  systems  with  more  than  two  degrees  of  freedom,  they  may  not  be  helpful.  So 
we  proposed  the  information  entropy  measure  as  a  means  to  identify  localized  states  [29]. 
Entropy  is  a  well  defined  concept  classically  and  an  account  of  its  applications  in 
classical  chaotic  systems  is  given  by  Gutz  wilier  [1].  A  useful  review  of  general  properties 
of  entropy  is  given  by  Wehrl  [31].  Powell  and  Percival  [32]  and  Izrailev  [33]  have  used 
information  entropy  measure  for  studying  different  properties  of  classical  and  quantum 
chaotic  systems.  Recently,  it  had  also  been  used  in  the  theory  of  quantum  inference  [34]. 
We  employ  the  definition  of  information  entropy  of  a  quantum  state  as  follows, 

'  (20) 


where  a°j  are  the  eigenvector  coefficients  for  the  nth  eigenf  unction  of  the  system  which 
lias  M  components,  where  M  is  the  dimensionality  of  the  Hamiltonian  matrix  for  the 
problem.  This  measure  is  obviously  basis-dependent.  This  measure  identifies  the 
localized  states  by  showing  a  pronounced  dip  in  the  information  entropy  curve 
corresponding  to  various  localized  states.  Figure  2  shows  the  information  entropy  plotted 
against  logarithm  of  the  state  number  for  quartic  oscillator  for  a  =  90.00,  for  the  range  of 
states  from  950  to  2000.  The  well  pronounced  dips  correspond  to  channel  localized  states. 
Shadow  states  are  marked  by  less  pronounced  dips  and  are  sometimes  ambiguous.  It  must 
be  emphasized  that  this  measure  picks  out  only  such  localized  states  and  no  others.  We 
also  computed  the  momentum  space  intensities,  by  taking  the  Fourier  transform  of  the 
sigenfunctions.  Their  corresponding  information  entropies  were  also  computed.  We 
noticed  that  the  same  states  showed  minimum  entropy  in  both  momentum  and 
configuration  space,  leading  to  the  suspicion  that  these  may  be  some  sort  of  minimum 
uncertainty  states.  Further  work  in  this  direction  is  in  progress. 

The  structure  of  the  eigenvector  coefficients,  being  a  more  fundamental  entity,  contains 
the  signature  of  these  localized  states.  The  leading  unperturbed  basis  vectors  of  the 
localized  states  have  (N,0),  (TV,  2),  (Af,4)  ....  structure,  where  N  is  an  even  integer 
depending  on  the  particular  eigenstate.  It  is  seen  that  for  localized  states  most  eigenvector 
coefficients  a%j  are  effectively  zero  and  so  very  few  basis  states,  in  the  above  order, 
effectively  contribute  to  the  wavefunction. 

It  is  now  fairly  well  established  from  the  pioneering  work  of  Bohigas  et  al  [35]  that 
quantal  energy  level  spacings  distribution  for  classically  chaotic  quantum  systems  fall 
under  three  universality  classes,  determined  by  the  symmetry  properties  of  the 
Hamiltonian  of  the  system.  These  level  spacing  distributions  are  modeled  by  appropriate 
ensembles  from  random  matrix  theory.  For  the  Gaussian  Orthogonal  Ensemble  (GOE), 
which  models  the  fluctuation  properties  of  the  quartic  oscillator,  the  eigenvectors  are  x2 
distributed  [36].  In  this  case,  the  RMT  prediction  for  information  entropy  [37]  is 
expressed  in  terms  of  digamma  function  \I>  as, 
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Figure  2.    The  information  entropy  plotted  against  the  logarithm  of  the  state  number 
at  a  =  90.00. 


where  D  is  large  and  is  the  dimensionality  of  the  random  matrices  in  the  ensemble.  In 
figure  2,  the  information  entropy  curve  reveals  an  almost  linear  behaviour  as  predicted  by 
the  random  matrix  theory.  In  the  unperturbed  basis,  the  slope  of  the  line  fitted  to  the 
envelope  of  the  information  entropy  curve  is  close  to  unity.  However,  we  observed  that  in 
position  and  momentum  basis  the  information  entropy  measure  for  the  eigenfunctions 
does  not  exactly  follow  the  RMT  prediction.  Thus,  apart  from  basis  dependence  of 
eigenvector  statistics,  the  question  seems  to  hinge  on  the  effective  dimensionality  of  a 
state  in  the  infinite  dimensional  Hilbert  space.  Integrated  density  of  states  could  be  one 
such  measure  of  effective  dimensionality.  Another  method  is  to  employ  the  participation 
ratio  defined  as, 


Pn  = 


1 


which  gives  the  number  of  states  that  effectively  participate  in  the  building  up  of 
wavefunction  [381.  We  will  be  reporting  more  results  on  information  entroov  in  §  7. 
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Figure  3.  The  cumulative  eigenvector  distribution  curve.  The  solid  line  represents 
the  standard  cumulative  Porter-Thomas  distribution.  The  curves  (a)  correspond  to  two 
channel  localized  states,  409  and  423  at  a  =  64.66.  In  (b),  the  dotted  curve 
corresponds  to  a  generic  state  and  dashed  curve  corresponds  to  a  strongly  scarred 
state. 


The  eigenvector  distribution  for  the  Hamiltonian  systems  in  infinite  dimensional 
Gilbert  space  has  not  yet  been  well  studied.  There  have  been  many  studies  for  finite 
limensional  systems  and  it  has  been  noted  that  the  Porter-Thomas  distribution  is 
ippropriate  for  time  reversal  symmetric  chaotic  systems  [39].  Yet  the  basis  dependence  of 
ill  such  analysis  is  obvious.  One  way  out  is  to  select  a  generic  and  physical  basis.  In  our 
:ase  the  unperturbed  basis  is  one  such  possibility;  however  it  is  clear  that  it  preferentially 
reats  the  channel  localized  states. 

The  eigenvectors  of  the  coupled  quartic  oscillator  studied  above  follow  the  RMT 
>rediction  reasonably  closely  and  three  cases  arise,  namely,  (i)  the  eigenvectors  of  those 
igenfunctions  which  are  spread  over  all  the  classically  accessible  domain,  (ii)  the 
igenvectors  of  strongly  scarred  states  and  (iii)  the  eigenvectors  of  the  channel  localized 
tates.  Figure  3  shows  the  cumulative  Porter-Thomas  distribution  for  the  square  of  the 


eigenvector  components  normalized  to  unit  mean  given  oy 


where  Erf  is  the  standard  error  function.  It  is  evident  that  the  cases  (i)  and  (ii)  have  similar 
eigenvector  distribution,  which  is  close  to  the  standard  Porter-Thomas  distribution  (solid 
line).  The  strongly  scarred  state  is  state  no.  552  at  a  —  64.66  (see  ref.  [40]  for  a 
configuration  space  plot  of  state  552)  ,  and  the  generic  state  is  state  no.  439  at  the  same 
value  of  a.  In  this  sense,  even  the  strongly  scarred  state  can  be  said  to  exhibit  random 
matrix  like  distribution.  However  the  eigenvectors  of  the  channel  localized  states  deviate 
considerably  from  the  RMT  prediction,  as  figure  3  shows. 

One  reason  for  deviation  from  the  RMT  behaviour  for  the  channel  localized  states 
could  be  that  in  the  chosen  basis  states,  the  eigenvectors  of  the  localized  states  have  come 
in  for  a  special  treatment,  which  also  makes  these  states  stand  out  in  the  entropy  analysis. 
A  physical  basis  that  may  distinguish  the  scarred  states  from  the  rest  could  be  some 
coherent  basis  set,  but  not  the  position  or  momentum  basis  as  indeed  we  have  already 
demonstrated  that  the  minima  of  the  entropy  in  these  basis  states  also  correspond  only  to 
channel  localized  states  [29]. 


6.  Adiabatic  approaches  for  the  quartic  oscillator 

In  the  last  two  decades,  much  progress  has  been  made  in  the  semiclassical  quantization  of 
classically  chaotic  systems  and  the  work  of  Gutzwiller  [1]  in  this  direction  has  thrown 
more  light  on  the  classical-quantum  correspondence.  An  adiabatic  approach,  using  the 
standard  semiclassical  quantization,  has  been  used  to  explain  the  channel  and  ring 
localized  states  and  has  been  partially  successful. 

The  basic  premise  of  the  adiabatic  method  is  that  the  channel  motion  in  two  perpen- 
dicular directions  have  two  vastly  different  time  scales  and  hence  slow  and  fast  motions 
can  be  treated  separately.  For  instance  the  motion  along  the  channel  is  assumed  to  be 
slower  than  the  motion  perpendicular  to  it,  resulting  in  the  perpendicular  action  being  an 
adiabatic  invariant  resulting  in  an  adiabatic  potential  along  the  channel.  Thus  by  separat- 
ing motion  in  one  direction  from  the  other,  the  resulting  locally  integrable  Hamiltonian  is 
quantized  to  obtain  the  energy  eigenvalues.  However,  this  formalism  does  not  separate  the 
eigenvalues  coming  from  different  irreducible  representations,  quite  like  the  quantum 
adiabatic  method.  For  a  fair  comparison,  the  eigenvalues  from  this  approach  must  be 
tallied  with  the  average  quantum  eigenvalues  from  different  irreducible  representations. 
The  adiabatic  approximation  works  best  for  states  localized  far  away  from  the  origin  for 
most  of  the  time,  like  the  channel  localized  states.  This  is  due  to  the  adiabatic  curves 
having  non-analytic  properties  at  the  origin.  The  quantum  mechanically  untamed 
potential  V(xty;a)  =  copy1  has  been  studied  in  cartesian  adiabatic  approximation  [41]. 
Ring  type  localization  has  been  explained  using  the  polar  coordinate  adiabatic  breakup 
[42]. 

At  this  point  some  remarks  on  the  ring  states  and  the  accuracy  of  adiabatic  methods  is 
in  order.  The  adiabatic  treatment  involving  semiclassical  quantization,  by  its  very  nature, 
presupposes  the  existence  of  a  series  of  particular  type  of  scarred  states.  However,  our 
own  experience  with  the  computation  and  visualization  of  nearly  2000  eigenfunctions  for 
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Figure  4.  The  configuration  space  intensities  for  two  consecutive  states  for  (a) 
a  =  64.66,  states  1376  and  1377  with  energies  2770.2953  and  2770.5087  respectively, 
(b)  a  =  90.00,  states  1857  and  1858  with  energies  3642.9626  and  3643.0294 
respectively.  The  colour  code  for  all  images  is  such  that  red  is  the  maximum,  blue  is 
low  and  black  is  zero. 


various  a  values  in  the  deep  semi  classical  regime,  amply  confirms  a  series  for  channel 
states  but  does  not  show  any  evidence  for  a  series  of  ring  states.  One  reason  for  the 
absence  of  clean  ring  states  in  the  deep  semiclassical  regime  could  be  the  instability  of 
the  underlying  periodic  orbits.  It  may  be  remarked  that  the  previous  studies  [42]  rely  on 
low-lying  eigenfunctions  which  do  not  show  many  avoided  crossings  and  in  spite  of  it 
being  such  a  simple  spectrum  in  the  energy  region  considered,  the  identification  of  ring- 
states  seems,  at  times,  to  be  subjective.  The  highly  excited  region  exhibits  a  large  number 
of  avoided  crossings  and  many  localized  states  stand  on  them  and  the  adiabatic  approach 
cannot  unambiguously  identify  such  states  which  differ  in  energy  by  less  than  the  mean 
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a  large  number  of  such  cases,  it  becomes  necessary  to  visualize  eigenfunctions  to  confirm 
the  results  obtained  by  this  approach.  In  this  sense,  the  adiabatic  approach  provides  more 
qualitative  insight  rather  than  quantitative  result.  As  pointed  out  before,  the  visual 
methods  have  to  complement  these  other  techniques  for  correct  interpretation  of  these 
approximate  methods. 

We  proceed  knowing  fully  well  the  limitations  of  this  method.  Eckhardt  et  al  [28]  have 
reported  a  variation  of  this  method  based  on  adiabatic  stability.  They  have  shown  that  one 
of  the  least  unstable  periodic  orbits,  namely  the  channel  orbit,  corresponds  to  a  prominent 
peak  in  the  Fourier  transform  of  quantal  spectra  and  have  identified  this  orbit  as  the  one 
that  is  responsible  for  scarring  a  series  of  eigenfunctions  in  quartic  oscillator  spectrum. 
Their  analysis  is  based  on  the  concept  of  adiabatic  stability.  Adiabatic  stability  refers  to 
an  orbit  that  is  globally  unstable  as  measured  by  its  Lyapunov  exponent  but  still  stable  for 
short  times  of  the  order  its  time  period.  The  channel  orbit,  called  the  'adiabatic  well',  is 
identified  as  the  one  around  which  the  other  orbits  in  the  vicinity  wind  around.  The 
adiabatic  Hamiltonian  associated  with  the  channel  orbit  is  constructed  using  these  other 
orbits  in  a  coordinate  system  defined  by  the  channel  orbit,  which  coincides  with  the 
cartesian  coordinates  in  this  case.  Adiabatic  eigenvalues  are  obtained  by  quantizing 
actions  of  these  other  orbits  along  and  perpendicular  to  the  channel  orbit.  The  eigenvalues 
obtained  using  this  formalism  have  a  good  correspondence  with  the  quantal  eigenvalues 
of  the  channel  localized  states.  The  adiabatically  stable  periodic  orbits  are  thus  shown  to 
scar  the  channel  states. 

While  the  above  works  show  that  the  existence  of  adiabatically  stable  channel  orbits 
correlate  with  quantum  density  enhancements  along  the  channels,  this  model  ignores  the 
effect  of  stability  oscillations  of  the  channel  periodic  orbits  discussed  earlier.  We  have 
noticed  that  the  accuracy  of  the  adiabatic  approach  depends  on  the  stability  of  the  channel 
orbits  even  in  the  case  when  the  stable  regions  are  of  very  small  measure.  Although  the 
basic  structure  of  these  localized  states  remains  the  same,  the  degree  of  localization 
changes  significantly. 

The  above  semiclassical  quantization  schemes  have  largely  ignored  the  discrete 
symmetries  in  the  system.  Sinha  and  Sheorey  [43]  have  proposed  a  semiclassical 
quantization  scheme  that  incorporates  discrete  symmetries  using  quantum  input  that 
takes  cognisance  of  symmetries.  In  quartic  oscillator  the  channel  localized  states  are 
effectively  one-dimensional  states,  and  so  a  WKB-like  scaling  form  for  the  irreducible 
representation  T, 

E(T,a-N)^aQ  +  al(N  +  ^3  (22) 

has  been  applied  for  (N,  0)  manifold,  where  N  is  any  even  integer  for  A\  representation. 
With  the  data  from  a.  =  64.66  case,  it  has  been  shown  that  the  values  a0  and  a}  obtained 
from  low-lying  excited  states  predict  the  energies  of  the  highly  excited  states  in  the  region 
of  state  numbers  from  400  to  500.  The  constant  a0  is  a  function  of  a.  We  have  verified 
this  relation  for  excited  states  extending  up  to  2000  states  for  different  values  of  a.  Again, 
there  is  a  one-to-one  correspondence  between  the  predicted  energy  values  and  the 
quantum  energy  levels.  In  fact  the  adiabatic  approach  also  predicts  a  formula  very  similar 
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to  the  above  one  and  is  capable  of  giving  a  functional  form  to  GO-  More  detailed  results  on 
this  approach  will  be  published  elsewhere. 

It  should  again  be  emphasized  that  only  a  subclass  of  scarred  states,  the  channel 
localized  states  and  ring  states  have  thus  far  been  amenable  to  any  kind  of  approximate 
description.  The  plethora  of  various  structures  exhibited  by  eigenfunctions  of  quartic 
oscillators  still  remain  a  mystery.  As  of  today,  another  outstanding  problem  is  the 
construction  of  adiabatic  wavefunctions  that  would  resemble  the  actual  quantum 
wavefunctions. 


7.  Localization  and  stability  of  channel  orbits 

Since  it  is  well  known  that  the  underlying  periodic  orbits  play  a  crucial  role  in  the 
semiclassical  localization,  the  next  aspect  to  be  studied  is  the  effect  of  bifurcation  and  the 
stability  of  these  orbits  on  localization.  The  bounded  stability  oscillations  allow  us  to 
determine  the  values  of  a  at  which  the  channel  orbit  undergoes  bifurcations.  The  value 
a  =  90.00  is  one  such  point  at  which  the  channel  orbit  is  about  to  lose  stability  through  a 
pitchfork  bifurcation.  We  computed  the  eigenvalues  and  eigenfunctions  for  the  first  2000 
states  of  the  quartic  oscillator  for  values  of  a.  ranging  from  88.00  to  96.00. 

The  information  entropy  in  the  unperturbed  basis  states  for  all  these  eigenfunctions 
was  computed.  The  preliminary  results  are  indicative  of  a  correlation  between  the 
stability  of  the  channel  orbit  and  degree  of  eigenfunction  localization.  Higher  the  stability 
of  the  channel  orbit,  the  localization  is  more  compact  and  clean.  For  instance,  figure  5 
shows  a  series  of  highly  excited  eigenfunctions  in  configuration  and  unperturbed  space 
for  various  values  of  a.  It  is  clear  from  this  figure  that  comparing  the  eigenfunctions  at 
a  =  64.66  and  a.  —  90.00,  the  latter  is  highly  compact  and  almost  collapses  on  the 
underlying  periodic  orbit.  Again  the  structure  of  unperturbed  eigenfunction  for 
a  =  90.00,  namely  the  eigenvector  coefficients  reveal  that  most  of  the  components  are 
almost  zero,  except  for  a  few  dominant  ones  which  go  up  to  build  the  wavefunction. 
However  at  a  =  64.66,  the  eigenfunction  in  unperturbed  space  is  spread  over  many  more 
basis  states  than  for  a  =  90.00. 

The  information  entropy  measure  also  reflects  this  trend  as  we  go  across  a  =  90.00. 
We  calculated  averaged  information  entropy  for  a  particular  a  value  by  taking  the  mean 
of  information  entropies, 

V1    *?a 

/?   \  —  ^-iff     cr 

(Sa}-~^~ 

for  a  group  of  m  localized  states  represented  by  a  within  some  energy  range.  Strictly 
speaking  this  quantity  may  not  be  physically  meaningful,  but  it  is  a  coarse-grained  value 
that  would  reveal  the  average  trend  in  the  degree  of  localization.  The  participation  ratio 
also  shows  an  identical  trend.  Figure  6  shows  the  average  entropy  plotted  against  a  and  it 
has  a  clean  dip  at  a  =  91.00.  This  is  when  the  channel  orbit  has  just  begun  to  lose 
stability.  The  dip  at  a  —  91.00  seems  to  justify  the  remark  that  the  influence  of  a  new 
orbit  on  an  eigenstate  is  not  clear  immediately  after  the  bifurcation  [44].  A  careful 
analysis  is  still  needed  to  uncover  the  changes  in  eigenfunction  structures  that  accompany 
bifurcations  of  the  underlying  periodic  orbits.  This  could  also  mean  that  though  the 
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Figure  5.  Configuration  space  intensities  for  four  localized  states  for  (a)  a=64.66, 
(b)  88.00,  (c)  90.00  and  (d)  96.00,  all  with  the  same  principal  unperturbed  state 
(252, 0).  The  colour  code  for  all  the  images  is  such  that  red  is  the  maximum,  blue  is 
low  and  black  is  zero. 

channel  orbit  plays  an  important  role  in  channel  localization,  the  contribution  of  the  other 
orbits  in  the  vicinity  cannot  be  ignored. 

All  these  results  in  totality  imply  a  strong  correlation  between  the  stability  of  channel 
orbits  and  degree  of  localization,  and  also  are  indicative  of  the  dominant  role  played  by 
the  channel  orbit  although  other  nearby  orbits  do  contribute  to  localization  along  the 
channel.  In  a  similar  spirit,  though  not  quite  the  same,  Atkins  and  Ezra  [44]  have 
qualitatively  studied  the  effect  of  pitchfork  bifurcations  of  the  diagonal  classical  periodic 
orbit  on  the  eigenstates  of  the  quartic  oscillator.  This  orbit  is  always  unstable  beyond 
ft  =  6.0.  The  bifurcation  of  the  orbit  is  shown  to  be  associated  with  the  spread  in  the 
eigenstate  in  configuration  space  transverse  to  the  45°  orbit  considered.  However,  they 
have  confined  themselves  to  values  of  a  up  to  15  in  the  low-energy  spectrum  only. 
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Figure  6.     The  plot  of  average  information  entropy  as  a  function  of  a. 


8.  Conclusions 

A  comprehensive  theory  encompassing  all  the  nuances  of  localization  in  quantum 
mechanics  is  still  an  open  problem.  Our  extensive  studies  on  the  eigenfunctions  of  the 
two  dimensional  quartic  oscillator  potential  shows  the  existence  of  various  scarred  states, 
in  particular  the  channel  localized  states  at  least  up  to  about  2000  levels  from  the  ground 
state.  We  expect  that  these  will  continue  to  exist  throughout  the  spectrum.  The  channel 
localized  states  alone  can  be  identified  by  the  information  entropy  method,  or  other 
measures  of  localization  such  as  the  participation  ratio. 

We  have  studied  localization  in  a  time  independent  classically  chaotic  system,  and  is 
quite  distinct  from  the  localization  in  kicked  systems  such  as  the  rotor.  The  regions  of 
parameters  we  have  studied  the  oscillator  in  are  ones  of  extreme  chaos,  although  an 
important  periodic  orbit  scarring  the  set  of  localized  states  we  study  oscillates  in  stability. 
We  have  numerically  studied  the  effect  of  these  oscillations  and  concomitant  classical 
bifurcations  on  eigenfunction  localization,  and  found  that  such  measures  as  information 
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Experiments  on  quantum  chaos  using  microwave  cavities: 
Results  for  the  pseudo-integrable  L-billiard 
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Abstract.  We  describe  microwave  experiments  used  to  study  billiard  geometries  as  model 
problems  of  non-integrability  in  quantum  or  wave  mechanics.  The  experiments  can  study  arbitrary 
2-D  geometries,  including  chaotic  and  even  disordered  billiards.  Detailed  results  on  an  L-shaped 
pseudo-integrable  billiard  are  discussed  as  an  example.  The  eigenvalue  statistics  are  well-described 
by  empirical  formulae  incorporating  the  fraction  of  phase  space  that  is  non-integrable.  The 
eigenfunctions  are.  directly  measured,  and  their  statistical  properties  are  shown  to  be  influenced  by 
non-isolated  periodic  orbits,  similar  to  that  for  the  chaotic  Sinai  billiard.  These  periodic  orbits  are 
directly  observed  in  the  Fourier  transform  of  the  eigenvalue  spectrum. 

Keywords.  Quantum  chaos;  billiards;  eigenvalues;  eigenfunctions;  random  matrix  theory; 
pseudo-integrable. 
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1.  Introduction 

The  issue  of  non-integrability  in  quantum  mechanics  is  relevant  in  a  variety  of  physical 
systems.  Indeed  initial  developments  were  in  nuclear  physics,  where  random  matrix 
theory  (RMT)  was  developed  to  account  for  the  statistical  properties  of  the  eigenvalue 
spectra  of  heavy  nuclei  [1,  2].  Similar  and  hence  universal  behavior  has  been  shown 
numerically  in  low  dimensional  billiard  systems  [3].  This  universal  behavior  is  not  just 
confined  to  quantum  mechanical  systems  such  as  atoms,  electrons  and  nuclei,  but  is  also 
seen  in  wave  mechanical  systems  like  acoustic  and  electromagnetic  systems.  The 
common  feature  in  all  these  apparently  disparate  systems  is  that  the  corresponding  ray  or 
particle  dynamics  is  chaotic.  Significant  developments  in  semiclassical  theory  have 
shown  the  importance  of  classical  structures,  such  as  periodic  orbits,  in  organizing 
quantum  behavior  [4-7].  Although  many  experimental  systems  exhibit  quantum  or  wave 
chaos,  few  of  these  yield  properties  which  are  directly  comparable  to  analytical  results. 
Hence  theoretical  developments  have  relied  on  numerical  simulations  to  support  their 
claims.  Although  computational  tools  are  rapidly  increasing  in  their  applicability,  there 
are  nevertheless  many  important  problems  which  are  still  not  amenable  to  reliable 
numerical  solutions.  In  this  paper  we  describe  an  experimental  approach  which  leads  to 
important  insights  to  the  role  of  classical  chaos  and  even  disorder  in  quantum  or  wave 
mechanics.  This  approach,  which  utilizes  microwave  cavities,  is  best  suited  to  the  study 
of  "billiard"  problems  that  examine  the  quantum  properties  of  a  particle  in  model  2-D 
geometries.  In  the  past  five  years  these  experiments  have  resulted  in  several  important 
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contributions  to  the  Held  or  quantum  cnaos.   me  expenments  nave  lea  to  me  rirst 

direct  observation  of  scarred  eigenfunctions   [8].  The   statistical  properties   of  the 

eigenfunctions  for  the  chaotic  Sinai-stadium  geometry  were  seen  to  be  in  good  agreement 

with  the  universal  Gaussian  orthogonal  ensemble  (GOE)  of  RMT  [9].  It  was  further 

found  that  chaotic  geometries  with  non-isolated  periodic  orbits  like  the  Sinai  billiard 

show  deviation  from  universal  behavior.  The  eigenfunctions  of  disordered  geometries 

(cavities  with  many  finite  sized  scatterers)  have  shown  the  effects  of  localization  on 

their  statistical  properties  [9].  This  has  stimulated  interesting  theoretical  work  on  the 

spatial  distribution  of  eigenfunctions  using  supersymmetric  non-linear  cr-models  [10-12] 

and  to  date  remain  the  only  experiments  in  which  these  theories  can  be  tested.  The 

eigenvalue  statistics  have  also  been  analyzed  and  shown  to  be  in  direct  agreement  with 

GOE  for  the  Sinai-stadium  [13].  The  experiments  have  also  led  to  the  experimental 

verification  [14]  of  the  theorem  of  isospectral  domains  [15]  and  resolved  an  important 

issue  of  degeneracies.  Periodic  orbits  were  observed  in  both  closed  and  open  billiard 

systems  by  Fourier  transforming  the  eigenvalue  and  transmission  spectrum  respectively 

[16, 17].  A  consistent  theme  of  this  work  has  been  the  gleaning  of  universal  behavior  and 

deviations  from  such  universal  behavior,  by  exploiting  the  ability  to  study  controlled 

variations  in  geometry  afforded  by  the  microwave  experiments.  In  this  paper,  we  discuss 

in  detail  the  eigenvalue  and  eigenfunction  properties  of  an  important  class  of  billiards 

which  are  pseudo-integrable.  This  example  is  important  because  it  is  an  intermediate 

system  that  is  neither  integrable  nor  fully  chaotic.  The  theoretical  description  of 

intermediate  systems  is  challenging  and  it  is  hoped  that  these  quantitative  experimental 

results  will  lead  to  rigorous  tests  for  theory.  We  begin  with  an  introduction  of  the 

experimental  techniques. 


2.  Experimental  techniques 

The  experiments  utilize  "thin"  microwave  cavities,  in  which  below  a  cutoff  frequency, 
the  2-D  scalar  limit  of  the  Maxwell-Helmholtz  wave  equation  applies,  and  the 
correspondence  to  the  Schrodinger  equation  is  exact.  For  a  cavity  of  arbitrary  cross- 
section  in  the  x  —  y  plane  but  uniform  along  the  z-  direction,  the  z-component  of  the  wave 
vector  is  quantized  as  kz  =  prc/d,  where  p  is  an  integer  and  d  is  the  cavity  thickness  along 
the  z-  axis.  Maxwell's  equations  then  reduce  to 

[V2  +  (£2-(^T/J)2]{£2A}=0,  (1) 

where  V  is  the  2-D  Laplacian  operator.  Of  course  the  EM  field  is  a  vector  field,  however 
a  special  case  occurs  for  the  transverse  magnetic  (TM)  modes  for  which  Bz  =  0,  and 
further  when  p  =  0.  In  this  limit,  (1)  reduces  to  the  Schrodinger  equation  (V2  +  k2} 
^  =  0,  where  x&< — >EZ.  This  limit  can  be  experimentally  achieved  by  confining  measure- 
ments to  the  frequency  range  f  <fc  =  cfd.  For  d  =  6  mm,  the  cutoff  frequency 
fc  =  25  GHz,  and  for  all  lower  frequencies,  the  correspondence  between  the  Schrodinger 
and  Maxwell  wave  equations  is  exact  in  the  thin  2-D  cavities.  Cavities  are  fabricated  with 
the  2-dimensional  cross-section  cut  out  from  6mm  thick  copper  sheets,  and  placed 
between  two  copperplates.  Coupling  to  the  cavities  is  accomplished  by  loops  terminating 
coaxial  cables  -  the  loops  couple  to  the  microwave  magnetic  fields  at  the  perimeter.  The 
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cavity  response  is  examined  using  an  HP  8510  Network  Analyzer  (ANA).  The  eigenvalue 
spectrum  is  obtained  from  the  resonances  observed  in  the  transmission  spectrum,  with  the 
energy  eigenvalues  En  obtained  from  the  resonant  eigenfrequencies  /„  using  En  —f£.  By 
using  appropriate  care,  in  particular  coupling  at  several  locations,  it  is  possible  to  observe 
the  first  1000  or  so  levels  of  the  spectrum,  without  any  missed  levels.  Thus  although  the 
finite  conductivity  of  the  copper  walls  leads  to  broadened  resonances,  it  is  not  necessary 
to  use  superconducting  cavities  in  order  to  carry  out  reliable  experiments  on  eigenvalue 
statistics.  One  of  the  important  and  unique  features  of  the  experiments  is  the  ability  to 
directly  measure  eigenfunctions.  This  method,  first  devised  by  one  of  us  [8,  18],  utilizes 
the  perturbation  of  the  cavity  resonance  by  a  small  metallic  ball.  Placing  the  ball  at 
coordinates  (x,  y]  in  the  cavity  leads  to  a  shift  of  the  resonance  frequency  A/,,  (jc,  y)  = 
—/3'3?f'l(x,y),  from  which  the  eigenfunction  %,  can  be  directly  determined  by  measuring 
the  frequency  shift  Afn(x,y}  using  the  ANA.  Further  details  about  the  method  can  be 
found  in  [18]. 

3.  Pseudo-integrable  geometries 

An  important  class  of  billiard  systems  are  the  pseudo-integrable  geometries  as  they  are 
the  closest  step  away  from  integrable  systems.  Most  polygonal  billiards,  with  the 
exception  of  rectangles,  30°-60°-90°  angle  triangles,  and  the  equilateral  triangle  are 
pseudo-integrable.  The  non-integrability  arises  at  the  vertex,  when  it  is  not  90°, 
60°,  45°,  30°.  Trajectories  diverge  at  this  point  and  lead  the  classical  phase  to  be  a  surface 
of  genus  greater  than  one.  Since  the  classical  phase  space  has  to  be  a  torus  for  the 
motion  to  be  integrable,  a  higher  genus  surface  ensures  non-integrability.  A  formula 
for  relating  the  genus,  g,  to  the  angles  of  the  vertex  has  been  derived  and  is  given  as 
[19,20]: 


where  «,-  and  m/  are  such  that  a/  =  m/7r/n(-  are  the  internal  vertex  angles,  and  N  is  the 
smallest  of  the  integer  set  nf.  A  surface  of  genus  2  which  corresponds  to  a  pseudo- 
integrable  system  is  the  L-shaped  billliard  [21,22].  However  there  are  similarities  to 
integrable  systems,  as  the  periodic  orbits  in  these  geometries  also  occur  in  families,  which 
bifurcate  at  the  vertex  which  is  not-integrable.  They  also  appear  to  have  the  same 
asymptotic  proliferation  of  periodic  orbits  [20].  However  numerical  work  has  shown  that 
the  eigenvalue  statistics  of  these  systems  show  non-Poisson  statistics.  Below  we  shall 
discuss  results  on  a  well-known  example,  the  L-shaped  billiard. 

3.1  Eigenvalue  statistics 

The  eigenvalue  statistics  of  a  pseudo-integrable  (PI)  L-shaped  cavity  was  studied  with  a 
44cm  by  21.8cm  copper  cavity  with  a  9.55cm  by  9.65cm  copper  piece  at  one  of  the 
corners.  In  the  frequency  range  0.045  GHz  to  18.497  GHz,  1000  energy  levels  were 
obtained.  This  data  also  shows  good  agreement  with  the  Weyl  formula  [23],  The  nearest 
neighbor  energy  spacing  P(s),  and  spectral  rigidity  A3(L)  were  studied  to  understand  the 
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Figure  1.  Eigenvalue  statistics  for  the  pseudo-integrable  (PI)  geometry.  (Top)  P(s] 
for  the  pseudo-integrable  geometry.  The  interpolation  to  fit  the  data  is  done  using 
eq.  (3)  with  /3  =  0.62.  (Bottom)  The  rigidity,  A3(L)  versus  L  for  the  pseudo-integrable 
geometry.  The  interpolation  is  done  using  (4)  with  j3  =  0.65. 


nature  of  the  eigenvalue  spectrum.  Even  though  the  classical  phase  space  structure  is 
closer  to  that  of  an  integrable  geometry,  level  repulsion  is  quite  evident  in  figure  1 .  The 
spectral  rigidity  AS  (L)  was  also  analyzed  and  is  also  shown  in  figure  1.  The  resulting  data 
lie  intermediate  between  the  universal  integrable  and  chaotic  curves  for  both  the 
statistical  measures  in  figure  1.  We  have  shown  elsewhere  [13]  that  the  auto-correlation 
function  [24]  has  a  correlation  hole,  as  it  goes  below  the  asymptotic  value  in  the  time 
scales  comparable  to  the  inverse  of  the  mean  energy  spacing.  However,  the  effect  is  not  as 
strong  as  in  the  chaotic  case,  and  hence  in  this  respect  also,  pseudo-integrable  geometries 
display  intermediate  behavior. 

For  intermediate  cases,  Berry  and  Robnik  [25]  proposed  a  theory  that  the  intermediate 
case  can  be  characterized  by  the  volume  of  phase  space  which  is  chaotic.  While  this 
argument  was  for  quasi-integrable  cases  that  have  a  mixed  phase  space,  we  have 
attempted  to  use  it  to  describe  the  pseudo-integrable  data. 
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P(S]  = 


(3) 


where,  ft  is  the  parameter  which  characterize  the  ratio  of  phase  space  which  is  integrable. 
The  fit  corresponding  to  eq.  (3)  is  displayed  in  figure  1,  where  the  data  is  well  described 
by  the  fit  for  s  greater  than  the  first  few  bins.  The  ft  value  obtained  is  0.62.  While  there 
are  a  few  missing  levels,  the  discrepancy  from  (3)  may  be  due  to  poor  statistics,  but  is 
more  likely  to  be  real,  indicating  that  a  more  refined  theory  is  required.  The  experimental 
spectral  rigidity  in  figure  1  is  compared  with  theory  [25,26]: 

(4) 


A3(L)  =  (L/15}(3  +  (l  -  0)  x     -n(L)  -  0.007 

2 


The  ft  value  which  approximates  the  experimental  curve  is  0.65,  in  reasonable  agreement 
with  that  obtained  using  (1).  The  auto  correlation  function  for  the  intermediate  case  was 
also  derived  by  Alhassid  and  Whelan  [24]  using  the  same  argument  of  Berry  and  Robnik. 
The  data  for  the  L-billiard  again  fits  the  expression  derived  by  them,  expression  for 
ft  =  0.65  [13].  Therefore  a  ft  of  approximately  0.65  roughly  describes  the  intermediate 
nature  of  the  data  for  the  three  different  statistical  measures.  But  such  matching  should  be 
taken  with  caution  since  the  phase  space  structure  of  pseudo-integrable  systems  is 
different  from  that  of  a  quasi-integrable  system. 

3.2  Eigenfunction  properties 

As  noted  earlier,  one  of  the  unique  features  of  the  experiments  is  the  ability  to  obtain 
directly  the  eigenfunctions.  While  there  are  at  least  a  few  studies  of  the  eigenvalues  of 
these  systems  [19,27,28],  the  eigenfunctions  have  not  been  studied  in  much  detail. 
Below  we  discuss  several  unique  features  of  the  eigenfunctions  of  the  L-billiard. 
Eigenfunction  data  were  obtained  in  the  frequency  range  0.5  GHz  and  4.5  GHz,  and  can 
be  easily  taken  into  a  higher  frequency  range.  Sample  eigenfunctions  are  displayed  in 
figure  2,  and  appear  to  have  similar  structure  as  the  eigenfunctions  of  the  Sinai  billiard  in 
the  same  frequency  range.  The  averaged  density  distribution  is  shown  in  figure  3  for 


Figure  2.  Sample  experimental  eigenfunctions  of  the  pseudo-integrable  system.  The 
corresponding  frequencies  are  3.032 GHz,  3.346  GHz,  4.079  GHz,  and  4.462  GHz. 
Dark  regions  correspond  to  high  amplitudes. 
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Figure  3.    Density  distribution  of  the  eigenfunctions  of  the  pseudo-integrable  system 


Figure  4.    (Top)  A  bouncing  ball  state  of  the  quarter  Sinai  billiard,  and  (Bottom)  its 
density  distribution. 


V 


igenfunctions  between  3.0  GHz  and  4.5  GHz.  A  useful  statistical  measure  to  characterize 
igenfunctions  is  the  density  distribution,  P(\^f],  which  gives  the  probability  of  finding  a 
articular  J7/;|2  in  the  cavity.  The  distribution  corresponding  to  the  L-billiard  is 
istinctively  different  from  an  integrable  rectangle,  and  in  fact  is  closer  to  that  of 
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Figure  5.  (Top)  Fourier  transform  of  the  eigenvalue  spectrum  of  the  L-billiard. 
(Bottom)  The  peaks  which  can  be  associated  with  periodic  orbits  are  labelled  and 
shown.  Also  shown  (middle)  are  corresponding  orbits  for  the  Sinai  billiard. 

chaotic  geometries.  For  a  rectangle,  P(|^|2)  is  truncated  at  |\I/|2  =  4.  Thus  large 
intensities  are  forbidden  in  the  rectangle,  in  contrast  to  chaotic  systems  which  obey  the 
Porter-Thomas  distribution  [2]  (see  figure  3)  that  shows  a  finite  though  exponentially 
vanishing  probability  of  finding  large  amplitudes.  We  have  demonstrated  this  for  the 
chaotic  Sinai-stadium  geometry.  Closer  examination  shows  that  the  data  for  the  L-billiard 
actually  disagrees  with  P-T  at  high  densities,  and  instead  are  in  closer  agreement  with  the 
Sinai  billiard  than  with  the  Sinai-stadium.  The  similarity  of  the  behavior  of  the  pseudo- 
integrable  system  with  the  Sinai  billiard  should  not  be  surprising,  as  both  have  a  similar 
structure  of  non-isolated  periodic  orbits.  Very  little  work  on  the  eigenfunction  properties 
has  been  done  for  pseudo-integrable  systems  [29],  and  new  theoretical  developments  will 
have  to  take  place  to  describe  experimental  data. 
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As  discussed  in  the  previous  section,  some  states  of  geometries  like  the  Sinai  and  stadium 
billiards  and  the  L-billiard,  have  structures  which  are  far  from  irregular.  In  fact  some 
show  eigenfunctions  that  have  rectangular  structure,  similar  to  the  eigenfunctions  of  a 
rectangle.  These  eigenfunctions  have  an  amplitude  distribution  with  far  less  probability  at 
large  amplitudes.  Figure  4  shows  an  example  of  a  bouncing  ball  state  of  the  Sinai  billiard 
with  \ip\2  almost  evenly  distributed  between  the  parallel  sides.  The  corresponding  P(\tp\  ) 
is  also  displayed  in  the  same  figure  that  shows  strongly  truncated  distribution.  So  when 
eigenfunctions  are  averaged  over  all  states,  some  that  are  the  bouncing  ball,  one  gets  an 
over  all  departure  from  Porter-Thomas.  Observation  of  these  small  deviations  clearly 
requires  very  sensitive  experiments  such  as  presented  here.  Recent  experiments  [30]  with 
stadium  cavities  which  probed  the  distribution  of  widths  could  not  see  this  deviation  from 
universality  due  to  lack  of  accuracy  and  statistics  [31]. 

3.4  Fourier  transforms  and  direct  observation  of  periodic  orbits 

A  very  useful  means  to  directly  observe  and  even  measure  the  shortest  periodic  orbits 
(PO)  is  by  taking  the  Fourier  transform  (FT)  of  the  measured  eigenvalue  spectrum.  The 
FT  is  shown  in  figure  5,  and  has  several  peaks,  which  correspond  very  well  with  the  PO 
shown  in  figure  5.  Note  that  only  the  non-isolated  PO,  i.e.  bouncing  ball  orbits,  are  clearly 
present.  The  strength  of  a  given  PO  in  the  FT  is  likely  to  be  related  to  its  area  of  stability 
[32,22].  It  appears  that  in  order  to  manifest  themselves  in  the  FT  of  the  eigenvalue 
spectrum,  the  relevant  PO  must  have  sufficient  phase  space  areas  of  stability.  Experiments 
on  the  Sinai  billiard  also  show  similar  peaks  in  the  FT  since  the  non-isolated  PO  are 
similar  in  both  cases  (see  figure  5). 

4.  Conclusion 

The  experiments  described  in  this  paper  represent  a  unique  and  novel  approach  to 
studying  model  problems  in  quantum  or  wave  chaos.  We  have  examined  the  behavior  of  a 
system  that  is  intermediate  between  integrable  and  chaotic.  The  statistical  properties  of 
the  eigenvalues  and-eigenfunctions  show  intermediate  behavior  between  Poisson  and  the 
Gaussian  orthonormal  ensemble  of  random  matrix  theory.  Comparisons  with  existing 
models  which  examine  the  quasi-integrable  systems  shows  reasonable  agreement  but  also 
point  to  a  need  for  a  more  refined  theory  for  pseudo-integrable  systems.  Our  results 
demonstrate  that  powerful  insights  can  be  obtained  by  a  judicious  combination  of 
experiment  and  comparison  with  theory. 
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Abstract.  I  will  show  how  aspects  of  quantum  chaology  are  relevant  even  in  a  seemingly  well 
understood  quantum  phenomenon  like  the  photoelectric  effect.  This  example  together  with  recent 
experiments  in  atom  optics  are  used  to  define  and  discuss  the  larger  questions,  recent  progress  made 
by  us  in  resolving  some  of  these  issues  and  future  directions. 
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1.  Introduction 

In  his  1987  Bakerian  lecture  to  the  Royal  Society  in  London,  M  V  Berry  defined  quantum 
chaology  as  "the  study  of  semiclassical,  but  nonclassical,  behaviour  characteristic  of 
systems  whose  classical  motion  exhibits  chaos"  [1].  For  more  than  two  decades  prior  to 
this,  work  on  systems  which  exhibited  such  behaviour  had  been  ongoing  under  the 
nominal  definition  of  'quantum  chaos'  [2].  Unfortunately,  the  very  nature  of  quantum 
mechanics  precludes  many  of  the  defining  characteristics  of  classical  chaos,  making  this 
latter  definition  burdensome.  In  fact  it  has  even  led  some  researchers  to  ignore  the 
unquestionable  fact  that  there  are  quantifiable  changes  in  the  quantum  dynamics  when  the 
limiting  classical  dynamics  exhibits  chaos,  as  compared  with  a  nonchaotic  counterpart. 

In  this  article,  I  shall  illustrate  that  aspects  of  these  changes  are  readily  observed  even 
in  a  simple  quantum  paradigm  -  the  photoelectric  effect.  The  fact  that  this  phenomenon 
embodies  wave-particle  duality  makes  it  ideal  to  introduce  and  discuss  quantum-classical 
correspondence  in  the  larger  context  of  non-integrable  systems.  I  will  then  consider 
another  new  experimental  system  [3]  in  atom  optics  which  facilitates  the  realization  of 
theoretical  paradigms  like  the  quantum  5-kicked  rotor  and  provides  an  example  of  a 
different  manifestation  of  quantum  chaology.  Proceeding  from  these  illustrations,  I  shall 
define  the  larger  picture  in  the  study  of  quantum  dynamics  with  classically  chaotic  limit 
and  briefly  discuss  our  work,  in  collaboration  with  Rainer  Scharf,  in  integrating  all  the 
relevant  issues  within  the  framework  of  a  single  model  system.  Concluding  remarks 
address  ongoing  directions  with  a  view  on  future  experiments  in  atom  optics. 

Most  of  us  remember  the  photoelectric  effect  as  the  emission  of  electrons  when  a 
surface  is  irradiated  with  light.  The  threshold  is  defined  by  the  condition  hv  =  W  where  v 
is  the  frequency  of  the  light  and  W  a  characteristic  binding  energy  for  the  electron.  It  was 
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soon  realized  that  a  generalized  multiphoton  threshold  N^hv  —  W  was  also  possible 
though  the  intensity  of  the  light  had  to  be  considerably  higher  with  increasing  Afth-  To  put  I 

this  principle  to  practice  required  the  advent  of  the  laser.  Atoms  and  molecules  became  :; 

the  targets  of  choice,  W  was  taken  to  be  the  ionization  or  dissociation  energy  and  the  age  ?/ 

of  experimental  multiphoton  physics  was  born.  Since  that  time,  laser  power  has  rapidly 
gone  through  the  kilo-,  mega-,  giga-  and  tera-  prefixes  and  now  borders  on  the  peta-Watt 
regime.  This  increase  in  intensity  has  brought  with  it  new  and  manifestly  non-perturbative 
phenomena  such  as  above-threshold  ionization  (where  the  electron  absorbs  more  photons 
than  it  needs  just  for  the  ionization  threshold)  and  harmonic  generation  (where  the 
irradiated  system  behaves  as  a  driven  anharmonic  oscillator  and  scatters  photons  at 
frequencies  which  are  large  multiples  of  the  incident  frequency).  These  have  led  to  a 
revision  of  the  traditional  analysis  of  multiphoton  physics  [4].  Most  importantly  from  our 
perspective,  it  has  brought  dynamics  to  the  foreground. 

Despite  these  changes  in  our  understanding  of  the  photoeffect,  the  one  feature  which 
had  not  been  challenged  was  the  fact  that  the  probability  of  freeing  the  electron  was  still 
expected  to  increase  with  increasing  intensity  of  the  light.  In  other  words,  the  lifetime  of 
the  quantum  state  was  expected  to  decrease  with  growing  intensity. 

The  phenomenon  of  'stabilization'  was  reported  in  numerical  experiments  [5]  which 
considered  a  model  Hamiltonian,  for  a  1-D  atom,  of  the  form 

H(X,p,t}=p2/2-l/v/l+x2  +  XFcos(ut  +  (j)}  ,(a.u.)  (1) 

where  F,  w,  and  0  are  the  field  strength,  frequency,  and  phase  of  the  oscillating  electric 
field.  This  1-D  model  potential  asymptotes  to  the  Coulombic  potential  for  large  x,  but 
eliminates  the  singular  behaviour  at  the  origin.  On  solving  the  corresponding  time- 
dependent  Schrodinger  equation,  it  was  found  that  on  increasing  F,  the  probability  for 
freeing  the  electron  decreased  dramatically,  for  fixed  interaction  time.  The  wavefunction 
at  this  time  also  exhibited  peaks  which  were  consistent  with  an  effective  double  well 
potential  [5].  Both  'stabilization'  and  the  distinct  'dichotomous'  form  of  the  residual 
wavefunction  were  soon  confirmed  in  more  realistic  3-D  simulations  [6].  We  shall  show 
how  stabilization  provides  an  illustration  of  effects  in  'quantum  chaology'  which  can  then 
be  used  to  predict  the  extent  of  stabilization  and  the  shape  of  the  wavefunction  [7]. 

It  is  true  that  advances  in  classical  nonlinear  dynamics  and  chaos  have  been 
successfully  applied  to  strongly  coupled  quantum  systems  like  the  photo-absorption 
spectrum  of  Rydberg  atoms  in  strong  magnetic  fields  [8],  microwave  ionization  of  highly 
excited  hydrogen  atoms  [9],  and  the  excitation  of  doubly  excited  states  of  helium  atoms 
[10].  However,  the  fact  that  the  atoms  are  initially  in  the  ground  state,  far  from  the 
classical-quantum  correspondence  limit,  makes  the  application  of  these  ideas  to  the 
remarkable  phenomenon  of  'stabilization'  in  super-intense  laser  fields  still  more  counter- 
intuitive [11]. 

2.  Stabilization  and  scarring  of  wavefunctions 

In  the  high-field  limit  (F  >  1  atomic  unit  meaning  that  it  is  greater  than  the  binding 
potential)  the  smoothed  Coulomb  potential  in  (1)  can  be  treated  as  a  perturbation  on  the 
regular,  classical  motion  of  a  free  electron  in  an  oscillating  field.  So,  let  us  first  consider 
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the  Hamiltoman  for  the  one-dimensional  motion  or  a  free  electron  in  the  oscillating 
electric  field  E  =  Fcos(otf0), 

HQ(x,  v,  t)  =  v2/2  -  xF  cos(ut  +  0)  .  (2) 

The  classical  equations  of  motion  can  be  integrated  exactly  and  the  solution  for  the 
position  of  the  electron  as  a  function  of  time 

F  \         F         1 

x(t)  =XQ  --  o  [cos  (ujt  +  4>]  -  cos  0]  +    v0  ---  sin  0  f  ,  (3) 

<jj^  L         ^         J 

describes  a  particle  that  oscillates  back  and  forth  in  the  electric  field  with  frequency 
v  —  u)/2ir  and  amplitude  a  —  F/u2,  and  drifts  away  from  its  initial  position,  XQ,  with 
velocity 

F 

Vd  =  VQ  --  sin  <f)  .  (4) 

uj 

(Note  that  even  if  the  initial  velocity,  VQ,  is  zero,  a  nonzero  initial  phase,  <j)  of  the 
oscillating  field  can  lead  to  a  large  drift  velocity,  Vd  =  —  (F/u)  sin</>.)  The  classical 
motion  is  considerably  simplified  if  we  consider  variables  in  the  oscillating  frame 


q  =  x  +  a  [cos  (ut  +  4>]  —  cos  </>] 


=  v- 


F 

—  sin  (ujt  + 

UJ 


(5) 


In  the  absence  of  any  other  forces  p  =  va  is  constant  and  the  oscillation  center  drifts  at  the 
constant  velocity  q(t)  =  x(0]  +  v^t.  In  particular,  if  Vd.  =  0  (eg.,  VQ  =  0  and  0  =  0  or  TT), 
then  q(t]  =  XQ  is  constant.  On  adding  a  potential  V(x],  we  have  in  the  transformed  frame 

H(q,p,  t]  =  p2/2  +  V(q  —  a  [cos  (ujt  +  </>)  -  cos  0])  ,  (6) 

which  can  be  conveniently  expanded  in  a  discrete  Fourier  series 

oo 

H(q,p,t}=p2/2  +  £  VnMcf"*,  (7) 

n=—oo 

as  the  perturbation  is. periodic  with  period  T  =  2^/u.  The  Fourier  coefficient  Vo(tf)  is 
simply  the  time-averaged  potential  which  is  all  that  is  required  in  the  high-frequency 
approximation.  Away  from  this  limit,  the  other  coefficients  Vn(q)  contain  valuable 
dynamical  information.  The  ansatz  that  Vn(q)  «  VQ(^)  for  a  large  number  of  n  provides 
the  other  extreme.  In  this  case,  the  Hamiltonian  in  the  oscillating  frame  reduces  to 


=  P2/2  +  Vbfe)    T  6(t  -nT  +  07/27T),  (8) 


using  the  Poisson  sum  rule.  The  phase  4>  can  be  set  to  zero  with  no  loss  of  generality  and 
integrating  the  classical  equations  of  motion  over  one  period  T  leads  to  the  nonlinear, 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  H) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  471 


-60 


Figure  1.  Kick  potential  as  a  function  of  q  for  F  =  5.0  (a.u.)  and  cj  =  0.52  (a.u.),  for 
both  exact  and  approximate  (evaluated  only  in  the  vicinity  of  the  classical  turning 
points)  kicking  terms.  The  curves  clearly  show  that  a  simple  constant  shift  is  the  only 
difference.  Note  that  F  —  5  means  that  the  external  field  is  five  times  larger  than  the 
binding  field. 


area-preserving  map, 

qn+\  =  qn 


=pn  +  G(qn], 


(9) 


where  qn  and  pn  are  the  positions  and  momenta  of  the  electron  in  the  oscillating  frame, 
evaluated  once  every  period  T  —  2-n/u  of  the  field.  The  total  impulse  of  each  kick  G(qn) 
is  the  time-integral  of  the  force  over  one  period  and  can  be  simply  expressed  in  terms  of 
the  space  derivative  of  the  time-integral  of  the  oscillating  potential, 


_d_ 
dq 


(10) 


A  more  physical  motivation  for  the  map  as  well  as  the  explicit  conditions  for  its  validity 
are  discussed  in  ref.  [7].  Let  us  now  consider  the  smoothed  Coulomb  potential 


which  in  the  moving  frame  is 


V(q,t)  =  - 


-  I]) 


(11) 


(12) 
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Figure  2.  Comparison  of  the  impulse  generated  over  a  single  period  from  the 
differential  dynamics  (crosses)  and  the  single  valued  function  G  for  the  map.  The 
parameter  a  is  (a)  0.37,  (b)  3.7,  (c)  18.5  and  (d)  295.9.  (d)  displays  only  a  small 
portion  of  the  total  range  so  as  to  emphasize  the  maximum  discrepancy. 


where  Z  is  the  nuclear  charge  and  a  eliminates  the  x~{  singularity  at  the  origin  and 
also  defines  a  characteristic  length  scale.  The  time-average  of  V(q,  t)  can  be  expressed 
exactly  in  terms  of  elliptic  integrals  though  for  our  purposes  it  is  enough  to  note 
that,  as  shown  in  figure  1,  it  is  a  double- welled  potential  with  minima  near  the 
classical  turning  points  at  0  and  -2a  for  the  free  electron  in  the  oscillating  field. 
The  map  approximates  the  impulse  only  at  these  turning  points  and  results  in 
G(<ln)  =  kF(qn)  where  k  =  y/8/F  and  F(x)  involves  complete  elliptic  integrals  of 
the  first  and  second  kinds.  The  effective  potential  in  the  map  approximation,  also 
shown  in  figure  1,  is  simply  shifted  relative  to  the  exact  potential.  The  product  K  =  kT 
is  the  stochasticity  parameter  and  determines  the  nature  (locally/globally/not  chaotic) 
of  the  phase  space  of  classical  solutions.  Note  that  both  the  limits  of  high  field  F 
and  high  frequency  a;  lead  to  small  K  which  corresponds  to  'linear'  or  integrable 
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Figure  3.  Poincare  section  for  F  =  5.0  and  w  =  0.52  which  is  typical  for  large 
values  of  the  stochasticity  parameter.  As  discussed  in  the  text,  no  stable  (island) 
structures  are  expected  and  the  classical  dynamics  is  dominated  by  rapidly  ionizing 
orbits. 


The  validity  of  the  map  approximation  can  be  checked  a  posteriori  by  comparing  the 
magnitude  of  the  impulses  experienced  by  the  electron  after  each  period  of  the  oscillating 
field  with  those  predicted  by  the  single-valued  function  in  the  map.  Figure  2  contrasts  the 
impulse  computed  from  the  differential  dynamics  with  G  for  a  range  of  a.  It  is  clear  that 
the  map  becomes  an  effective  description  of  the  true  dynamics  with  increasing  a. 

Having  established  the  map  as  a  good  approximation  to  .ihe  true  classical  dynamics, 
we  can  now  use  a  general  stability  analysis  of  trajectories  to  predict  the  classical 
parameter  bounds  for  stabilization.  This  is  also  readily  seen  from  the  Poincare  surfaces  of 
section  which  provide  an  overview  of  the  classical  dynamics.  For  example,  figure  3 
considers  field  parameters  for  which  almost  all  the  classical  electron  orbits  are  chaotic 
and  ionizing.  By  contrast  figure  4  shows  a  case  where  many  initial  conditions  starting 
out  near  the  nucleus  are  regular  and  do  not  ionize,  tracing  out  regular  island  structures 
in  the  classical  phase  space.  Thus,  classical  prediction  of  stabilization  is  determined 
merely  by  the  existence  or  otherwise  of  stable  regions  in  phase  space. 

The  clearest  representation  of  the  parameter  values  for  which  regular  regions  exist 
in  the  classical  phase  space  is  a  stability  diagram  in  the  space  of  F  -  w,  shown  in 
figure  5.  The  dotted  line  divides  the  overall  parameter  space  into  two  regions  where 
a  <  I  (on  the  right  side)  and  a  >  1  (on  the  left  side).  When  a  <  1  the  map  potential  is 
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Figure  4.  Poincare  section  for  parameters  u  =  1.34,  F  —  5.0  of  showing  the 
presence  of  stable  classical  islands.  The  occurrence  of  these  stable  regions  is  typical 
for  smaller  values  of  the  stochasticity  parameter. 


a  single  well  centered  on  a  fixed  point  near  —a.  Increasing  a  leads  to  the  effective 
double-well  potential  shown  earlier  with  two  elliptic  (stable)  and  one  hyperbolic 
(unstable)  fixed  points.  The  elliptic  fixed  points  become  unstable  for  parameter  values 
below  an  analytically  predicted  stability  border  (solid  line)  after  which  no  classical 
mechanism  for  stabilization  exists.  The  predicted  boundary  is  consistent  with  the 
existence  of  stable  structures  in  both  map  dynamics  (indicated  by  the  dotted  region) 
and  the  full  differential  dynamics  (indicated  by  the  open  circles),  as  seen  from  figure  5. 
Note  the  region  of  low  F  and  cj  where  map  and  differential  dynamics  disagree.  In 
this  regime,  the  relevant  time  scale  is  the  internal  period  rather  than  the  external  field 
period  we  use.  This  results  in  a  different  stroboscopic  approximation  called  the  Kepler 
map  [12]. 

We  consider  three  parameter  sets  corresponding  to  the  points  A,  B  and  C  in  the  stability 
diagram.  Purely  classical  stability  arguments  would  suggest  no  stabilization  for  A  and  a 
larger  fraction  for  C  as  compared  with  B.  C  is  also  closer  to  the  line  a  =  1  which  means  a 
near  single-well  effective  potential.  As  I  shall  now  show,  the  quantum  determination  of 
stability  is  considerably  more  complicated.  For  the  quantum  dynamics,  we  simply 
integrate  the  time-dependent  Schrodinger  equation  (in  atomic  units) 


(13) 


on  a  space-time  grid  using  the  standard  two-sweep  algorithm  [13].  The  initial  condition 
is  the  ground  state  of  the  undriven  potential.  Given  our  picture  of  the  stabilization 
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Figure  5.  The  classical  stability  of  the  map  (dots)  and  the  full  differential  dynamics 
(circles)  was  assessed  by  advancing  the .  equations  of  motion  200  periods  of  the 
perturbation.  If  the  trajectory  returned  to  the  vicinity  of  the  nucleus  a  point  was  plotted 
in  the  (w  -  F)  parameter  plane.  The  dividing  line  a  =  1 .0  is  indicated  by  the  dotted 
line  while  the  stability  boundary  is  shown  by  the  solid  line. 


mechanism  as  resulting  from  a  free-particle  interacting  periodically  with  trapping  centers 
(in  this  case  a  double  well  potential),  a  simple  measure  of  ionization  is  the  fraction  of 
wavefunction  that  is  outside  the  "interaction  volume".  A  smaller  ionized  fraction  means 
increased  stabilization.  As  the  maximum  spacing  of  the  double-wells  (averaging  over  an 
external  period)  is  4a,  a  suitable  choice  for  this  interaction  volume  (in  one  dimension)  is 
— 4a  <x<  4a.  Thus,  a  definition  of  ionization  as 


\x\>4a 


(14) 


is  adequate  to  establish  when  stabilization  is  significant. 

The  results  of  the  quantum  simulations  for  cases  A,  B  and  C  are  shown  in  the  lower 
two  panels  in  figure  6.  The  corresponding  classical  phase  portraits  shown  reinforce  our 
inferences  from  the  stability  diagram;  no  stabilization  for  A  while  larger  islands  exist  for 
C  as  compared  with  B.  However,  the  ionized  fraction  as  calculated  from  the  quantum 
evolution  supports  the  contrary  result  that  there  is  more  stabilization  for  A  as  compared 
with  B.  Case  C  is  the  most  stable  which  is  at  least  consistent  with  the  classical  prediction. 
What  is  the  origin  of  this  discrepancy?. 
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Figure  6.  Classical  phase  portraits  (upper  panel),  residual  quantum  wavefunctions 
(middle  panel),  and  ionization  probability  versus  time  (in  units  of  the  period  T) 
(bottom  panel).  The  parameters  are  (A)  F  =  5.0,  u  =  0.52;  (B)  F  =  20,  uj  =  1.04; 
and  (C)  F  —  10  and  u  —  2.0.  Note  that  the  peak  structure  of  the  final  wavefunction 
reflects  both  stable  and  unstable  classical  fixed  points.  For  case  C,  the  peaks  are 
beginning  to  coalesce  reflecting  the  approach  of  the  single-well  effective  potential 
(see  text). 


The  resolution  comes  from  the  fact  that  the  classical  predictions  were  based  entirely 
on  the  stability  analysis  of  low-order  fixed  points.  Though  a  fixed  point  may  be 
unstable,  there  remains  an  important  dynamically  invariant,  but  still  unstable,  classical 
structure  -  the  homoclinic  tangle.  Homoclinic  tangles  corresponding  to  cases  A  and  B 
are  shown  in  figures  7  and  8  respectively.  The  tangle  emanates  from  unstable  fixed 
points  and  its  complicated  appearance  is  simply  a  consequence  of  the  area-preserving 
constraint,  of  Hamiltonian  evolution.  The  unstable  tangle  also  provides  support  for 
quantum  wavefunctions  -  an  example  of  a  phenomenon  referred  to  as  'scarring'  [14]. 
A  heuristic  estimate  of  the  'support'  provided  by  the  tangle  for  quantization  is  obtained 
by  considering  how  the  size  (area)  of  the  tangle  compares  with  Planck's  constant. 
This  measure  clearly  shows  that  despite  larger  stable  regions,  the  overall  support  stable 
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Figure  7.  Homoclinic  tangle  associated  with  the  fixed  point  at  (—a,  0)  for  case  A. 
Near  the  fixed  point,  the  solid  line  gives  the  unstable  direction  while  the  dashed  line  is 
the  stable  direction.  The  size  of  Planck's  constant  h  is  shown  to  illustrate  that  several 
states  can  be  supported  by  the  single  structure.  An  estimate  of  the  number  of  states  is 
given  by  the  number  of  h  boxes  needed  to  cover  the  structure. 


and  unstable,  is  considerably  less  in  case  B  as  compared  with  case  A.  This  is  the 
reason  for  the  reduced  stabilization.  However,  the  wavefunction  at  the  end  of  the 
interaction  time  in  case  B  exhibits  three  distinct  peaks  which  clearly  reflect  the  stable 
regions  in  the  phase  space,  unlike  A.  Case  C  is  one  where  the  phase  space  is  dominated 
by  a  large  stable  region  and  classical  and  quantum  intuition  agree.  Thus,  the  discrepancy 
in  cases  A/B  is  a  direct  consequence  of  what  was  stated  to  be  quantum  chaology  - 
"..nonclassical  behaviour  characteristic  of  systems  whose  classical  motion  exhibits 
chaos". 

These  considerations  can  be  extended  to  the  full  3-D  system  where  a  four-dimensional 
map  and  considerably  more  involved  analysis  is  necessary  [15, 16],  though  scarring  is 
once  again  a  relevant  feature.  For  those  concerned  by  the  seemingly  unphysically  large 
field  strengths  used  in  the  illustrative  cases,  it  should  be  noted  that  these  results  can  be 
scaled  to  other  frequency  regimes  and  also  to  excited  state  initial  conditions,  where  an 
experimental  realization  is  more  likely  [17].  The  mobility  of  the  light  component  of  the 
two-component  plasmas  considered  in  wakefield  accelerators  is  another  potential 
experimental  system. 
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Figure  8.     Same  as  figure  7  but  for  case  B.  Note  that  the  classical  excursion 
a  =  F/u2  is  the  same  in  the  two  cases. 

).  Atom  optics  and  dynamical  localization 

Dur  earlier  discussion  makes  clear  that  quantum  dynamics  of  systems  with  chaotic 
classical  limit  are  usually  much  more  stable  in  their  evolution  than  their  classical 
counterparts.  However,  contrary  to  expectations,  the  mechanisms  that  lead  to  quantum 
stabilization  are  not  solely  associated  with  classically  stabilizing  phase  space  structures 
ike  the  regions  around  elliptic  periodic  orbits,  KAM-tori,  and  separatices.  Even 
structures  connected  with  unstable  classical  motion  such  as  cantori,  hyperbolic  periodic 
Drbits,  homoclinic  and  heteroclinjc  tangles  can  support  quantum  mechanical  wavefunc- 
ions.  Even  though  these  'scarred'  wavefunctions  are  ubiquitous  in  both  autonomous  and 
ion-autonomous  strongly  coupled  quantum  systems,  they  form  a  small  part  of  the  bigger 
picture  of  quantum  dynamics  with  chaotic  classical  limit. 

For  example  there  is  also  a  genuinely  quantum  dynamical  stabilization  mechanism. 
'Dynamical  localization'  refers  to  the  fact  that  eigenstates  of  quantum  dynamics  can  be 
localized  in  an  action  variable  like  momentum  despite  the  fact  that  the  classical  limit 
exhibits  (deterministic)  diffusion  [18].  This  effect  is  analogous  to  Anderson  localization 
in  tight-binding  models  [19]  and  provides  a  global  mechanism  (does  not  distinguish 
details  of  the  classical  phase  space)  for  the  quantum  suppression  of  classical  chaos.  In 
one-dimension,  the  localized  wavefunction  has  a  characteristic  exponential  form  as  in  the 
case  of  Anderson  localization. 
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This  is  in  contrast  with  the  phenomenon  of  'scarring'  where  the  quantum 
wavefunctions  show  enhanced  probability  around  specific  classically  unstable  invariant 
structures.  Thus,  the  excitation  of  scarred  wavefunctions  provides  a  'local'  mechanism  of 
suppression  of  the  classically  chaotic  dynamics. 

As  the  emphasis  in  this  article  is  on  the  usefulness  of  these  ideas  in  unexpected 
contexts,  I  will  now  consider  a  recent  example  of  dynamical  localization  in  atom  optics. 
Instead  of  the  smoothed  Coulomb  potential,  let  us  take  V  (q)  =  k  cos  q.  In  the  presence  of 
the  time-drive  and  in  the  moving  frame,  we  have 


=  p2/2  +  k  cos  (q  —  a  sin  (ujt}) 

(a}c°s((l-n(jJt},  (15) 


which  describes  a  modulated  pendulum  where  the  Fourier  weights  Jn  are  Bessel  functions 
of  order  n.  For  large  a,  the  asymptotic  form  for  /„  can  be  taken  where  the  amplitude 
varies  slowly  with  n  (as  required  by  the  ansatz  of  the  earlier  example).  The  resulting  map 
is  a  textbook  paradigm  for  Hamiltonian  dynamics  and  chaos  -  the  standard  or  Chirikov- 
Taylor  map.  More  significantly,  the  quantum  standard  map  or  kicked  rotor  is  the 
paradigm  for  dynamical  localization. 

A  novel  realization  of  this  driven  system  was  reported  recently  by  considering  the 
momentum  transfer  from  a  modulated  standing  wave  of  light  to  a  sample  of  independent, 
ultra-cold  atoms  [3,20].  The  thermal  (cold)  sample  of  approximately  105  sodium 
atoms  provide  a  Gaussian  initial  condition  in  momentum.  The  relevance  of  the  internal 
structure  of  the  atom  to  the  dynamics  can  be  eliminated  by  tuning  the  frequency  of  the 
standing  wave  far  from  resonance.  The  remaining  dynamics  is  that  of  a  'structureless' 
atom,  described  by  the  above  Hamiltonian,  which  undergoes  a  momentum  change  of 
2HkL  (k]_  is  the  wavenumber  of  the  light  wave)  when  the  velocities  of  the  atom  and 
standing  wave  are  matched.  These  'resonant  kicks'  occur  twice  every  standing  wave 
period  though  they  are  not  equally  spaced  in  time.  The  experiment  measures  the  final 
momentum  distribution  of  the  atoms  as  the  modulation  amplitude  of  the  standing  wave 
is  varied. 

The  classical  dynamics  generated  by  (15)  is  governed.  by  resonances  at  p  =  q  =  nui 
with  approximate  widths  4^k\Jm(a}\  in  momentum.  These  resonances  overlap  as  a  is 
increased  after  which  the  classical  particle  can  diffuse  in  momentum.  It  should  be  noted 
that  there  is  a  maximum  momentum  value  pmax  =  a  beyond  which  the  interaction  turns 
off.  This  is  easily  seen  as  the  velocities  of  the  atom  and  standing  wave  have  to  match 
(stationary  phase  condition)  for  the  momentum  transfer  to  occur.  Thus,  the  classical 
diffusion  is  over  a  bounded  region  in  momentum  space,  the  extent  of  which  can  be 
controlled  by  changing  a. 

The  first  column  shown  in  Figure  9  corresponds  to  zero  modulation  amplitude, 
which  is  just  the  case  of  the  simple  pendulum.  The  classical,  experimentally 
measured  and  quantum  distribution  of  final  momenta  of  the  atomic  ensemble 
agree  once  again  because  the  phase  space  is  entirely  regular.  On  increasing  a,  the 
phase  space  becomes  increasingly  chaotic  and  the  stable  regions  shrink  as  shown  in 
column  two.  The  dominant  classical  mechanism  is  now  diffusion  in  momentum.  The 


Figure  9.  Phase  portraits  (upper  panel),  classical  momentum  distributions  (middle 
panel),  and  experimentally  measured  momentum  distributions  compared  with 
quantum  theory  (bottom  panel,  theory  marked  by  lines).  The  first  column  is  for 
a  =  0  and  is  just  the  case  of  the  simple  pendulum  while  a  ^  0  in  column  two.  The 
initial  condition  corresponds  to  a  Gaussian  distribution  in  momentum  and  fully 
extended  (plane  waves)  in  position.  The  vertical  scales  for  the  distributions  are 
logarithmic  and  are  marked  in  decades. 
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Figure  10.    Schematic  representation  of  issues  which  arise  when  considering  the 
quantization  of  classically  chaotic  dynamics. 


final  classical  momentum  distribution  is  nearly  uniform  over  the  allowed  region,  except 
for  a  small  peak  corresponding  to  the  remnant  stable  region.  This  is  in  sharp  contrast  with 
the  quantum  and  experimental  distributions  which  both  exhibit  an  exponential 
distribution  in  momentum  (note  the  logarithinic  y-axis).  This  is  an  example  of  dynamical 
localization. 

Anderson  localization  describes  the  motion  of  electrons  on  a  disordered  lattice  where 
the  randomness  is  externally  imposed.  In  dynamical  localization  randomness  or,  more 
correctly,  pseudorandomness  is  generated  by  the  dynamics  leading  to  localization  in  an 
action  variable,  rather  than  in  space.  Dynamical  equivalents  of  higher  dimensional 
Anderson  models  [21]  also  exist  which  should  be  realized  soon  by  the  atom  optics 
experiments.  It  should  be  mentioned  that  these  experiments  have  already  confirmed  many 
of  the  details  of  the  theory  of  dynamical  localization  [22]  as  well  as  the  mechanism  of 
overlapping  resonances  to  achieve  global  chaos  [23]. 

4.  The  bigger  picture 

Scarring  and  dynamical  localization  are  merely  two  aspects  of  a  much  larger  view  of 
quantum  dynamics  with  classical  chaotic  limit.  As  shown  by  the  schematic  in  figure  10, 
this  picture  originates  from  two  simple  questions  which  still  demarcate  current 
approaches,  (a)  How  does  semiclassical  quantization  work  for  nonintegrable  dynamics? 
and  (b)  What  are  the  semiclassical  signatures  of  chaos? 
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The  first  question  resulted  m  periodic  orbit  quantization,  an  idea  which  dates  back  to 
Van  Vleck  in  1928  and  which  was  extensively  pursued  by  Gutzwiller  [24].  The  seminal 
idea  is  the  use  of  classical  trajectories  to  calculate  the  quantum  propagator  of  a 
mechanical  system  within  the  semiclassical  approximation.  The  trace  of  this  propagator, 
which  gives  information  about  the  spectrum  of  the  dynamics,  is  therefore  linked  to  the 
periodic  orbits  of  the  classical  dynamics.  This  approach  works  not  only  for  integrable 
dynamics  but  also  for  strongly  chaotic  dynamics  as  well.  It  has  been  applied  successfully 
to  describe  the  quantum  properties  of  model  systems  like  the  anisotropic  Kepler  problem 
[25, 26]  and  the  motion  of  a  particle  on  a  surface  of  constant  negative  curvature,  where 
the  trace-formula  is  exact  (see  for  example  [27]);  as  well  as  to  the  spectral  analysis  of  a 
hydrogen  atom  in  a  strong  magnetic  field  [8]  and  the  doubly  excited  spectrum  of  the 
helium  atom  [10].  A  related  and  yet  outstanding  problem  is  periodic  orbit  quantization  for 
dynamics  in  a  mixed  phase  space,  where  regular  and  chaotic  structures  coexist. 

A  recent  twist  to  the  periodic-orbit  analysis  came  with  the  observation  that  in  systems 
where  new  orbits  are  created  through  bifurcation,  the  impact  of  these  orbits  is  felt  in  the 
quantum  spectrum  before  they  appear  ay  real  orbits  of  the  classical  dynamics 
(parametrically  speaking)  [28,  29,  30].  These  so-called  "ghost"  orbit  contributions 
further  modify  the  semi-classical  analysis. 

The  second  question  leads  to  "quantum  chaology".  It  includes  the  study  of  the 
statistical  features  of  the  eigenvalue  spectra,  as  well  as  of  the  eigenvector  components  in  a 
fixed  representation.  Using  this  approach,  a  variety  of  interesting  results  have  surfaced 
(see  for  example  [31, 32]).  An  example  of  consequence  to  experiments  is  the  relationship 
between  the  statistical  features  of  the  spectra  of  most  strongly  coupled  systems  to  certain 
matrix  ensembles  devised  to  understand  nuclear  spectra  [33].  However,  this  statistical 
description  results  in  randomized  eigenfunctions  and  has  to  be  modified  as  soon  as  a 
dynamically  localized  or  scarred  wavefunction  is  observed. 

The  interplay  of  the  different  mechanisms  had  not  been  addressed  till  veiy  recently.  In 
large  part  this  is  due  to  the  fact  that  the  demands  made  on  model  systems  by  the  two 
branches  are  quite  distinct.  For  example,  most  aspects  of  quantum  chaology  can  be 
studied  within  the  <5-kicked  rotor  or  standard  map  paradigm.  However,  semiclassical  or 
periodic  orbit  quantization  is  greatly  facilitated  by  the  existence  of  a  symbolic  coding 
(alphabet)  and  pruning  rales  (grammar)  for  the  classical  periodic  orbits,  as  in  the  3-  and 
4-disk  scattering  systems.  It  is  not  easy  to  calculate  classical  invariant  structures  in  the 
standard  map.  As  such  even  the  competition  between  local  (scarring)  and  global 
(dynamical  localization)  suppression  mechanisms  has  not  been  explored.  Scarring  [34] 
and  dynamical  localization  [35]  have  only  recently  been  reconciled  with  semiclassical 
theory,  though  even  there  open  questions  still  remain. 

Piecewise  continuous  potentials  have  been  used  to  address  a  number  of  issues  related 
to  classical  transport  [36].  Scharf  and  I  [37]  showed  that  a  piecewise  linear  standard  map 
[38]  facilitated  the  calculation  of  classical  invariant  structures  for  scarring  and  provided  a 
simple  coding  for  periodic  orbits.  We  used  this  modified  dynamics  to  explore  the  strength 
of  scarring  [39]  as  well  as  the  construction  of  local  effective  Hamiltonian  flows  based  on 
classical  periodic  orbits  [40].  This  local  effective  description  becomes  necessary  in  mixed 
phase  spaces  though  the  crucial  issue  of  matching  solutions  then  becomes  the  open 
question.  Current  work  on  relating  these  local  effective  flows  to  Frobenius-Perron  [41] 
operator  dynamics  appears  very  promising. 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  H) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  483 


Bala  Sundaram 

Experimentally,   a  strong  focus  of  the  next  generation  of  experiments  in  atom 
optics  will  be  mixed  phase  space  dynamics  and  the  effects  of  noise  [42].  Direct  excitation 
of  scarred  states  and  an  assessment  of  their  coherence  with  an  atom  interferometer 
may  also  be  possible.  Advances  in  fabrication  technology  have  opened  up  another 
new  testing  ground  in  the  area  of  layered  heterostructures.  The  role  of  the  ideas  we 
have  discussed  to  the  transport  or  magnetic  properties  of  driven  quantum  wells  and 
a  variety  of  mesoscopic  systems  is  currently  under  intense  scrutiny  with  the  anticipa- 
tion that  these  might  impact  on  device  physics  as  well.  In  particular,  experiments 
measuring  changes  in  the  magnetoconductance  as  the  shape  of  the  scattering  domain, 
for  a  two-dimensional  electron  gas,  is  changed  from  integrable  (circle  or  square) 
to  nonintegrable  (stadium)  [43,44]  provide  interesting  deviations  from  the  theoretical 
paradigms.  We  had  suggested  mat  ghost  orbit  contributions  are  important  to  the 
dynamics  once  the  Lorentz  force  is  considered  in  computing  classical  trajectories 
[30].  Recently  experimental  evidence  also  indicates  deviations  from  random  matrix 
theory  predictions  [45]  as  well  as  the  role  of  scarring  [46].  Inelastic  scattering  and 
decoherence  are  also  issues  which  invoke  the  role  of  parametric  or  systemic  noise  on 
any  stabilized  quantum  structure  [47].  It  is  my  view  that  just  as  turbulence  provides 
strong  motivation  in  classical  dynamical  analyses,  these  inherently  'messy',  truly  many- 
body  systems  pose  an  analogous  challenge  for  nonintegrable  quantum   dynamical 
systems. 
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Abstract.  We  review  some  properties  of  periodic  orbit  families  in  polygonal  billiards  and  discuss 
in  particular  a  sum  rule  that  they  obey.  In  addition,  we  provide  algorithms  to  determine  periodic 
orbit  families  and  present  numerical  results  that  shed  new  light  on  the  proliferation  law  and  its 
variation  with  the  genus  of  the  invariant  surface.  Finally,  we  deal  with  correlations  in  the  length 
spectrum  and  find  that  long  orbits  display  Poisson  fluctuations. 
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1.  Introduction 

Billiards  are  an  interesting  and  well  studied  class  of  hamiltoman  systems  that  display  a 
wide  variety  of  dynamical  behaviour  depending  on  the  shape  of  the  boundary.  The  ray 
equations  commonly  considered  follow  from  a  short  wavelength  expansion  of  the 
Schrodinger  equation  with  Dirichlet  (or  Neumann)  boundary  conditions.  A  particle  thus 
moves  freely  between  collisions  at  the  boundary  where  it  suffers  specular  reflection.  As  an 
example,  a  square  billiard  generates  regular  dynamics  that  is  restricted  to  a  torus  in  phase 
space  due  to  the  existence  of  two  well  behaved  constants  of  motion.  In  contrast,  generic 
trajectories  in  the  enclosure  formed  by  three  intersecting  discs  explore  the  entire  constant 
energy  surface  and  hence  the  system  is  ergodic.  Moreover,  orbits  that  are  nearby  initially 
move  exponentially  apart  with  time  and  hence  the  system  is  said  to  be  metrically  chaotic. 
Polygonal  billiards  are  a  class  of  systems  whose  properties  are  not  as  well  known. 
The  ones  accessible  to  computations  are  rational  angled  polygons  and  they  generically 
belong  to  the  class  of  systems  referred  to  as  pseudo-integrable  [1].  They  possess  two 
constants  of  motion  like  their  integrable  counterparts  [2]  but  the  invariant  surface 
F  is  topologically  equivalent  to  a  sphere  with  multiple  holes  [1]  and  not  a  torus. 
Such  systems  are  characterized  by  the  genus  g  where  2g  is  the  numbers  of  cuts 
required  in  T  to  produce  a  singly  connected  region  (alternately,  g  is  the  number  of 
holes  in  F).  As  an  example,  consider  the  1-step  billiard  in  figure  1.  For  any  trajectory, 
pi  and  p^  are  conserved.  The  invariant  surface  consists  of  four  sheets  (copies)  corres- 
ponding to  the  four  possible  momenta,  (ip*,  ip>)  that  it  can  have  and  the  edges  of 
these  sheets  can  be  identified  such  that  the  resulting  surface  has  the  topology  of  a 
double  torus. 
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Figure  1.  The  1,  2  and  3-step  billiards  are  pseudo-integrable  due  to  the  3?r/2 
internal  angles.  The  1-step  billiard  (bottom)  is  also  referred  to  as  the  L-shaped  billiard 
in  the  text. 


The  classical  dynamics  no  longer  has  the  simplicity  of  an  integrable  system  where  a 
transformation  to  action  and  angle  co-ordinates  enables  one  to  solve  the  global  evolution 
equations  on  a  torus.  On  the  other  hand,  the  dynamics  is  non-chaotic  with  the  only 
interesting  feature  occurring  at  the  singular  vertex  with  internal  angle  3?r/2.  Here, 
families  of  parallel  rays  split  and  traverse  different  paths,  a  fact  that  limits  the  extent 
of  periodic  orbit  families.  This  is  in  contrast  to  integrable  billiards  (and  to  the  ?r/2 
internal  angles  in  figure  1)  where  families  of  rays  do  not  see  the  vertex  and  continue 
smoothly. 

We  shall  focus  here  on  the  periodic  orbits  of  such  systems  for  they  form  the  skeleton  on 
which  generic  classical  motion  is  built.  They  are  also  the  central  objects  of  modern 
semiclassical  theories  [3]  which  provide  a  duality  between  the  quantum  spectrum  and  the 
classical  length  and  stabilities  of  periodic  orbits.  In  polygonal  billiards,  primitive  periodic 
orbits  can  typically  be  classified  under  two  categories  depending  on  whether  they  suffer 
even  or  odd  number  of  bounces  at  the  boundary.  In  both  cases  however,  the  linearized 
flow  is  marginally  unstable  since  the  Jacobian  matrix,  Jp,  connecting  the  transverse 
components  (u±(t  +  Tp]  =  Jp  uj_(f)  where  U.L  =  (q±,pi_}T  and  Tp  is  the  time  period  of 
the  orbit)  has  unit  eigenvalues.  However,  when  the  number  of  reflections,  np,  at  the 
boundary  is  even,  the  orbit  occurs  in  a  1 -parameter  family  while  it  is  isolated  when  np  is 
odd.  This  follows  from  the  fact  that  the  left  (right)  neighbourhood  of  an  orbit  becomes  the 
right  (left)  neighbourhood  on  reflection  so  that  an  initial  neighbourhood  can  never 
overlap  with  itself  after  odd  number  of  reflections  [1]. 
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Figure  2.  An  example  of  a  periodic  orbit  in  the  3-step  billiard.  Note  that  adjacent 
parallel  rays  are  also  periodic  so  that  a  family  of  periodic  orbits  exists.  Its  extent  is 
limited  by  the  3?r/2  vertices. 

While  isolated  orbits  are  important  and  need  to  be  incorporated  in  any  complete 
periodic  orbit  theory,  a  family  of  identical  orbits  has  greater  weight  and  such  families 
proliferate  faster  than  isolated  orbits  [4].  Besides,  in  a  number  of  cases  including  the  L- 
shaped  billiard  of  figure  1,  isolated  periodic  orbits  are  altogether  absent  since  the  initial 
and  final  momenta  coincide  only  if  the  orbit  undergoes  even  number  of  reflections.  For 
this  reason,  we  shall  consider  families  of  periodic  orbits  in  this  article  (see  figure  2  for  an 
example). 

Not  much  is  however  known  about  the  manner  in  which  they  are  organized  and  the  few 
mathematical  results  that  exist  [4]  concern  the  asymptotic  properties  of  their  proliferation 
rate.  For  a  sub-class  of  rational  polygons  where  the  vertices  and  edges  lie  on  an  integrable 
lattice  (the  so  called  almost-integrable  systems  [4]),  these  asymptotic  results  are  exact.  It 
is  known  for  example  that  the  number  of  periodic  orbit  families,  AT(/),  increases 
quadratically  with  length,  /,  as  /  — >  oo.  For  general  rational  polygons,  rigorous  results 
show  that  c\l2  <  N(l]  <  c2/2  for  sufficiently  large  values  of  I  [4,  5]  while  in  case  of  a 
regular  polygon  Pn  with  n  sides,  it  is  known  that  N(l]  ~  cnl2/A  [6]  where  cn  is  a  number 
theoretic  constant  and  A  denotes  the  area  of  ?„.  Very  little  is  known  however  about  other 
aspects  such  as  the  sum  rules  obeyed  by  periodic  orbit  families  in  contrast  to  the  limiting 
cases  of  integrable  and  chaotic  behaviour  where  these  have  been  well  studied.  Besides,  it 
is  desirable  to  learn  about  the  variation  of  the  proliferation  rate  as  a  function  of  the  genus 
for  this  should  tell  us  about  the  transition  to  chaos  in  polygonal  approximations  of  chaotic 
billiards  [7,  8]. 

We  shall  concern  ourselves  primarily  with  a  basic  sum  rule  obeyed  by  periodic  orbits 
arising  from  the  conservation  of  probability.  This  leads  to  the  proliferation  law, 
N(l]  =  nbal2 / (a(t))  where  bo  is  a  constant.  The  quantity  (a(l))  is  the  average  area 
occupied  by  all  families  of  periodic  orbits  with  length  less  than  /  and  is  not  a  constant 
unlike  integrable  billiards.  We  provide  here  the  some  numerical  results  on  how  (a(/)) 
changes  with  the  length  of  the  orbits  and  the  genus  of  the  invariant  surface.  While  this 
does  not  allow  us  to  make  quantitative  predictions,  the  qualitative  behaviour  sheds  new 
light  on  the  proliferation  law  for  short  orbits  and  its  variation  with  the  genus,  g  of  the 
invariant  surface. 

Finally,  we  shall  also  study  correlations  in  the  length  spectrum  of  periodic  orbits.  The 
numerical  results  provided  here  are  the  first  of  its  kind  for  generic  pseudo-integrable 
billiards  and  corroborate  theoretical  predictions  provided  earlier  [9]. 

The  organization  of  this  paper  is  as  follows.  In  §  2,  we  provide  the  basic  sum  rule  and 
the  proliferation  law  in  pseudo-integrable  systems.  In  §3,  we  discuss  algorithms  to 
determine  periodic  orbit  families  in  generic  situations.  This  is  followed  by  a  numerical 


Oll^    3U11UJUCU.1.£>1<U    ill     O  U* 

2.  The  basic  sum  rule 

The  manner  in  which  periodic  orbits  organize  themselves  in  closed  systems  is  strongly 
linked  to  the  existence  of  sum  rules  arising  from  conservation  laws.  For  example,  the  fact 
that  a  particle  never  escapes  implies  that  for  chaotic  systems  [10,  11] 

U 

where  the  summation  over  p  refers  to  all  primitive  periodic  orbits,  Tp  is  time  period,  Jp  is 
the  stability  matrix  evaluated  on  the  orbit  and  the  symbol  {•)  denotes  the  average  value  of 
the  expression  on  the  left.  Since  the  periodic  orbits  are  unstable  and  isolated, 
|  det(l  —  Jpr)  j  ~  eVT/>,  where  Xp  is  the  Lyapunov  exponent  of  the  orbit.  The  exponential 
proliferation  of  orbits  is  thus  implicit  in  (1). 

A  transparent  derivation  of  (1)  follows  from  the  classical  evolution  operator 

(2) 

where  V  governs  the  evolution  of  densities  </>(x),  x  =  (q,  p)  and  f '  refers  to  the  flow  in 
the  full  phase  space.  We  denote  by  A.n(i)  the  eigenvalue  corresponding  to  an 
eigenfunction  $n(x)  such  that  L'o^re(x)  =  An(f)<ji>n(x).  The  semi-group  property, 
L'i  oL'2  =  L'l+ti,  for  continuous  time  implies  that  the  eigenvalues  {A.n(t)}  are  of  the 
form  {e*nt}.  Further,  for  hamiltonian  flows,  (2)  implies  that  there  exists  a  unit  eigenvalue 
corresponding  to  a  uniform  density  so  that  AO  =  0.  For  strongly  hyperbolic  systems, 
An  =  ~otn  +  z'An  n  >  1  with  a  negative  real  part  implying  that 

Tr  U  =  1  +  ]T  exp{-a:Mf  +  i$nt}.  (3} 

n 

Equation  (1)  is  thus  a  restatement  of  (3)  with  the  trace  expressed  in  terms  of  periodic  orbit 
stabilities  and  time  periods. 

For  polygonal  billiards,  appropriate  modifications  are  necessary  to  take  account  of  the 
fact  that  the  flow  is  restricted  to  an  invariant  surface  that  is  two  dimensional.  Further, 
since  classical  considerations  do  not  always  yield  the  spectrum  {Xn(t}},  we  shall  take 
resort  to  the  semiclassical  trace  formula  which  involves  periodic  orbit  sums  similar  to  the 
kind  that  we  shall  encounter  in  the  classical  case. 

Before  considering  the  more  general  case  of  pseudo-integrable  billiards,  we  first 
introduce  the  appropriate  classical  evolution  operator  for  integrable  systems.  This  is 
easily  defined  as 

y  (4) 

where  d\  and  02  are  the  angular  coordinates  on  the  torus  and  evolve  in  time  as 
B\  =  w/(/i,/2>  +  et  with  w  =  dH(Ii,I2)/dli  and/,-  =  (1/2*)  jFr,p.dq.  Here  IV  =  1,2 
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refer  to  the  two  irreducible  circuits  on  the  torus  and  p  is  the  momentum  conjugate  to  the 
coordinate  q. 

It  is  easy  to  see  that  the  eigenfunctions  {</>n(#i,02)}  are  such  that  0n(0'j,02)  = 
An(f)0M(#i,#2)  where  An(t)  =  e'Q"r.  On  demanding  that  0n(#i,02)  be  a  single  valued 
function  of  (0i,02)>  it  follows  that  0n(#i,  #2)  =  e'("lfll+nzfe)  where  n  =  (ni,n2)  is  a  point 
on  the  integer  lattice.  Thus  the  eigenvalue,  An(f)  =  exp{it(niuj\  +niuj2)}. 

To  illustrate  this,  we  consider  a  rectangular  billiard  where  the  hamiltonian  expressed  in 
terms  of  the  actions,  7i,72  is  #(7i,72)  =  -n2(I\/L^  +I2/LI)  where  Lj,L2  are  the  lengths 
of  the  two  sides.  With  I\  =  \/EL\  cos(<p)/7r  and  72  =  ^/ELi  sin(<^)/7r,  it  is  easy  to  see 
that  at  a  given  energy,  E,  each  torus  is  parametrized  by  a  particular  value  of  ip.  Thus 

An(?)  =  e' 

and  the  spectrum  is  continuous.  The  trace  thus  involves  an  integration  over  (p  as  well  as  a 
sum  over  all  «i,«2: 


Tr  L'  =  d</?  e"sin^+^  =  27rTVo(\/£^)  (6) 


where  /o  is  a  Bessel  function,  Z  =  2f\/E,  tan(/in)  =  n\Lil(niL\)  and  £„  =  7T2(ni/Li  + 
rtj/Lj).  ^n  s^arating  out  n  =  (0,0)  from  the  rest  and  restricting  the  summation  to  the 
first  quadrant  of  the  integer  lattice,  it  follows  that 


Tr  L'  =  2?r  +  27rtf       /oOs/^O-  (7) 

n 

where  N  —  4.  Note  that  the  first  term  on  the  right  merely  states  the  fact  that  there  exists  a 
unit  eigenvalue  on  every  torus  labelled  by  <p.  Also,  though  we  have  not  invoked 
semiclassics  at  any  stage,  the  spectrum  {£"„}  corresponds  to  the  Neumann  spectrum  of  the 
billiard  considered.  We  shall  subsequently  show  that  this  is  true  in  general  and  for  now  it 
remains  to  express  the  trace  of  the  evolution  operator  in  terms  of  periodic  orbits. 
The  trace  of  L'  in  the  integrable  case  can  be  expressed  as 

Tr  V  =  I  dy?  /  dfl^  S(0l  -  e\}8(e2  -  0$.  (8) 

It  follows  immediately  that  the  only  orbits  that  contribute  are  the  ones  that  are  periodic. 
For  a  rectangular  billiard,  the  integrals  can  be  evaluated  quite  easily  and  yields 

^(/  -(„„„,)  (9) 


where  {N\,  A^}  are  the  winding  numbers  on  the  torus  and  label  periodic  orbits  of  length 
.  Using  (7)  and  (9),  it  follows  that 


_  (10) 

NI      NI      "ii/v2  n 

Thus,  the  dominant  non-oscillatory  contribution  to  the  trace  that  survives  averaging  is  2?r 


Figure  3.  The  singly  connected  region  for  an  L-shaped  billiard  consists  of  four 
copies  with  edges  appropriately  identified.  A  trajectory  originating  near  the  3?r/2 
vertex  in  1  is  plotted  in  configuration  space  using  bold  lines  and  the  corresponding 
unfolded  trajectory  is  also  shown.  The  latter  consists  of  parallel  segments  and  the 
trajectory  can  be  parametrized  by  the  angle  tp  that  it  makes  for  example  with  the  q\ 


axis. 


and  this  gives  rise  to  the  analogue  of  (1) 


(11) 


The  proliferation  rate  for  the  rectangular  billiard  thus  follows  from  these  considerations. 

These  ideas  can  be  generalized  for  polygonal  billiards  that  are  pseudo-integrable  even 
though  the  structure  of  the  invariant  surface  no  longer  allows  one  to  use  action  and  angle 
variables. 

For  both  integrable  and  pseudo-integrable  polygonal  billiards,  the  dynamics  in  phase 
space  can  be  viewed  in  a  singly  connected  region  by  executing  2g  cuts  in  the  invariant 
surface  and  identifying  edges  appropriately.  At  a  given  energy,  the  motion  is  parametrized 
by  the  angle,  <p,  that  a  trajectory  makes  with  respect  to  one  of  the  edges.  As  a  trivial 
example,  consider  the  rectangular  billiard.  The  singly  connected  region  is  a  larger 
rectangle  consisting  of  four  copies  corresponding  to  the  four  directions  that  a  trajectory 
can  have  and  these  can  be  glued  appropriately  to  form  a  torus  [12].  As  a  non-trivial 
example,  consider  the  L-shaped  billiard  of  figure  (1)  which  is  pseudo-integrable  with  its 
invariant  surface  having,  g  =  2.  Alternately,  the  surface  can  be  represented  by  a  singly 
connected  region  in  the  plane  (see  figure  3)  and  consists  of  four  copies  corresponding  to 
the  four  possible  directions  an  orbit  can  have  and  these  are  glued  appropriately.  A 
trajectory  in  phase  space  thus  consists  of  parallel  segments  at  an  angle  tp  measured  for 
example  with  respect  to  one  of  the  sides.  It  will  be  useful  to  note  at  this  point  that  the 
same  trajectory  can  also  be  represented  by  parallel  segments  at  angles  TT  —  (p,  TT  -f  <p  and 
2?r  —  (p.  In  general,  the  number  of  directions  for  representing  a  trajectory  equals  the 
number  of  copies,  AT,  that  constitute  the  invariant  surface. 
The  classical  propagator  on  an  invariant  surface  parametrized  by  (p  is  thus 


where  q  refers  to  the  position  in  the  singly  connected  region  and  q"(^)  is  the  time 
evolution  parametrized  by  <p  as  described  above. 
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The  trace  ofU((p)  takes  into  account  all  possible  invariant  surfaces  that  exist  and  hence 
involves  an  additional  integration  over  <p.  Thus 

(13) 

Clearly  the  only  orbits  that  contribute  are  the  ones  that  are  periodic.  Further,  the  q 
integrations  are  simpler  if  we  transform  to  a  local  coordinate  system  with  one  component 
parallel  to  the  trajectory  and  the  other  perpendicular.  Thus  6\\(q\\  —  q\\)  =  0/v)<5(?  ~  rTP) 
where  v  is  the  velocity,  Tp  is  the  period  of  the  orbit  and  r  is  the  repetition  number. 
Similarly,  for  an  orbit  of  period  Tp  parametrised  by  the  angle  ipp,  6jL(qjL  -  q*L]  = 
<%?  —  <PP)/\dqjL/dy>\v=v  where  \dq±/dif>\v=::((>  —rlp  for  marginally  unstable  billiards. 
Putting  these  results  together  and  noting  that  each  periodic  orbit  occurs  in  general  at  N 
different  values  of  (p,  we  finally  have 


Tr  I'  =        An  W  =  N  6(1  -  r/p)  (14) 

«  p      r=i    nP 

where  /  =  tv  and  the  summation  over  p  refers  to  all  primitive  periodic  orbit  families  with 
length  lp  and  occupying  an  area  ap.  Note  that  (9)  is  a  special  case  of  (14)  which  holds  for 
both  integrable  and  non-integrable  polygonal  billiards. 

It  is  possible  to  re-express  the  periodic  orbit  sum  in  (14)  starting  with  the  appropriate 
quantum  trace  formula  [13] 


£  6(E  -  En]  =  dav(E)  +  -1=  £  E  -=  cos  krlp  -     -  ^rnp       (15) 


Here  dav(£)  refers  to  the  average  density  of  quantal  eigenstates,  k  =  -v/E,  lp  is  the  length 
of  a  primitive  periodic  orbit  family.  The  phase  7crnp  is  set  to  zero  while  considering  the 
Neumann  spectrum  while  in  the  Dirichlet  case,  np  equals  the  number  of  bounces  that  the 
primitive  orbit  suffers  at  the  boundary.  For  convenience,  we  have  chosen  H  =  1  and  the 
mass  m  =  1/2.  Starting  with  the  function 


/CO 
dEf(VEl)^EY^S(E-En]  (16) 


where  /(jc)  =  -^/2/Trxcos(x  -  Tr/4)  and  0  <  e  <  EQ,  it  is  possible  to  show  using  (15)  that 
for  polygonal  billiards  [15] 

00 

E  E  T  ^  ~  rf')  =  27r£>°  +  27rE/(V£«/)  (17) 

p      r=l       P  n 

for  /5  —  >  0+.  In  the  above, 

*o  =  E  E  ^T^  /£  ^f(VEl)f(VErlp]  (18) 

p         r  •'O 

and  is  a  constant  [15,  16].  It  follows  from  (17)  and  (14)  that 

»T 


Tr  L'  =          ~r  *('  ~  r/^) 

p      r=l     r/P 
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Diinara,  tne  oscillatory  contributions  wasft  out  on  averaging  so  that 

'(20) 


P      r=l 

This  is  the  central  result  of  this  section  and  forms  the  basic  sum  rule  obeyed  by  periodic 
orbit  families. 

A  few  remarks  about  this  derivation  and  the  magnitude  of  bo  are  however  in  order. 
Equation  (14)  is  exact  for  the  L-shaped  billiard  and  all  other  boundaries  which  preclude 
the  existence  of  isolated  periodic  orbits.  However,  even  for  these  shapes,  the  semiclassical 
trace  formula  is  only  an  approximation  to  the  exact  density  whenever  the  billiard  in 
question  is  pseudo-integrable.  In  this  sense,  the  sum  rule  in  (20)  is  not  expected  to  be 
exact.  However,  we  believe  the  existence  of  higher  order  corrections  (such  as  diffraction) 
affects  the  magnitude  of  the  constant  bQ  while  preserving  the  constant  nature  of  the 
periodic  orbit  sum  on  the  left. 

In  the  integrable  case  it  is  easy  to  show  from  other  considerations  that  bo  =  1/N  where 
N  is  the  number  of  sheets  that  constitute  the  invariant  surface  [15].  This  also  follows  from 
(10)  and  (19)  since  b$N  =  1.  For  pseudo-integrable  billiards,  each  invariant  surface 
parametrised  by  y>  has  an  eigenvalue,  AQ(</?)  =  1.  Thus  the  non-oscillatory  part  of  the 
trace  should  equal  /Ao(<£>)  d</3  and  to  a  first  approximation  this  yields  2?r  implying  that 
bo  =  1/W.  However  each  singular  vertex  connects  two  distinct  points  at  any  angle  (p  and 
hence  the  integration  over  (p  is  non-trivial.  We  can  therefore  state  that  bo  is  approximately 
l/N  in  the  pseudo-integrable  case  while  this  is  exact  in  the  integrable  case.  We  shall  show 
numerically  diat  the  magnitude  of  deviations  (from  1/N)  in  the  pseudo-integrable  case 
depends  on  the  existence  of  periodic  orbit  pairs  at  the  singular  vertex.  First  however,  we 
briefly  describe  the  algorithms  used  to  determine  periodic  orbit  families. 

3.  Algorithms  for  determining  periodic  orbits 

Periodic  orbits  hi  polygonal  billiards  are  generally  hard  to  classify.  Unlike  integrable 
billiards,  they  cannot  be  described  by  two  integers  which  count  the  number  of  windings 
around  the  two  irreducible  circuits  on  the  torus  though  in  exceptional  cases  this  can 
indeed  be  done.  However,  since  the  invariant  surface  has  a  well-defined  genus,  it  is 
expected  that  a  set  of  integers  N  =  {ty  ,  #2>  •  •  •  ,  Nzg}  obeying  the  relationship 

2-nNi 


can  be  used  to  label  periodic  orbits.  Here  w,  refers  to  the  frequency  corresponding  to  each 
irreducible  circuit  Tt  and  depends  on  the  energy,  E  and  the  angle  </?  that  labels  each 
invariant  surface.  Note  however  that  not  all  points  on  this  multi-dimensional  integer 
lattice  are  allowed  since  there  are  constraints  and  this  method  of  labelling  orbits  becomes 
cumbersome  for  surfaces  of  higher  genus.  Nevertheless,  we  illustrate  the  idea  here  for  the 
L-shaped  billiard  of  figure  1. 

Let  the  length  of  the  two  bouncing  ball  orbits  in  the  X-direction  be  LI  and  LI 
respectively  and  their  lengths  in  the  F-direction  be  L3  and  £4.  These  define  the  irreducible 
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2L2 

v  sin(^) 
2L3 


2L4     ~    TN 
This  implies  that  the  angle  <p  at  which  a  periodic  orbit  can  exist  is  such  that 

A/sL^+A^ 
v^'  ^     } 


Equations  (23)  and  (22)  merely  express  the  fact  that  any  periodic  orbit  should  have 
integer  number  of  windings  around  the  irreducible  circuits.  Thus,  the  total  displacement 
along  the  X-direction  should  be  2(N\L\  +  A7^)  while  the  total  displacement  in  the  Y- 
direction  should  be  2(N^  +  A^JU)  where  NI  are  integers.  As  mentioned  before  however, 
not  all  realizations  of  {A7,-}  correspond  to  real  periodic  orbits  and  the  final  step  consists  in 
checking  numerically  whether  a  periodic  orbit  at  the  angle  (p  (given  by  (23))  exists.  Note 
that  one  member  of  each  family  necessarily  resides  at  one  of  the  singular  vertices  and 
hence  it  is  sufficient  to  verify  the  existence  of  this  orbit. 

This  method  works  equally  well  for  other  billiards  with  steps  (see  figure  1)  and  the 
number  of  integers  necessary  to  describe  orbits  increases  with  the  number  of  steps.  An 
alternate  method  which  exploits  the  fact  that  periodic  orbits  occur  in  families  is  often 
useful  when  the  irreducible  circuits  are  not  obvious  and  this  is  described  below. 

Note  that  a  non-periodic  orbit  originating  from  the  same  point  q  (e.g.  the  singular 
vertex)  as  a  periodic  orbit  but  with  a  momentum  slightly  different  (from  the  periodic 
orbit)  suffers  a  net  transverse  deviation  q\_  —  (—l)n<p  sin(y>  —  ^,)/v?.  Here  lv  is  the 
distance  traversed  by  a  non-periodic  orbit  at  an  angle  (p  after  n^  reflections  from  the 
boundary  and  (pp  is  the  angle  at  which  a  periodic  orbit  exists.  This  provides  a  correction  to 
the  initial  angle  and  a  few  iterations  are  normally  sufficient  to  converge  on  a  periodic 
orbit  with  good  accuracy.  In  order  to  obtain  all  periodic  orbits,  it  is  necessary  to  shoot 
trajectories  from  every  singular  vertex  since  one  member  of  each  family  resides  at  one  of 
these  vertices. 

Apart  from  the  length  of  a  periodic  orbit,  it  is  also  important  to  compute  the  area 
occupied  by  the  family.  This  can  be  achieved  by  shooting  a  single  periodic  trajectory 
which  resides  at  a  singular  vertex  and  by  noting  that  this  orbit  lies  on  the  edge  of  a  family. 
Thus  if  the  rest  of  the  family  lies  on  the  left  neighbourhood  initially,  it  is  necessary  to 
determine  the  perpendicular  distance  from  a  singular  vertex  to  this  trajectory  every  time 
the  initial  neighbourhood  lies  towards  this  singular  vertex.  The  shortest  of  these 
perpendicular  distances  gives  the  transverse  extent  of  the  family  and  the  area  can  thus  be 
computed. 

These  algorithms  have  been  used  to  generate  the  lengths  {lp}  and  areas  {ap}  of 
primitive  orbits  in  the  L-shaped  billiard  as  well  as  the  two  and  three  step  billiards.  We 
present  our  numerical  results  in  the  following  sections. 

Pramana  -  /.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  H) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  495 


5(0 


Figure  4.    A  demonstration  of  the  sum  rule  for  the  1-step  billiard.  The  curve  with 
slope  significantly  different  corresponds  to  a  degenerate  example. 


4.  Numerical  results:  The  sum  rule  and  area  law 

We  present  here  our  numerical  results  on  the  1-step  (L-shaped),  2-step  and  3-step 
billiards.  Their  invariant  surfaces  have  genus  2,  3  and  4  respectively  and  the  quantities  we 
shall  study  are  the  sum  rule  derived  in  §  2  and  the  variation  of  (a(/))  with  I  and  the  genus, 
g  of  the  invariant  surface. 
The  sum  rule  we  wish  to  study  can  be  re-expressed  as 

ap       L  i  fr>A\ 

„  -f  ~  bnl  .  (24) 


and  this  is  plotted  in  figure  4  for  four  different  1-step  (L-shaped)  billiards.  Notice  first 
that  in  each  case  the  behaviour  is  linear  as  predicted  by  (24).  Besides,  in  three  of  the  four 
cases,  the  slopes  are  quite  close  (bo  ~  0.27)  and  these  correspond  to  non-degenerate  L- 
shaped  billiards  with  sides  that  are  unequal  and  irrationally  related.  The  one  with  a 
substantially  larger  slope  (b^  ~  0.32)  is  a  degenerate  case  of  a  square  with  a  quarter 
removed  and  for  this  system  there  exist  a  substantial  number  of  periodic  orbit  pairs  at  the 
same  angle  on  the  two  adjacent  edges  at  the  singular  vertex.  The  differences  between  the 
degenerate  and  non-degenerate  case  also  persists  in  other  quantities  as  we  shall  shortly 
see. 

We  next  plot  S(t)  for  non-degenerate  examples  of  a  1,  2  and  3-step  bilh'ards  in  figure  5. 
The  number  of  orbits  considered  are  far  less  due  to  increased  computational  effort  though 
the  linear  behaviour  is  obvious  in  all  three  cases.  The  slopes  are  again  close  to  0.25  and 
vary  from  bQ  ~  0.24  to  bQ  ~  0.28. 

Thus,  periodic  orbits  obey  a  basic  sum  rule  given  by  (24)  where  bo  is  a  constant. 
Further,  the  magnitude  of  bo  is  close  to  0.25  in  all  cases  and  the  deviations  from  this  value 
are  larger  hi  the  degenerate  case  where  periodic  orbit  pairs  exist  at  the  singular  vertex. 

The  sum  rule  yields  the  proliferation  rate  of  periodic  orbits 


(25) 


where  (a(l}}  is  the  average  area  occupied  by  periodic  orbit  families  with  length  less  than 
I.  In  the  case  of  integrable  polygons  (a(l)}  is  a  constant  and  equals  AN  where  A  is  the  area 
of  the  billiard  while  bQ  =  l/N.  For  pseudo-integrable  cases,  the  areas  ap  occupied  by 
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Figure  5.    The  sum  rule  for  a  1,  2  and  3-step  billiard.  They  are  respectively  the 
middle,  bottom  and  top  curves. 


Wfi 


0.2  i 


Figure  6.  The  normalized  average  area,  (a(f))  =  (a(l))/NA  as  a  function  of  the 
length  /  for  3  examples  of  the  1-step  billiard.  The  saturation  value  differs  for  the 
degenerate  example  mentioned  earlier. 


(a(/)>  0.2  - 


Figure  7.    The  normalized  area  for  an  example  each  of  the  1-step  (top),  2-step 
(middle)  and  3-step  (bottom)  billiards. 


individual  periodic  orbit  families  genetically  occupy  a  wide  spectrum  bounded  above  by 
NA.  As  the  length  of  a  family  increases,  encounters  with  singular  vertices  are  more 
frequent  so  that  the  transverse  extent  of  family  decreases.  Quantitative  predictions  about 
the  behaviour  of  (a(l}}  are  however  difficult  and  we  provide  here  some  numerical  results. 
Figure  6  shows  a  plot  of  (a(f))/NA  for  three  different  L-shaped  billiards  one  of  which 
is  the  degenerate  case  presented  in  figure  4.  The  average  area  increases  initially  before 
saturating  for  large  /  in  all  cases.  The  normalized  saturation  value  seems  to  be 
independent  of  the  ratio  of  the  sides  as  long  as  they  are  irrationally  related  (we  have 
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observed  this  for  other  cases  not  presented  here)  but  is  very  different  for  the  degenerate 
example  of  a  square  with  a  quarter  removed. 

A  comparison  of  non-degenerate  1,  2  and  3-step  billiards  is  shown  in  figure  7.  Recall 
that  the  invariant  surfaces  for  these  have  genus  respectively  equal  to  2,  3  and  4.  The 
saturation  observed  for  the  1-step  (L-shaped)  case  seems  to  be  a  common  feature  and 
interestingly  the  normalised  average  saturation  value  decreases.  This  is  however  expected 
since  the  number  of  singular  vertices  increases  with  the  genus  thereby  reducing  the 
average  transverse  extent  of  orbit  families  at  any  given  length. 

These  observations  allow  us  to  conclude  that  for  short  lengths,  the  proliferation  law  is 
sub-quadratic.  Asymptotically  however,  N(f)  ~  I2  as  in  integrable  billiards.  Further,  the 
asymptotic  proliferation  rate  for  billiards  with  the  same  area  A  increases  with  the  genus 
due  to  a  decrease  in  the  asymptotic  normalized  saturation  value  of  (a(l)). 

These  numerical  results  provide  a  qualitative  picture  of  the  area  law  and  show  that 
periodic  orbits  in  polygonal  billiards  are  organized  such  that  they  obey  a  sum  rule. 
Quantitative  predictions  would  require  a  more  extensive  numerical  study  such  that 
empirical  laws  for  the  saturation  value  and  its  variation  with  the  genus  can  be  arrived  at. 

5.  Correlations  in  the  length  spectrum 

Our  numerical  explorations  so  far  have  been  focussed  on  the  average  properties  of  the 
length  spectrum.  We  shall  now  attempt  to  understand  the  nature  of  the  fluctuations  and 
characterize  their  statistical  properties. 

Fluctuations  in  the  length  spectrum  are  generally  difficult  to  study  from  purely 
classical  considerations.  There  are  notable  exceptions  however.  Fluctuations  in  the 
integrable  case  can  be  studied  using  the  Poisson  summation  formula  since  the  lengths  of 
orbits  are  expressed  in  terms  of  integers  {N\,  N2}  [17].  The  other  extreme  is  the  motion 
on  surfaces  of  constant  negative  curvature  where  the  Selberg  trace  formula  provides  an 
exact  dual  relationship  between  the  classical  lengths  and  the  eigenvalues  of  the  Laplace- 
Beltrami  operator  [18].  For  other  systems,  a  possible  way  of  studying  fluctuations  in  the 
length  spectrum  lies  in  inverting  the  semiclassical  quantum  trace  formula.  For  pseudo- 
integrable  billiards,  this  has  been  achieved  in  §  2  and  the  integrable  density  of  lengths, 
N(l]  can  be  expressed  as 

(26) 


Statistical  properties  of  the  fluctuations  can  thus  be  studied  using  techniques  introduced 
for  the  fluctuations  in  the  quantum  energy  spectrum  [19,  9]. 

The  correlations  commonly  studied  are  the  nearest  neighbour  spacings  distribution, 
P(s)  and  a  two-point  correlation  referred  to  as  the  spectral  rigidity,  As(L).  The  rigidity 
measures  the  average  mean  square  deviation  of  the  staircase  function  N(t)  from  the  best 
fitting  straight  line  over  L  mean  level  spacings.  For  a  normalised  (unit  mean  spacing) 
Poisson  spectrum,  P(s)=$~s  while  A3(L)=L/15.  These  features  are  commonly 
observed  in  the  quantum  energy  spectrum  of  integrable  systems  as  well  as  in  their  length 
spectrum.  For  chaotic  billiards,  the  quantum  energy  spectrum  genetically  produces  non- 
Poisson  statistics  while  the  length  spectrum  correlations  are  Poisson  for  long  orbits  at 
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Figure  8.    The  nearest  neighbour  spacings  distribution  for  a  non-degenerate  example 
of  a  1-step  billiard.  Also  plotted  is  the  Poisson  distribution. 


Figure  9.    The  spectral  rigidity  for  the  1-step  billiard  together  with  the  L/ 15  line. 


Figure  10.  The  nearest  neighbour  distribution  for  a  3-step  billiard.  The  available 
lengths  are  much  smaller  and  minor  deviations  from  Poisson  behaviour  can  be 
observed. 


least  over  short  ranges  [20].  There  are  however  deviations  that  can  be  observed  in  As(L) 
for  short  orbits  or  over  longer  ranges  in  the  spectrum  [20].  With  this  background,  we  now 
present  our  results  for  pseudo-integrable  billiards. 

Figures  (8)  and  (9)  show  plots  of  P(s)  and  A3(L)  for  a  non-degenerate  L-shaped 
billiard.  A  total  of  3000  lengths  have  been  considered  after  excluding  the  shortest  3000 
orbits.  The  correlations  are  clearly  Poissonian  as  in  the  case  of  integrable  or  chaotic 
billiards. 

For  the  3-step  billiard  where  fewer  lengths  are  available  (about  1250),  we  have  carried 
out  a  similar  study  and  the  results  are  shown  in  figures  (10)  and  (1 1).  Deviations  from  the 


Figure  11.    The  spectral  rigidity  for  the  3-step  billiard  considered  earlier.  The 
deviations  from  Poisson  behaviour  is  more  pronounced  for  this  measure. 


Possion  behaviour  can  now  be  seen  especially  in  the  spectral  rigidity.  By  considering 
shorter  lengths  in  the  L-shaped  billiard,  similar  deviations  were  observed. 

The  statistical  properties  of  fluctuations  in  the  length  spectrum  of  pseudo-integrable 
systems  are  thus  similar  to  those  of  chaotic  billiards.  For  long  orbits  the  correlations  are 
Poisson  while  deviations  exist  for  shorter  orbits. 


6.  Discussions  and  conclusions 

Quantum  billiards  are  experimentally  realizable  in  the  form  of  microwave  cavities  since 
the  wave  equations  are  identical  [21,  22].  They  are  also  interesting  in  their  own  right  and 
have  proved  to  be  the  testing  grounds  for  several  ideas  in  the  field  of  quantum  chaos.  Of 
all  possible  shapes,  polygonal  billiards  have  perhaps  been  the  least  understood  largely 
because  very  little  was  known  about  the  organization  of  periodic  orbits.  The  results 
presented  here  have  however  been  used  recently  to  obtain  convergent  semiclassical 
eigenvalues  with  arbitrary  non-periodic  trajectories  [23]  as  well  to  demonstrate  that 
periodic  orbits  provide  excellent  estimates  of  two-point  correlations  in  the  quantum 
energy  spectrum  [16].  Though  it  is  beyond  the  scope  of  this  article  to  review  these  recent 
developments  on  quantum  polygonal  billiards,  we  refer  the  interested  reader  to  these 
articles  and  the  references  contained  therein. 

In  the  previous  sections  we  have  explored  the  organization  of  periodic  orbits  in  rational 
polygonal  billiards  that  are  pseudo-integrable.  Our  main  conclusions  are  as  follows: 

•  Orbit  families  obey  the  sum  rule  Q3p  Y^=i  ap$(l  ~  rW/(rW)  =  2irf>o  thereby  giving 
rise  to  the  proliferation  law  N(l)  =  7rfc0/2/{a(/)}  for  all  rational  polygons. 

•  The  quantity  bo  is  approximately  l/N  in  generic  rational  billiards  and  deviations  from 
this  value  are  observed  to  be  significant  in  degenerate  situations. 

•  (a(l)}  increases  initially  before  saturating  to  a  value  much  smaller  than  the  maximum 
allowed  area  NA.  The  asymptotic  proliferation  law  is  thus  quadratic  even  for  systems 
that  are  not  almost-integrable  and  the  density  of  periodic  orbits  lengths  is  far  greater 
than  an  equivalent  integrable  systems  having  the  same  area. 

•  The  normalized  average  area  (a(l}}/NA  decreases  with  the  genus  of  the  billiard  while 
bQ  is  approximately  l/N.  Periodic  orbits  thus  proliferate  faster  with  an  increase  in 
genus. 
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Abstract.  We  present  a  treatment  of  many-body  fermionic  systems  that  facilitates  an  expression 
of  well-known  quantities  in  a  series  expansion  in  h.  The  ensuing  semiclassical  result  contains,  to  a 
leading  order  of  the  response  function,  the  classical  time  correlation  function  of  the  observable 
followed  by  the  Weyl-Wigner  series;  on  top  of  these  terms  are  the  periodic-orbit  correction  terms. 
The  treatment  given  here  starts  from  linear  response  assumption  of  the  many-body  theory  and  in  its 
connection  with  semiclassical  theory,  it  assumes  that  the  one-body  quantal  system  has  a  classically 
chaotic  dynamics.  Applications  of  the  framework  are  also  discussed. 
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1.  Introduction 

Semiclassical  framework  for  single-particle  systems  is  presently  in  an  advanced 
stage  [1,2].  For  both  the  integrable  and  chaotic  dynamics,  we  now  understand  the 
semiclassical  quantization.  However,  for  intermediate  [3]  and  mixed  dynamical  scenarios, 
we  still  lack  a  convincing  theory.  Another  situation  of  a  great  practical  relevance  arises  in 
the  systems  comprising  of  many  bodies.  That  the  spectral  fluctuation  characteristics 
of  these  systems  are  modelled  in  much  the  same  way  as  for  chaotic  systems  with 
lesser  degrees  of  freedom  has  been  shown  quite  recently  [4].  It  is  very  important  to 
note  that  complex  systems  do  possess  features,  mainly  associated  with  the  generic 
nature  of  the  thermodynamic  limit,  which  are  completely  absent  from  the  systems 
with  fewer  freedoms,  friction  being  one  of  the  examples.  There  are  other  systems 
which  one  would  like  to  understand  semiclassically.  Some  notable  examples  are  of 
quantum  dots  [5],  metallic  clusters  [6]  and  their  optical  properties  [7];  also,  nuclear 
physics  at  high  spins  [8]  presents  us  with  opportunity  to  test  various  semiclassical  ideas 
in  many-body  theory.  There  has  been  an  attempt  based  on  the  semicjassical  limit  of 
the  time-dependent  Hartree-Fock  equation  where  one  can  establish  a  connection  [9] 
between  the  strength  function  given  by  the  Vlasov  equation  and  the  corresponding 
quantum  function  in  the  limit  of  large  quantum  numbers.  Analogous  to  the  many-body 
treatment  based  on  random-phase  approximation  (RPA),  the  linearized  Vlasov 
equation  gives  an  integral  equation  for  the  particle-hole  propagator  in  terms  of  classical 
propagator  in  the  static  mean  field  and  two-particle  Coulomb  interaction.  Thus  one 
needs  to  evaluate  the  classical  propagator  in  the  mean  field  and  then  solve  the  integral 
equation. 
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describe  the  shell  effects  in  metallic  clusters  [10].  Of  course,  these  have  been  preceded  by 
nuclear  physicists  [11]  by  more  than  a  decade.  In  fact,  the  periodic-orbit  theory  [12], 
which,  in  principle,  should  be  rendered  useless  for  practical  purposes  due  to  a  huge 
number  of  technical  problems  associated  with  finding  periodic  orbits  and  summing  a 
conditionally  convergent  series,  gets  encouragement  from  observations  that  only  a  few 
periodic  orbits  are  enough  in  many  situations  [12]. 

We  present  here  a  semiclassical  treatment  of  the  response  function.  Response  function 
is  essentially  the  imaginary  part  of  generalized  dynamical  susceptibility  which,  in  turn,  is 
related  to  the  correlation  commutator.  In  §  2,  we  present  the  general  discussion  from  the 
linear  response  theory  to  arrive  at  a  quantity  for  which  semiclassical  expression  can  be 
written.  The  key  point  of  this  section  is  to  show  that  the  most  important  quantity  is  a  time 
correlation  function.  In  §  3,  we  present  semiclassical  treatment  of  the  system  perturbed  by 
an  external  influence.  This  enables  us  to  write  the  relevant  results  of  §§  2  and  3  in  a 
semiclassical  expansion.  We  will  see  that  the  leading  term  is  indeed  the  two-time 
correlation  function  averaged  over  the  phase  space,  followed  by  the  Weyl-Wigner  series 
which  has  the  periodic-orbit  corrections.  On  our  way,  we  take  account  of  the  fermionic 
nature  of  the  particles. 

2.  Response  function  and  the  time  correlation  function 

We  present  here  a  discussion  on  the  response  of  a  system  in  the  presence  of  an  external 
field.  We  work  with  susceptibility  [13]  and  express  it  in  terms  of  trace  over  single-particle 
states  of  two-time  correlation  function  of  the  observable.  This  quantity  is  also  known  as 
the  polarization  propagator  [14]. 

If  a  system  described  by  the  Hamiltonian  H  is  externally  disturbed  by  a  field  Fext(f), 
then  the  total  Hamiltonian  is 

H7=H-QF**(t},  -  (1) 

where  Q  is  an  observable,  an  example  could  be  magnetization  in  the  context  of  spin 
systems,  or,  an  electric,  dipole  operator  in  an  example  involving  photoabsorption,  and  so 
on.  The  response  function  can  be  written  as  the  imaginary  part  of  the  dynamical 
susceptibility,  ? 

'-".'•  (2) 


The  angular  brackets  denote  the  expectation  value  and  the  square  brackets  denote  the 
commutator.  Setting  an  initial  time  to  0  and  the  final  time  to  f,  we  can  write 


x~(t)  =  (2»)  * [(G(/)e(o)>  -  (e(o)Q(O)],  (3) 

where 

fi(0  =  exp(iHt/H)  Q  exp(-iHt/h)    .  (4) 
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and  where  (•}  denotes  the  average  over  the  initial  state  of  the  system  which  is,  for 
instance,  the  thermal  state  wherein 


with 

Z(/?)=trexp(-/3#),  (6) 


if  the  system  was  in  contact  with  a  thermal  reservoir  at  temperature  T  =  l/^ks  in  the 
period  preceding  the  interaction  with  the  external  field,  Fext(f).  If  the  system  was  in  its 
pure  ground  state  |<&o}  then  the  average  is  over  this  many-body  eigenstate  of  H,  i.e., 

{•}  =  <$o|  •  |*o>.  (7) 

In  this  case,  we  can  write  x"(0  as 

x"(t}  =  (in}-1  K 


(8) 
where  {|$n)}  denote  all  the  many-body  eigenstates  of  the  isolated-system  Hamiltonian, 

=  £„!$„),  (9) 


En}  -  6(fa  -  E0  +  En}} 


with  En  >  EQ. 
Upon  Fourier  transformation, 


where  we  used  the  identity 

— — '——  =  P  \  —  1  —  i  TT  Six]  (11} 

JC-HO+         \xj           •  v 

P  denoting  the  Cauchy  principal ;  value.  Methods  have  been  developed  to  evaluate 
such  expressions  semiclassically  [I].  However,  we  should  recall  that  the  system  is 
many-body  so  that  such  methods  would  require  the  search  for  classical  orbits  of  the 
many-body  system.  Simplification  arises  by  taking  account  of  the  fermionic  character 
of  the  system  which  allows  a  reduction  of  the  problem  to  one-body  Hamiltonian  in 
an  effective  potential  determined  by,  for  instance,  the  time-independent  Hartree-Fock 
method  [6,10,11].  We  shall  make  this  simplifying  assumption  here  and  restrict 
our  system  to  a  set  of  uncoupled  one-body  Hamiltonians.  A  similar  assumption  is 
carried  out  on  the  coupling  operator  for  which  we  assume  the  same  form  as  for  the 
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Hamiltonian, 


1=1 

N 


where  hi  and  g*  are  one-body  operators.  Let  us  denote  the  one-body  eigenstates  as 

h\<j>a)  =  ea\<t>a).  (13) 

When  the  fermionic  many-body  system  is  in  its  ground  state,  all  the  one-body  eigenstates 
are  occupied  up  to  the  Fermi  energy,  ep  so  that  the  energy  of  the  ground  state  is 

£o=-  (14) 


In  any  many-body  excited  state,  ($„},  at  least  one  one-body  level  above  the  Fermi  energy 
is  occupied.  We  may  denote  such  excited  states  by  the  list  of  the  occupied  one-body 
levels,  or  equivalently,  by  the  list  of  the  one-body  levels  for  which  the  occupation  is 
different  with  respect  to  the  ground  state  |$0),  having  in  mind  that  such  states  are 
antisymmetric  for  an  exchange  of  two  fermions: 


...ll.   00..  .0} 

fp 
\3>n}  =  Jill...  1110:   0100...}.  (15) 

For  operators  which  are  sums  of  one-body  operators  as  assumed  in  (16),  we  obtain 
the  result  that  the  matrix  elements  ($o|G|$n)  are  non-  vanishing  only  for  the  states 
<!>„  which  differ  from  the  ground  state  $0  by  one  excitation.  These  states  have  a  hole  in 
state  a  (ea  <  Q?)  and  a  particle  in  state  b  (e&  >  ep).  For  these  states  the  matrix  elements 
are  thus 


.  (16) 

Moreover  the  energy  of  the  excited  state  is 

En=EQ-€a  +  eb  (17) 

under  the  assumption  that  ea  <  e/?  <  e^.  After  some  standard  manipulations,  we  can  write 
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where  the  Heaviside  step  function,  6(-)  takes  care  of  the  aforementioned  restriction  on 
the  location  of  the  one-body  states  4>a  and  fo  with  respect  to  ep. 

In  the  presence  of  a  thermal  reservoir  at  temperature  T[=  \/(j3kB}}  described  with 
canonical  density  matrix,  we  get  that 

=  itnr*V*/*Ge 

Z-i 


|.        (19) 
Thus  the  dynamical  susceptibility  is 

*»  = 


Similar  assumptions  as  before  enable  us  to  reduce  this  expression  to  the  one-body 
system.  Now  the  probability  to  find  a  state  \$m)  in  which  \<f>a)  is  occupied  is  given  by  the 
Fermi-Dirac  distribution  at  energy  ea.  But  )<&„)  is  related  to  |$m}  by  the  fact  that  Ifo)  must 
be  unoccupied  (see  above).  So,  we  have  a  joint  probability  that  \<j>a)  is  occupied  and  \cf>b) 
is  unoccupied,  hence  the  propagator  is 


nFD  _  nFD 


where 

rf 
Pa 


denotes  the  Fermi-Dirac  probability  or  mean  occupation  number,  fi  is  the  chemical 
potential  fixed  by  the  total  number  of  fermions  in  the  system.  Equation  (21)  gives 
therefore  the  Fourier  transform  of  the  two-time  correlation  function. 

At  zero  temperature,  the  Fermi-Dirac  distribution  becomes  a  Heaviside  step  function, 
so  the  expression  (21)  becomes 


i  ffrtAS  \~Q          I          ' 

which  is  identical  with  the  expression  previously  derived.  In  the  foregoing  discussion  of 
this  section,  we  have  started  from  the  time  correlation  function  (actually  the  commutator) 
and  written  an  expression  for  the  response.  Now  that  the  susceptibility  has  been  reduced 
to  a  1-body  expression,  we  can  treat  the  1-body  system  semiclassically  in  terms  of 
periodic  orbits  in  the  effective  Hartree-Fock  Hamiltonian  h. 

It  is  well  to  recall  that  x"(w)  has  a  physical  interpretation  in  terms  of  energy 
dissipation  which  is  related  to  the  fluctuations  in  the  frequency  spectrum  obtained  as  the 
Fourier  transform  of  the  time  correlation  function.  Indeed  the  fluctuation-dissipation 
theorem  [13]  lies  just  in  realizing  that  connection. 
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Employing  the  identity  (1 1)  in  (21),  and  expanding  the  difference  between  pFD's  in  the 
resulting  expression,  we  obtain 

|2  [pFD(ea]  ~PFD(ea 


•       a,b      ^ 

fnM,  (24) 


where  we  introduced  the  expression 

(25) 


which  is  the  Fourier  transform  of  the  time  correlation  functions 

pfijfD 

fn(t)  =  tr^-  (fc)ewk/»$e-afc/'l$.  (26) 

oe" 

Moreover,  we  can  rewrite  these  time  correlation  functions  in  terms  of  a  single  time 
correlation  function  as  follows 


(27) 
j          c/e- 

with 

and        q(t)  -  c&l*  qeT&l* .  (28) 


In  summary,  we  have  so  far  reduced  the  many-body  dynamical  susceptibility  to  an 
expression  involving  the  time  correlation  function  Ce(t)  of  the  1-body  effective  dynamics: 


We  will  see  in  the  next  section  that  this  expression  is  particularly  suited  for  developing 
semiclassical  expansions. 

3.  Semiclassical  expressions  of  response 

In  many  cases,  fermions  occupy  the  one-body  levels  up  to  high  quantum  numbers. 
Accordingly,  a  semiclassical  method  (like  for  instance  WKB)  turns  out  to  be  justified, 
in  which  the  quantum-mechanical  quantities  are  obtained  as  asymptotic  expansions  in 
terms  of  classical  mechanics.  However,  progress  in  the  last  decades  has  shown  that 
chaotic  motion  is  a  typical  feature  of  classical  mechanics  so  that  an  exponential 
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proliferation  of  periodic  orbits  is  to  be  expected.  The  classical  dynamics  in  the  three- 
dimensional  one-body  effective  potential  of  the  fermionic  system  will  in  general 
present  chaotic  motion  because  chaos  is  possible  in  systems  with  more  than  two 
degrees  of  freedom.  Chaotic  motion  is  often  mixed  with  quasiperiodic  motion, 
specially,  at  low  energies.  At  higher  energies,  chaotic  motion  becomes  dominant  in 
many  systems,  which  justifies  the  assumption  used  in  the  following  that  all  the  periodic 
orbits  are  unstable  and  isolated  like  in  fully  developed  chaos.  It  is  under  this  assumption 
of  a  chaotic  dynamical  scenario  that  Gutzwiller  derived  his  famous  semiclassical  trace 
formula  [12].  Our  purpose  is  now  to  apply  Gutzwiller' s  method  in  order  to  obtain  the 
response  function  semiclassically. 

To  this  end,  we  need  the  semiclassical  expression  for  the  time  correlation  function 
Q(f),  which  we  consider  in  the  form: 

Ct(t}  =  ti6(e-h)X  =  ti6(e-h)A  +  itr8(e-h)B,  (30) 

where  X  =  X(t)  =  q(t}q(0)  is  a  one-body  operator.  This  operator  is  non-hermitian  but 
can  be  decomposed  as  X  =  A  +  iB  in  terms  of  two  Hermitian  operators:  A  —  (X  +  X^}/2 

andJB=(X-Xt)/(2i). 

Methods  have  been  obtained  to  evaluate  such  expressions  semiclassically  in  terms 
of  periodic  orbits  [1, 15].  To  motivate  this  method,  we  begin  by  observing  [16]  that  such 
an  expression  involves  the  matrix  elements  of  A  (and  B)  over  the  eigenstates  of  h.  These 
matrix  elements  can  be  obtained  at  the  first-order  perturbation  theory  of  a  perturbed 
Hamiltonian  h(\]  =h  +  AA.  Assuming  the  eigenvalue  problem  for  h(X)  to  be  solved,  the 
matrix  elements  of  A  may  thus  be  obtained  in  terms  of  derivatives  of  eigenvalues  of  the 
perturbed  Hamiltonian,  en(A),  with  respect  to  A.  The  periodic-orbit  theory  by  Gutzwiller 
can  then  be  used  to  calculate  the  diagonal  matrix  elements. 

Each  term  of  the  correlation  function  Ce(t)  can  therefore  be  expressed  as 


7T 


-  h  +  zO+ 

(31) 


A=0 


Comparing  with  the  following  identity 

1        / 


§  trlog(e  -  h  ~  AA  +  iO+)  =  tr£(e  -  h  -  AA) 
TT  oe 


(32) 


which  is  the  derivative  of  the  staircase  function  N(e\  A)  with  respect  to  the  energy,  we 
arrive  at  the  relation, 

,  (33) 


which  gives  each  term  of  the  correlation  function  as  the  derivative  of  the  staircase 


periodic  orbits  are  unstable  and  isolated: 


(34) 
,    „,...    |det[m;(A)-/]r  '" 

where  m^(A)  is  the  monodromy  matrix  governing  the  stability  of  the  classical  periodic 
trajectory,  p;  vp  is  the  Maslov  index  of  the  trajectory  and  hw  is  the  Weyl-Wigner 
transform  of  the  Hamiltonian  h(X]  =  h  +  AA.  The  Gutzwiller-type  expression  (34) 
contains  different  expansions.  First  of  all,  we  find  series  in  powers  of  h  (such  as  the  Weyl 
corrections  to  the  average  staircase  function).  These  series,  which  also  appear  in  the 
WKB  method,  are  known  to  be  asymptotic  series  but  with  a  controllable  rest  in  general. 
The  properties  of  these  series  in  H  have  been  discussed  in  the  literature  [19-22].  On  the 
other  hand,  several  rigorous  results  exist  about  the  convergence  of  the  sum  over  periodic 
orbits  [19,20,22].  Different  regularizations  have  been  studied  which  are  based  on 
Lorentzian  smoothing  (i.e.,  on  the  complexification  of  energy  [23])  or  on  Gaussian 
smoothing  [24, 25]  and  which  yield  results  with  controllable  error  bounds. 

We  notice  that  the  above  periodic  orbits  p  in  equation  (34)  are  those  of  the  perturbed 
Hamiltonian.  We  assume  here  that  the  periodic  orbits  of  hw(\]  deform  continuously  to 
the  periodic  orbits  of  hw(X  =  0)  for  A  small  enough.  For  the  purpose  of  differentiating 
this  expression  with  respect  to  A,  we  use  the  following  classical  formula  which  gives  the 
derivative  of  the  action  of  the  periodic  orbits  [26], 


*.  f  — »         Q^         —         Y  ••"••' *Wj  \33) 

where  the  integrals  go  around  the  periodic  orbit,  />,  and  Aw  is  the  Weyl-Wigner  transform 
of  the  operator  A.  Taking  the  derivative  with  respect  to  A  and  adding  both  terms 
composing  the  correlation  function,  we  finally  obtain 


-  h}X  = 


1        |^5p-rf.)    f 

^Vs'l'Mm-/)!1/2  /, 


where  hd  =  hw(X  =  0)  and  the  periodic  orbits  are  those  of  frc/. 
For  the  case  like  ours,  when 


(37) 

$&• 

the  Weyl-Wigner  tranform  can  be  written  as 
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=  [exp(— £>dt)q.w]qw  +  (Weyl  corrections) 

;0)  +  (Weyl  corrections),  (38) 


icre  £c/  =  {/z,  •}  is  the  classical  evolution  operator,  and  the  operator  A  is  standard  [27] 
the  Weyl-Wigner  expansions  defined  as 

A-l.1-1.1  (39) 

dp  dx     dx  dp'  {    } 

i  qw(t)  =  qw[&(x,  p)]  is  the  value  of  qw  at  the  current  point  <3>'(x,  p)  of  the  trajectory 
im  the  initial  condition  (x,p)  of  the  Hamiltonian  flow  $'  of  hci. 
Putting  all  the  pieces  together,  we  can  now  write  the  semiclassical  expansion  for  the 
rrelation  function  as 


=  J 


+  (Weyl  corrections) 


5  now  go  back  to  equation  (29)  where  we  see  that  we  still  need  to  evaluate  the  Fourier 
nsform  of  the  correlation  function  Ce(i). 

Let  us  evaluate  the  first  term  with  n  =  1  of  the  susceptibility.  This  should  give  the 
minant  term  of  the  Weyl  series  in  powers  of  the  Planck  constant  H  as  well  as  a  sum  over 
riodic  orbits  with  the  smallest  power  in  H  which  gives  therefore  the  most  important 
ntribution.  The  other  periodic-orbit  sums  would  involve  smaller  amplitudes  with  higher 
wers  in  H.  Replacing  with  the  previous  semiclassical  result,  we  get 


x  (    drqw(r}qw(r  +  t}  +  O(H°).  (41) 

JP 

isuming  that  q  =  $  is  a  Hermitian  operator,  its  Weyl-Wigner  transform  is  a  real 
action  of  (x,  p).  Along  a  periodic  orbit,  we  have  the  Fourier  series 


.  . 
4w(r)  =    >     qp,n  exp    i—  -r    ,  (42) 

n=-oo  \     1P      / 


where  Tp  is  the  classical  period  and  qp^n  =  q*p^n.  We  obtain  that 

//•  +00  r  9/717) 

dt  #*    6  drqw(r]  qw(r  +  /)  =  27T  Tp(e)  V  |<?p,n(e)|2  S  L  -  —  - 
^  n=^oo  L  1P\f- 

(43) 

Except  in  special  systems  like  the  harmonic  oscillators,  the  periods  Tp(e)  vary  with  the 
energy  e.  Because  of  the  presence  of  the  Dirac  distribution,  we  can  perform  the  last 
integral  over  e  by  using 

*-*"™'  (44) 


where  e  =  ep,n(v}  is  the  energy  obtained  by  solving  the  implicit  equation  Tp(e)  =  2-!m/u 
(when  this  solution  exists).  We  here  assumed  that  the  periods  vary  monotonically  with 
energy,  otherwise  we  find  a  sum  of  terms  instead  of  a  single  term  in  the  right-hand 
member  of  the  previous  formula.  Noting  that 


T 

'  (45) 


we  have  that 


t-tPM},  (46) 

where  Sp>n  =  5p[eplJI(u;)]. 

Substituting  in  the  previous  relation,  we  finally  obtain  the  absorptive  part  of  the 
dynamical  susceptibility  as: 


p,r,n 


,1/2 


where  all  the  expressions  with  indices  p,n  are  functions  of  the  frequency  u;  through 
their  dependency  on  the  energy  €  =  €ptn(u).  Equation  (47)  is  the  central  result  of  this 
work. 

The  formula  (47)  has  delicate  convergence  properties  which  are  similar  to  those  of 
Gutzwiller  trace  formula,  as  discussed  above.  Such  periodic-orbit  and  H  expansions  have 
already  been  applied  to  several  systems  where  reliable  and  also  high-precision 
calculations  can  be  carried  out  with  appropriate  truncations  [1,2,15,17,26,28].  To 
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interpret  the  result  (47),  let  us  go  back  to  the  time  correlation  function  C6(t)  which  is 
—  h]  e,xp(iht/ti)qex.p(—iht/n)q 

ier'h  tr  e^'-T>/*  q  e-'*f/n  q.  (48) 


=  —  j 


In  this  expression,  the  trace  can  be  rewritten  in  terms  of  the  Green  propagators  as 
trexp[w(f  —  T)/7j]#exp[—  iht/H]q 

=   I  dx  dx'G(x',  x;  r  -  t)q(x)G(x,  x';  t)q(x') 

dxdx'Ac(x',  x;  r  -  t]  exp[*Wc(x',  x;  r  - 


=  ]P  / 

'  J 


c,c' 

x  A^xX;  0  expfW^xX;  0/%0«0»  (49) 


where  in  the  last  step,  we  have  written  down  symbolically  the  semiclassical  expressions 
for  the  Green  propagators.  We  have  here  assumed  that  the  operator  q  depends  only  on  the 
position  operator,  i.e.,  q  =  q(x).  The  sum  in  the  above  expressions  is  over  classical  paths, 
c  and  c'.  Notice  that  these  are  two  different  paths  in  general  as  they  arise  from  two 
different  propagators.  The  expression  interprets  for  itself  —  from  the  point  x  to  x',  we 
propagate  along  the  path  c'  for  time  t,  and  then  we  propagate  back  to  the  point  x  along  c 
in  time  T  —  t.  Precisely, 


dtL 


f 

Jo 


(50) 


L  being  the  Lagrangian  of  the  system.  The  total  phase  in  the  correlation  function  is  the 
sum  Wc  -f-  Wc>  of  the  two  actions.  By  stationary  phase  method,  it  can  be  shown  that  the 
two  segments  of  the  paths  c  and  c',  in  fact,  form  a  periodic  orbit  of  period  r.  This  is  the 
mechanism  by  which  periodic  orbit  corrections  appear  in  the  main  result  above. 

Classical  contribution  comes  from  the  paths  of  zero  length,  i.e.,  r  =  0,  and  this  is 
precisely  the  leading  term  in  our  expression,  also  called  the  quasiclassical  term.  Thus,  in 
any  calculation  of  interest,  we  need  to  find  the  classical  correlation  function,  the  Weyl 
corrections,  and  then  the  periodic-orbit  corrections.  If  the  energy  spectrum  of  the  system 
is  discrete,  we  notice  that  the  dynamical  susceptibility  (24)  is  a  sum  of  Dirac  peaks 
centered  on  each  of  the  Bohr  frequencies  o>  =  (Em  —  En)/H.  In  the  semiclassical  formula 
(47)  the  first  term  due  to  the  paths  of  zero  length  contribute  to  the  dynamical 
susceptibility  by  a  continuous  function  of  frequency  u)  which  may  be  interpreted  as  the 
average  or  background  of  the  discrete  quantum  susceptibility.  The  second  series  of  terms 
of  (47)  are  due  to  the  periodic  orbits  which  contribute  by  oscillatory  functions  of  the 
frequency  u.  Their  sum  is  supposed  to  reproduce  the  Dirac  peaks  of  the  discrete  quantum 
susceptibility  %"(w)- 

We  notice  that  in  contrast  with  the  well-known  Gutzwiller  formula  for  the  level 
density,  the  quasiclassical  term  here  involves  a  classical  time  correlation  function.  As  a 
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consequence,  the  spectral  properties  of  the  classical  Liouvillian  £c/  will  intervene  in  the 
evaluation  of  this  term  (as  well  as  of  the  following  Weyl  corrections).  In  this  regard,  we 
mention  that  a  periodic-orbit  theory  of  the  classical  Liouvillian  has  been  developed 
[17, 18]  in  terms  of  classical  zeta  functions  which  resemble  (but  differ  from)  the  Selberg- 
Gutzwiller  quantum  zeta  function.  Consequently,  we  have  here  an  example  where  the 
semiclassical  evaluation  of  a  quantum  property,  namely  x"(w)»  involves  the  spectral 
properties  of  the  classical  Liouvillian  along  with  the  Gutzwiller  quantum  periodic-orbit 
sum.  It  is  remarkable  that  both  these  classical  and  semiclassical  terms  coexist  within  the 
same  formula,  which  points  to  the  result  that  the  classical  periodic-orbit  theory  is  only 
one  part  of  the  underlying  quantum  dynamics.  We  believe  that  this  result  is  general  in  the 
semiclassical  evaluation  of  dynamical  susceptibilities,  which  include,  in  particular,  the  ac 
conductivity. 

It  turns  out  that  the  spectral  properties  of  the  Liouvillian  differ  considerably  whether 
the  classical  system  hc\  is  integrable  or  chaotic. 

For  integrable  systems,  the  Liouvillian  spectrum  is  formed  by  discrete  classical 
frequencies,  £cj-0m  =  i&mipm,  on  each  ergodic  component  which  are  the  invariant  tori. 
The  expression  would  then  be  similar  to  the  one  obtained  in  [7].  The  periodic-orbit 
corrections  should  be  given  by  the  Berry-Tabor  semiclassical  theory  under  such 
circumstances  [29].  Let  us  mention  that  a  trace  formula  has  been  recently  derived  for  the 
case  where  an  integrable  system  is  deformed  so  that  a  continuous  symmetry  becomes 
approximate  [30]. 

For  chaotic  systems,  the  classical  time  correlation  function  in  the  quasiclassical  term 
decays  if  the  system  is  mixing  on  the  energy  shell.  Recent  works  have  shown  that  this 
decay  may  be  developed  in  terms  of  the  complex  singularities  of  an  analogue  of  the 
Liouvillian  resolvent  [17,18,31].  When  the  complex  singularities  are  poles  they  are 
called  Pollicott-Ruelle  resonances  and  are  associated  with  exponential  decays.  If  the 
spectral  decomposition  of  the  Liouvillian  dynamics  in  terms  of  the  Pollicott-Ruelle 
resonances  is  known  the  quasiclassical  term  can  be  further  reduced  as  a  sum  over  the 
Pollicott-Ruelle  resonances  over  each  energy  shell  [17, 18,31]. 


4.  Concluding  remarks 

We  have  described  a  systematic  procedure  which  lets  us  write  important  quantities  in 
many-body  theory  in  a  semiclassical  expansion.  All  our  considerations  are  restricted  to 
many-body  systems  reducible  to  uncoupled  one-body  systems.  We  believe  that  it  is  an 
interesting  first  step.  We  have  shown  the  central  role  played  by  the  time  correlation 
functions.  On  the  one  hand,  they  are  related  by  a  Fourier  transform  to  the  response 
function;  on  the  other  hand,  they  facilitate  useful  expansions  in  terms  of  H.  The  fact  that 
the  fermionic  nature  of  the  particles  has  been  taken  into  account  is  an  important  aspect.  In 
contrast  to  a  recent  attempt  along  similar  lines  [7]  where  complete  integrability  was 
assumed,  we  have  here  assumed  fully  developed  chaos.  Moreover,  in  the  analysis  of  [7], 
the  Berry-Tabor  formula  for  periodic-orbit  corrections  has  been  used  which  fails  for  the 
interesting  case  of  the  harmonic  oscillators.  Since  we  now  know  that  the  Gutzwiller  trace 
formula  is  exact  for  harmonic  oscillators  [2],  and  is  also  applicable  for  chaotic  systems, 
we  believe  that  our  analysis  may  be  extended  to  more  general  systems.  As  mentioned 
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earlier,  we  draw  attention  of  the  reader  to  [30]  where  an  expression  for  trace  formula 
applicable  to  mixed  phase  space  dynamics  as  occurring  in  the  case  of  nearly  integrable 
systems  [32]  is  found. 

There  are  many  applications  of  such  a  formalism.  In  any  discussion  of  relaxation 
phenomena,  time  correlation  functions  play  a.n  important  role.  To  understand 
photoabsorption  cross-section  of  atoms  and  molecules,  semiclassical  treatment  along 
these  lines  has  been  used  [1, 28].  In  metallic  clusters  and  quantum  dots,  there  is  a  growing 
interest  in  the  semiclassical  treatment  of  the  absorption  spectra  and  cross-sections.  The 
understanding  of  the  plasmon  modes  in  different  metallic  clusters,  their  splitting  etc.  are 
believed  to  have  a  semiclassical  interpretation.  However,  for  a  systematic  treatment  of  the 
collective  modes,  we  need  to  consider  two-body  interactions,  thus  a  Hamiltonian, 

,  (51) 

where  vy  denotes  the  two-body  term.  What  we  need  to  do  here  is  a  perturbative  treatment 
of  the  two-body  interaction  by  assuming  that  A  <C  1  which,  in  turn,  would  give  rise  to 
collective  modes. 

A  comment  is  here  in  order  that  the  classical  chaos,  which  may  exist  in  the  one-body 
effective  potential  obtained  by  the  Hartree-Fock  approximation,  is  only  a  property 
emerging  out  of  the  wave  mechanics  in  the  classical  limit  where  the  wavelengths  are 
arbitrarily  small.  In  this  work,  however,  we  use  the  classical  periodic  orbits  in  order  to 
derive  useful  expansions  of  the  quantum-mechanical  quantities,  which  apply  in  the 
semiclassical  regimes,  i.e.  before  the  classical  limit  is  reached.  The  situation  is  similar  as 
in  other  perturbative  methods  where  a  property  valid  for  the  (unperturbed)  limiting 
system  does  not  necessarily  extend  to  the  (perturbed)  typical  systems,  and  this  without 
jeopardizing  the  validity  of  the  perturbation  method. 

As  a  last  comment,  let  us  mention  that,  in  a  random-matrix  framework,  one  can  also 
study  the  correlation  functions,  and  it  has  been  recently  shown  that  the  time  correlations 
depend  upon  the  co-dimension  of  level  crossing  [33].  A  comprehensive  understanding  of 
all  the  inter-relations  will  be  one  of  the  aims  we  look  forward  to. 
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Abstract.  Using  quantum  maps,  we  study  the  accuracy  of  semiclassical  trace  formulas.  The  role 
of  chaos  in  improving  the  semiclassical  accuracy  in  some  systems  is  demonstrated  quantitatively. 
However,  our  study  of  the  standard  map  cautions  that  this  may  not  be  most  general.  While  studying 
a  sawtooth  map  we  demonstrate  the  rather  remarkable  fact  that  at  the  level  of  the  time  one  trace 
even  in  the  presence  of  fixed  points  on  singularities  the  trace  formula  may  be  exact,  and  in  any  case 
has  no  logarithmic  divergences  observed  for  the  quantum  bakers  map.  As  a  byproduct  we  introduce 
fantastic  periodic  curves  akin  to  curlicues. 

Keywords.    Quantum  chaos;  quantum  maps;  Gutzwiller's  periodic  orbit  sum. 
PACS  No.    03.65 

1.  Introduction 

The  trace  formula  of  Gutzwiller  [1]  provides  a  window  into  the  semiclassical  treatment  of 
chaotic  systems  and  is  hence  of  considerable  importance  in  many  areas  of  physics  and 
chemistry.  Yet  the  understanding  of  this  formula  has  been  hampered  by  the  complexity  of 
the  classical  task  involved,  and  often  the  semiclassical  effort  is  greater  than  that  for  a  full 
quantum  solution  which  makes  the  semiclassical  methods  either  only  of  academic  interest 
or  as  tools  for  explaining  features  of  a  full  quantum  solution.  In  this  situation  simple 
models  have  played  a  key  role  just  as  has  been  the  case  with  the  understanding  of 
classical  chaos,  as  evidenced  for  example  by  the  study  of  the  logistic  map  and  the  smale 
horseshoe. 

Although  the  trace  formula  (and  its  relatives)  are  important,  the  accuracy  of  these  have 
not  been  rigorously  established.  The  trace  formula  being  the  first  term  in  an  asymptotic 
expansion,  has  many  approximations  with  apparently  uncontrolled  errors.  The  very  many 
applications  of  the  stationary  phase  approximation,  on  functional  and  on  ordinary 
integrals  obscures  the  nature  of  the  errors.  One  ongoing  issue  is  the  time  at  which  we  can 
expect  the  various  semiclassical  approximations  to  go  bad  with  a  candidate  the  so  called 
log  time  which  is  of  the  order  of  —  log(/i)/A,  applicable  for  chaotic  systems  with  a 
Lyapunov  exponent  A,  being  called  too  pessimistic  [2].  We  note  that  although  the 
semiclassical  formula  for  the  time  evolution  (due  to  Van  Vleck)  has  been  essentially 
known  almost  since  the  beginning  of  quantum  mechanics  the  nature  of  the  errors  is  still  a 
matter  of  interest,  especially  since  the  trace  formula  depends  on  a  Laplace  transform  of 
the  Van  Vleck  approximation  [1]. 

The  quantum  bakers  map  has  been  studied  as  a  model  of  quantum  chaos  as  indeed  it 


divergences.  In  fact  we  will  show  that  in  certain  circumstances  the  semiclassical  trace  can 
be  exactly  the  quantum  trace. 

In  pursuance  of  certain  results  in  [5],  we  use  the  existing  models  of  quantum  chaos  to 
study  the  accuracy  of  the  simplest  trace  of  the  quantum  problem.  To  this  end  we  will  in 
particular  use  the  much  studied  quantum  standard  map  and  the  much  less  studied 
quantum  sawtooth  map  both  in  the  settings  of  the  phase  space  as  a  cylinder  and  as  a  torus. 
These  being  maps  derived  from  time  periodic  systems  we  study  simply  the  trace  of  the 
relevant  unitary  Floquet  operators.  In  §  2  the  cylindrical  phase  space  is  used,  while  in  §  3 
the  toral  phase  space  is  used,  we  end  with  some  comments  in  §  4.  In  the  Appendix  we 
provide  a  self  contained  derivation  of  the  "trace  formula"  for  a  class  of  maps  on  the  torus 
that  includes  the  maps  studied  below  and  all  the  so  called  kicked  systems.  While  this  is  in 
principle  known,  it  may  be  useful  to  recollect  it  here,  as  the  other  published  proof  of 
Tabor  [17]  is  for  the  case  of  maps  on  the  entire  plane,  and  is  hence  quite  different.  Those 
with  little  exposure  to  the  trace  formula  may  peruse  the  Appendix  first. 


2.  The  cylindrical  phase  space 

Consider  the  well  known  standard  map  (qn+i  =  qn  +  Tpn,pn+i  =  pn 
defined  on  the  cylinder  [0,2?r)  x  (-00,  oo)  with  periodic  boundary  conditions  in  q.  The 
first  iterative  equation  is  then  to  be  evaluated  with  a  modulo  2?r  rule.  A  simple  change  of 
variables  from  p  to  Tp  implies  that  the  relevant  parameter  is  just  one  in  number  and  is 
k  =  KT.  This  area  preserving  map  and  its  quantization  have  been  studied  in  extensive 
detail  as  a  model  of  chaos  and  its  quantum  manifestations  and  led  to  such  important 
discoveries  as  the  quantum  suppression  of  chaotic  diffusion.  Some  of  these  are  reviewed 
in  [7].  However  its  use  in  understanding  the  Gutzwiller  trace  formula  has  been  limited. 
The  work  of  Tabor  [17]  where  he  derived  the  relevant  trace  formula  for  quantum  maps  on 
the  plane  did  use  the  standard  map  but  no  rigorous  or  extensive  study  has  yet  been 
attempted.  The  quantization  of  the  standard  map  is  described  by  an  infinite  dimensional 
unitary  matrix  with  matrix  elements  in  the  momentum  representation  being 

Um  =  (-i}n~mJn-m(k/TK)  exp(-im2HT/2}.  (1) 

Here  Jv(z)  is  a  Bessel  function.  As  has  been  noted  earlier  there  are  two  relevant 
parameters  quantum  mechanically  and  we  can  take  T  =  1  while  we  do  not  have  the 
freedom  to  choose  H  =  1  anymore.  The  trace  of  the  quantum  map  is  then  given  by 


Tr(J7)  =  J0(k/H)  £  e-'7*"2/2  =  /0(jfc/*)e3(0|  -  »/27r),  (2) 

n=—  oo 

where  63  is  a  Jacobi  theta  function.  An  application  of  the  Poisson  formula  enables  us  to 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  H) 
518  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


Strictly  speaking  this  sum  need  not  exist  and  may  without  further  regularization  be 
infinite.  This  is  not  due  to  any  peculiarities  of  maps,  and  in  fact  this  is  the  rule  in  quantum 
mechanics,  the  trace  of  the  propagator  could  be  infinite  in  real  time.  However  this  will  not 
worry  us  further  as  the  semiclassical  trace  will  also  diverge  exactly,  allowing  us  to  make 
meaningful  relative  error  estimates. 

Now  we  turn  to  a  semiclassical  evaluation  of  the  quantum  trace.  We  wish  to  then  write 
the  sum  in  terms  of  the  period  one  or  fixed  points  of  the  classical  map.  The  Lagrangian 
generating  the  standard  map  on  the  cylinder  is 


L(qn,qn+\,ln)  =  (qn+i  -qn+  27r/,I)2/2  -  kcos(qn+l)  (4) 

as  dL/dqn  =  —pn  and  dL/dqn+\  =  pn+\-  ln  is  the  winding  number  in  the  q  direction  or 
2irln  is  subtracted  by  the  modulo  operation  performed  at  time  n  to  keep  the  map  on  the 
cylinder.  The  fixed  points  are  determined  by  the  condition  (qn+\  =  3n,Pn+]  —  Pn},  and 
are  therefore  simply  given  by  (q  —  0,p  =  2?r/)  and  (q  =  ir,p  =  2?r/),  where  /  runs  over 
the  integers.  We  note  that  the  number  of  fixed  points  are  infinite  and  that  their  positions 
do  not  depend  on  the  classical  chaos  parameter  k.  The  denominator  in  the  Gutzwiller 
trace  formula  (or  its  versions  for  the  case  of  quantum  maps,  they  are  essentially  same)  has 
the  factor  -^/Tr(J]  —  2,  where  J  is  the  classical  stability  matrix  associated  with  the 
periodic  orbit.  In  the  case  of  standard  map  fixed  points  this  is  either  \fk  or  A/^fe 
depending  on  whether  q  =  0  or  q  =  IT. 
The  semiclassical  approximation  is  of  the  form 


(J)--2),  (5) 

fixed  points 

where  S  is  the  classical  action  and  since  we  are  dealing  with  discrete  time  systems  with 
the  time  interval  being  taken  as  unity  this  is  identical  to  the  Lagrangian  written  above.  We 
get  for  the  standard  map  on  the  cylinder  the  following  semiclassical  approximation  for 
the  trace  (see  Appendix): 


«)•  (6) 

/=-00 

This  then  has  to  be  compared  with  (3).  The  asymptotic  form  of  JQ(X)  for  large  arguments 
is  [8] 

f2\    /   A         i   TT\ 

Mx)  ~  \  —    cos  he  -  -    P(x)  -  sin  he  -  -    Q(x) 
V™L        \       4J  \       4J        . 

where 

P(x)  =  1  -  9/(128jc2)  +  •  •  •     and   Q(x)  =  -l/(8x)  +  •  •  • , 
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wnere  me  negieciea  terms  are  at  must  ui  oruer  x     .  we  men  uenvc  mat 

(7) 


Even  this  simple  result  is  instructive.  It  indicates  spectacular  failures  of  the  trace 
formula  when  the  ratio  k/H  passes  through  ?r/4  +  (2m  +  l)?r/2,  where  m  is  a  positive 
integer.  At  these  points  the  semiclassical  trace  becomes  zero  and  the  exact  quantum  trace 
is  also  small  indicating  that  such  a  failure  will  not  be  serious  for  the  evaluation  of  the 
spectra.  But  it  does  suggest  that  the  ratio  of  the  quantum  to  the  semiclassical  need  not  be 
bounded  in  the  entire  range  of  the  parameters.  This  divergence  of  the  ratio  is  thus  not  due 
to  any  coalescing  stationary  points  or  bifurcations  in  general;  as  we  have  already  noted 
the  fixed  points  are  independent  of  the  chaos  parameter  k.  Second,  it  is  clear  that  for  a 
larger  k  or  chaos  the  semiclassical  trace  is  going  to  be  a  better  approximation.  The  authors 
in  [2]  have  observed  this,  and  we  see  that  the  simplest  quantum  trace  provides  a 
quantitative  evidence  of  this.  To  compare  the  absolute  error  rather  than  the  relative,  we 
have  to  deal  with  the  Jacobi  theta  function  which  occurs  identically  both  in  the  quantum 
and  in  the  semiclassical  traces,  and  as  this  can  depend  critically  on  the  rationality  of  ir/H 
we  do  not  pursue  this  here,  except  note  that  in  unbounded  phase  spaces  such  as  the  plane 
and  the  cylinder  the  absolute  value  of  the  traces  might  actually  diverge. 

The  case  of  the  standard  map  has  been  extensively  studied  but  not  so  the  quantum 
mechanics  of  the  sawtooth  map,  which  is  in  many  ways  simpler  because  it  is  a  piecewise 
linear  map  for  which  it  is  easy  to  extract  classical  information.  Consider  the  map 
(qn+]  =qn  +  Tpn,pn+i  =  pn-T-  K(qn+\  -  TT))  defined  on  the  cylinder  [0,  2?r)  x  (-00,  oo), 
which  is  completely  chaotic  for  positive  K.  This  can  be  considered  as  a  simple 
harmonically  kicked  particle  in  one  dimension.  We  may  view  the  system  as  an  otherwise 
free  particle  on  a  unit  circle  being  subjected  to  a  kick  whose  strength  is  proportional  to 
the  physical  angle  and  which  has  a  single  discontinuity  when  the  angle  is  zero.  The  trace 
of  the  quantum  propagator  may  be  written  down  just  as  in  the  case  of  the  standard  map 

(T  =  l)with  the  result  that 

I  /  /1F\          00 

Tr([7)  =  -=  Erf    e-*/4  A/|     £  exp^m2  /*),  (8) 

*  \  /   n=-oo 

where  Erf  is  the  error  function. 

The  semiclassical  trace  is  also  found  quite  easily,  because  the  fixed  points  are  at 
(q  =  ir,p  —  27rZ),  I  being  any  integer.  The  classical  action  is  similar  to  that  of  the  standard 
map  with  the  potential  term  being  replaced  by  a  periodicised  quadratic  function.  Thus 
we  get 


ft),  (9) 

and  the  ratio  of  the  quantum  to  semiclassical  simply  becomes 
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Inserting  the  asymptotic  form  for  the  error  function  we  get  the  relative  error  to  be 

1      Tr(^    ~    /2»  nn 

1  ~T~fTT\    ~  \/rF-  v11; 


The  neglected  terms  are  at  most  of  order  (H/K)3'2. 

In  this  map  also  we  see  the  seemingly  general  result  that  the  larger  the  chaos  (larger  K) 
the  semiclassical  accuracy  is  better.  The  precise  way  in  which  this  occurs  for  this  map  is 
like  K~1/2.  The  difference  between  this  map  and  the  standard  map  is  that  the 
semiclassical  approximation  seems  more  well  behaved  and  the  sudden  decline  in  the 
accuracy  in  the  case  of  the  standard  map,  due  to  the  mismatch  between  the  zeros  of  the 
quantum  and  the  semiclassical  is  absent  here.  However  in  the  regions  where  the 
semiclassical  (and  quantum)  traces  are  not  very  close  to  zero,  the  standard  map  would 
perform  better  as  the  errors  are  of  order  H/k  while  for  the  sawtooth  map  the  errors  are  of 

1  7*7 

order  (H/k}  '  .  We  will  see  that  surprisingly  many  of  our  observations  will  remain  valid  in 
the  toral  setting  as  well. 

3.  The  toral  phase  space 

Two  dimensional  maps  on  the  torus  (periodic  boundary  conditions  in  both  position  and 
momentum)  have  been  studied  extensively,  from  the  standard  map  to  the  cat  map  of 
Arnold.  Their  quantizations  have  also  been  investigated  by  many  [3,  7,  9],  as  this  is  a 
legitimate  model  for  a  bound  Hamiltonian  system  with  two  degrees  of  freedom,  the 
minimum  needed  for  chaos.  Unlike  the  case  of  the  cylindrical  phase  space  where 
scattering  may  take  place,  sheer  topology  now  prohibits  this  on  the  torus.  The  act  of 
taking  periodic  boundary  conditions  on  the  momenta  is  reminiscent  of  a  similar 
procedure  in  condensed  matter  physics  which  has  also  provided  an  interesting  two 
dimensional  map,  namely  the  kicked  Harper  model.  Quantum  mechanics  now  is  on  a 
finite  dimensional  Hilbert  space  and  we  take  the  torus  to  be  a  unit  torus  (the  dissected 
square  is  of  unit  sides).  An  alternate  view  of  toral  quantum  maps  is  as  the  case  of 
"resonance"  of  the  cylindrical  map  in  which  case  the  quantum  mechanics  can  be  exactly 
reduced  to  that  of  a  torus.  Position  and  momentum  take  discrete  values  which  we  take  to 
be  n/N,  where  N  is  the  dimension  of  the  Hilbert  space  and  n  =  0, . . .  ,N  —  1.  Here  N  is 
required  to  be  an  even  integer  and  is  the  inverse  of  the  Planck  constant.  Therefore  the 
classical  limit  is  N  — >  oo. 

In  this  section  we  will  first  deal  with  the  sawtooth  map  on  the  torus.  We  will  write  the 
map  a  little  differently  for  convenience  as  (qn+\  —  (qn  +  pn)modl,  qn+i  = 
(pn  +  Kqn+i}modl}.  The  quantum  map  or  propagator  is  given  by 

e-iV/4 

Um  =  —=-exp(iirKm2/N)  «p(i7r(n  -  m)2 /N}.  (12) 

This  then  is  the  quantum  equivalent  of  the  classical  map  restricted  on  a  torus.  It  is  a  finite 
unitary  matrix  as  all  quantum  maps  on  compact  phase  spaces  are.  Clearly  it  is  one  of  the 
simplest  quantum  objects,  having  elementary  functions  as  matrix  elements.  Yet  we  find 
that  it  is  rich  enough  to  reflect  many  essential  features  of  the  effect  of  classical  chaos  on 
quantum  systems.  For  more  properties  of  a  very  similar  quantum  map  please  refer  [10]. 
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me  Classical  i^agrangian  (,or  acuun;  iui  uus  iimp  is 

L(qa,qn+i,ln,mn)  =  (qn+\  -  qn  +  In}2f2-  +  K(qn+l(modl))2/2  -  mnqn+l: 

(13). 

where  the  integers  /„  and  mn  are  winding  numbers  in  the  q  and  p  directions  respectively. 
The  fixed  points  are  given  by  (q  =  m/K,p  =  0)  where  m  =  0,  1,  2,  .  .  .  ,  [K].  The  fixed 
point  at  the  origin  is  in  fact  on  the  line  of  discontinuity,  and  is  quite  like  the  bakers  map 
and  its  quantization.  In  that  case  it  was  noted  that  such  singular  periodic  orbits,  "half 
hyperbolic  ",  could  give  rise  to  unusual  semiclassical  behaviour  and  that  the  quantum  and 
serniclassical  could  diverge  semiclassically  as  log(^~1).  A  bakers  map  was  constructed  to 
explicitly  avoid  this  situation  and  a  recovery  of  standard  expectations  was  realized  [6]. 
However  if  we  take  the  above  quantization  as  valid  (since  it  follows  canonical 
quantization  closely  [11])  it  will  turn  out  that  the  fixed  point  on  the  discontinuity  is 
benign  in  the  case  of  the  sawtooth  map,  in  that  it  does  not  lead  to  semiclassically 
divergent  terms.  This  may  suggest  that  the  unusual  nature  of  the  bakers  map  originates 
either  from  the  unusualness  of  the  very  quantization  procedure  adopted  or  is  peculiar  to 
the  bakers  map. 

Let  us  look  at  this  case  more  closely,  and  take  0  <  K  <  1,  so  that  there  is  a  single  fixed 
point  at  the  origin  which  is  also  a  point  of  discontinuity.  A  direct  application  of  the 
Gutzwiller  formula  will  be  incorrect  in  this  case,  but  the  rectification  is  obvious.  The 
stationary  phase  integral  must  not  extend  over  the  entire  real  line,  but  must  be  restricted 
to  [0,  oo).  Thus  the  semiclassical  approximation  is  then  the  approximation  (quantum  ~ 
serniclassical) 

_nr/4  N-l  , 

(14) 
{     } 


An  estimate  due  to  Wilton  which  we  give  below  bounds  the  difference  as  N  —  >  oo,  but 
this  is  not  a  good  estimate  of  the  errors  (for  0  <  K  <  1),  as  these  are  of  the  order  of  the 
sums  themselves.  An  application  of  the  Euler-Maclaurin  summation  rule  will  give  us  the 
above  approximation  quite  simply.  But  more  important  is  the  observation  that  it  is 
legitimate  to  replace  summations  with  integrals  even  in  the  presence  of  fixed  points 
sitting  on  discontinuities,  a  rule  that  was  flagrantly  violated  in  the  case  of  the  bakers  map. 
Let  the  errors  be  t(K,N]  -  (Tr(L7)  -Tr([7)scj,  that  is  the  absolute  value  of  the 
difference  between  the  right  hand  and  the  left  hand  side  of  the  (14).  We  now  observe 
some  remarkable  number  theoretic  properties  of  these  errors.  First  we  note  the  surprising 
fact  that  if  N  is  a  multiple  of  4,  e(l/2,  N]  =  0.  In  this  case  not  only  does  the  semiclassical 
and  quantum  converge  in  the  classical  limit,  they  are  exactly  the  same  (at  the  level  of  the 
trace),  despite  the  presence  of  the  fixed  point  on  the  discontinuity!  We  have  noted  before 
that  the  main  feature  which  makes  the  sawtooth  map  generic,  in  the  sense  that  the 
semiclassical  is  only  an  approximation,  and  in  the  sense  of  eigenfunctions  scarring  and 
spectral  random  matrix  properties  is  that  the  incomplete  Gauss  sums  that  arise  while 
compounding  propagators  do  not  reduce  to  simple  forms  [10],  unlike  in  the  case  of  the  cat 
maps  (K  an  integer)  where  we  get  complete  Gauss  sums  and  has  been  investigated  by 
Hannay  and  Berry  [9]  and  the  extensive  number  theoretic  properties  were  explored  by 
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Keating  [12].  Thus  it  is  a  bit  interesting  that  an  incomplete  Gauss  sum  can  be  done 
exactly  and  is  the  semiclassical  approximation. 
Consider  the  sum 

Af-l 

aN  =  ^exp(z7m2/2AO,  (15) 

«=o 

which  is  incomplete  because  the  period  of  sn  =  exp(z7r«2/2/V)  is  2N  and  the  sum  is  only 
over  the  first  N  terms.  First  we  note  that  %/2+m  =  ^Ar/2-m  exp(zTrra),  thus  terms  with  odd  n 
do  not  matter  as  they  cancel  out  of  the  sum  exactly.  Let 

2AT-1 

£2A/  =    /^  Sn 
n=0 

which  is  a  complete  Gauss  sum  [9].  But  also  note  that  if  N  is  a  multiple  of  4, 


This  then  implies  that 


therefore 

Tr(£7)  =  e-^V^/V/V  =  1/V5.  (16) 

If  AMs  not  divisible  by  4,  then  e(l/2,N)  is  no  longer  zero  and  the  semiclassical  result  is 
an  approximation.  Thus  there  are  two  distinct  classes  when  K  =  1/2  as  far  as  the 
behaviour  of  the  error  is  concerned,  This  can  be  generalized  and  we  observe  that  if 
K  =  p/q,  is  a  rational  number,  with  relatively  prime  integers  p  and  q,  then  there  are  as 
many  classes  as  there  are  possible  remainders  of  even  A/Y  on  division  by  q.  Within  each 
class  the  error  monotonically  goes  to  zero  in  the  classical  limit,  N  —  ->•  oo.  We  have 
observed  numerically  that  for  large  N  to  an  excellent  approximation 


(17) 

Figure  l(a)  shows  the  case  K  =  2/5,  when  there  are  five  classes.  The  constant  C(K] 
depends  on  both  K  and  the  class,  but  we  have  observed  that  C(K]  —  C(\  -  K}.  If  K  is 
irrational,  the  number  of  classes  is  infinite,  there  is  no  monotonic  decrease  of  error,  but 
the  general  trend  is  to  decrease,  as  is  evident  from  figure  l(b),  where  we  have  taken  K  to 
be  the  most  irrational  number  7  =  (\/5  -  l)/2,  the  golden  mean.  The  number  theoretical 
properties  of  the  error  is  of  course  a  reflection  of  the  sum  itself,  and  a  fuller  study  is  thus 
desirable. 

We  however  shall  now  turn  to  the  case  when  K  is  larger  than  one,  and  there  are  a 
number  of  fixed  points.  If  K  is  much  larger  than  unity  we  need  not  correct  for  the  corner 
fixed  point  as  the  errors  incurred  will  be  of  the  order  K"1/2,  which  is  anyway  the  order  of 
the  total  error  according  to  Hardy,  Littlewood  and  Wilton,  as  quoted  in  [13].  The 
semiclassical  approximation  is  now  described  by 

(K\ 

expHWn2/*).  (18) 

n=0 
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Figure  1.    The  error  as  a  function  of  inverse  Planck's  constant  N,  for  the  sawtooth 
map  with  (a)  K  =  2/5  (b)  Jf  =  7. 
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Here  [K]  is  the  integer  value  of  K.  Note  that  this  approximation  clearly  demonstrates  the 
conjugacy  between  the  Planck  constant  (1/N)  and  the  chaos  parameter  K.  It  is  a  good 
approximation  if  K  is  fixed  and  N  tends  to  infinity,  namely  the  classical  limit,  but  also  in 
the  opposite  limit  of  extreme  "chaos",  and  small  quantum  system.  For  example  when  K 
tends  to  infinity  for  N  =  2,  a  two  level  system.  Undoubtedly,  this  approach  to  two  level 
systems  is  to  say  the  least  inadvisable,  as  the  quantum  problem  is  much  easier  to  solve. 
Nevertheless  it  throws  up  some  interesting  questions,  because  what  would  one  mean  by 
extreme  chaos  in  a  two  level  system?  Quantum  chaos  for  legitimacy  in  using  the  epithet 
chaos  has  always  relied  on  the  classical  limit,  and  in  the  extreme  quantum  limit  the 
concept  becomes  nebulous. 

The  above  approximation  is  derived  from  the  Gutzwiller  formula,  using  the  [K]  -f  1 
fixed  point  actions  and  the  stability  matrix.  The  fixed  point  at  the  origin  is  included 
wrongly  in  the  formula  and  we  have  discussed  it  separately  in  the  case  0  <  K  <  I  above. 
Now  we  use  an  estimate  of  Wilton,  quoted  in  [13],  to  bound  the  error  in  this 
approximation.  This  estimate  is  the  inequality 


|Tr(t/)-Tr(t/)s 


VN 


(19) 


where  a  and  b  are  constants  of  order  one  whose  values  are  slightly  different  from  the 
quoted  ones,  as  the  sawtooth  sum  is  from  0  to  N  —  1,  rather  than  1  to  N,  but  this  is  a 
trivial  complication.  The  essential  conclusion  is  that  the  error  is  bounded  in  N,  and 
corroborates  our  conclusion  that  there  is  no  log(A^)  problems  in  the  sawtooth  map 
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Figure  2.    The  error  as  a  function  of  the  chaos  parameter  K  for  the  sawtooth  map 
with  N  =  4000. 
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despite  the  presence  of  the  fixed  point  on  the  singularity.  Also  we  once  again  notice  that 
a  larger  K  (larger  chaos)  implies  a  better  semiclassical  accuracy.  In  figure  2  we  show 
the  results  of  a  numerical  computation  of  the  error,  which  shows  that  the  error  goes 
roughly  as  K~ll2,  with  the  slope  depending  weakly  on  N  and  K  and  rather  strongly  on 
the  fractional  part  of  K.  Notice  that  at  integer  K  (cat  maps)  the  approximation  is  exact. 
We  thus  note  that  like  its  relative  on  the  cylinder  the  torus  map  error  is  also  of  the  order 
of  AT1/2. 

We  had  earlier  studied  the  sawtooth  map  on  the  torus  with  slightly  different  boundary 
conditions  [10],  and  came  to  essentially  similar  conclusions,  however  in  that  case  there 
was  no  fixed  point  on  the  discontinuity.  When  the  fixed  points  moved  towards  the 
singularity,  at  the  exact  point  of  contact,  the  map  turned  into  the  continuous  cat  map  when 
the  trace  formula  becomes  exact.  We  now  turn  to  a  smooth  map  on  the  two  torus,  the 
standard  map.  Much  studied  in  other  contexts,  it  is  largely  unstudied  from  the  view  of 
trace  formulas,  although  it  should  form  a  natural  object  of  study,  as  it  represents  generic 
mixed  systems,  and  one  recognizes  the  difficulties  of  such  systems  apropos  trace 
formulas. 

We  will   consider  the  map  once   again  on   the  unit  torus.   The   map   is   now 
n+i  =  (pn  +  k  sm(2Trqn+i  ))modl).  The  Lagrangian  (action)  is 


L(qn,  qn+\  ,  /„,  mn)  =  (qn+i  -qn+ln)2/2-k  cos(27rgn+i  )/2?r  -  mnqn+\  .       (20) 


The  fixed  points  are  easy  to  find  and  are  at  (qm  =  (l/2?r)  arcsin(ra/fc),/7m  =  .0),  where,  m 
is  an  integer  such  that  \m\  <  k.  Once  again  the  location  and  number  of  the  fixed  points  are 
very  much  dependent  on  the  the  chaos  parameter  k  unlike  the  standard  map  on  the 
cylinder.  Note  that  there  are  two  fixed  points  associated  with  each  m. 
The  quantum  propagator  is  again  given  by  (N  even  throughout) 

e-nr/4 

Unm  =—j==-exp(-ikNcos(2irn/N)}  exp(z'7r(«  -  m)  /N}.  (21) 

and  the  trace  is  the  finite  sum 

Tr([/)  =— =-y%xp(-ifcWcos(27r«/AO).  (22) 


The  semiclassical  approximation  must  approximate  this  sum  using  the  classical  fixed 
points,  their  actions  and  their  stabilities.  First  note  that  the  quantum  trace  apart  from  the 
factor  e~1"^  is  real.  The  sawtooth  map  traces  were  Gaussian  sums,  and  partial  sums  of 
these  give  rise  to  the  "curlicues",  that  have  been  studied  in  detail  earlier  and  find 
applications  from  optics  to  quantum  mechanics  [13].  The  partial  sums  associated  with  the 
quantum  trace  of  the  standard  map  ought  to  be  interesting  mathematical  objects  in  their 
own  right. 

Here  we  present  a  preview  of  the  intriguing  class  of  periodic  "curves"  that  this  gives 
rise  to.  Consider  the  sum 
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Figure  3.     A  panel  of  the  partial  sums  crioooCK",  0  (23)  for  various  K. 


The  quantum  standard  map  trace  is  essentially  ON(k,N}.  We  plot  in  figure  3  the  real, 
*(/),  and  imaginary,  y(/),  parts  of  crN(kJ]  as  an  argand  diagram.  Strictly  there  are  only 
as  many  points  as  terms  in  the  partial  sum,  and  as  such  do  not  define  a  curve,  but 
the  connections  showing  the  motion  of  the  points  in  "time"  /  is  suggestive.  Only  the 
sum  up  to  I  —  N/2  is  relevant  as  beyond  this  a  reflection  and  periodic  translation  is 
the  whole  curve.  Hence  unlike  curlicues,  these  are  periodic  curves.  A  plethora  of 
cornu  spirals  interweave  to  conjure  the  extravagantly  ornate  features.  Several  "eyes"  of 
cornu  like  spirals  are  connected  in  a  remarkable  manner.  An  essential  feature  of 
these  curves  is  their  extreme  susceptibility  to  small  changes  in  k,  especially  at  large  N 
values.  We  cannot  rule  out  the  possibility  of  number  theoretic  features  here  as  well, 
and  indeed  we  observe  marked  difference  between  the  cases  when  k  is  an  integer  and 
when  it  is  not. 

The  periodic  nature  of  the  curves  are  easily  understood,  as  y(l  +  N/2)  =  —  y(7) 
and  x(l  +  N/2)  =  x(l)  +  A,  where  A  =  ^n=0)Ar/,2_1cos(-i'A^Vcos(27rn/A'")).  Hence 
the  period  of  the  curves  are  2A.  As  N  is  increased  for  fixed  k  the  curves  tend 
to  smoothen  out  more  but  with  the  cornu  spiral  structures  intact.  Also  note  that  the 
quadratic  approximation  of  the  cosine  term  in  the  exponential  are  the  culicues  studied 
earlier  [13]. 
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more  to  the  central  theme,  namely  the  accuracy  of  semiclassical  approximations.  Define 
9N(t)  =  N(Vk2  -  I1  +  larcsm(l/k}).  (24) 

Then  a  careful  application  of  the  trace  formula  yields  the  semiclassical  approximation 


2sin(Aflk 


[*] 

:+£ 

/=! 


4sin(^(Q+7r/4) 


(25) 


which  is  real  except  for  the  factor  e"1'*/4,  just  as  the  exact  quantum  trace  in  (22).  Here  the 
fixed  point  at  the  origin  has  been  treated  separately  as  corresponding  to  each  |m|  there  are 
four  fixed  points  except  when  m  =  0,  which  gives  rise  to  two  fixed  points.  The  range  of 
the  inverse  sine  function  is  [0,7T/2j.  Note  that  this  approximation  can  be  derived 
independent  of  the  trace  formula,  by  first  converting  the  sum  into  an  infinite  sum  of 
integrals  via  the  Poisson  summation  formula,  and  of  these  integrals  retaining  only  those 
which  contribute  to  the  stationary  phase,  and  evaluating  these  integrals  in  the  quadratic 
approximation.  Indeed  the  trace  formula  in  the  context  of  maps  on  the  torus  is  derivable 
by  this  procedure  and  is  hence  considerably  simpler  than  the  derivation  for  general 
Hamiltonian  flows  (see  Appendix). 

First  we  note  a  drawback  of  the  semiclassical  approximation.  Near  parameter  values 
where  bifurcations  are   going  to   create  new   periodic   points   the   semiclassical 
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Figure  4a.    (Continued) 
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Figure  4.    The  error  for  the  standard  map  on  the  torus  as  a  function  of  (a)  k,  with 
N  =  1000,  (b)  inverse  Planck's  constant  N,  with  k  =  100.5. 


approximation  becomes  poor.  Uniform  approximations  may  be  made  to  smooth  over 
these  parameter  values.  For  instance  in  the  case  of  the  standard  map,  whenever  k  passes 
an  integer  value  it  creates  four  new  fixed  points  via  tangent  bifurcations,  and  the 
approximation  to  the  trace  becomes  poor.  This  is  illustrated  in  figure  4(a).  However 
notice  that  the  approximation  is  indeed  quite  good  even  just  away  from  these  values.  We 
will  below  retain  the  approximation  made  above  and  keep  away  from  points  of 
bifurcations  to  study  various  aspects  of  the  error. 

In  figure  4(b)  we  show  the  absolute  value  of  the  error  e  =  |Tr(£7)  -  Tr(J7)JC|  for 
a  fixed  value  of  k=  100.5,  and  a  range  of  N.  It  is  clear  that  the  errors  involved 
are  not  monotonically  decreasing  with  N,  but  that  there  are  fluctuations  in  the  mean. 
The  mean  however  very  nearly  follows  N~l  behaviour.  This  should  be  contrasted 
with  the  more  cleaner  but  slower  N~1/2  behaviour  of  the  sawtooth  map.  In  general 
the  errors  incurred  for  the  smooth  standard  map  is  smaller  than  those  of  the  sawtooth 
map.  We  see  that  this  is  a  property  that  is  carried  over  from  the  maps  on  the  cylinder. 
It  is  unclear  to  the  author  whether  the  standard  map  fairs  better  due  to  its  smoothness 
or  due  to  some  other  property.  We  note  that  the  trace  formula  approximation 
behaves  better  with  increasing  chaos  in  the  case  of  the  sawtooth  map  than  the  standard 
map. 

At  a  fixed  value  of  N  the  behaviour  of  the  trace  for  increasing  k  is  quite  complex. 
No  simple  rule  like,  higher  chaos  implies  better  semiclassical  accuracy  emerges.  First 
we  note  that  the  ratios  or  the  relative  error  fair  quite  poorly,  with  large  abrupt  deviations. 
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Figure  5.  (a)  The  relative  error  for  the  standard  map,  as  a  function  of  the  chaos 
parameter  k,  N  =  1000,  (b)  the  absolute  error  as  a  function  of  k  in  the  extreme 
quantum  limit,  N  =  2. 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  II) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


o 
o 
o 


0.002     - 


0.001     - 


0  - 


8000 


Figure  6.    The  error  as  function  of  k  in  a  wide  range  of  k,  for  the  standard  map  with 
N  =  1000. 


Figure  5(a)  shows  the  relative  errors  |e|/|Tr(C/)JC|,  but  the  range  in  the  y  axis  has 
been  lessened  to  highlight  the  predominant  features.  The  error  could  become  as  high 
as  40  in  the  range  of  k  shown.  This  is  however  easy  to  understand  in  the  context 
of  (7)  for  the  map  on  the  cylinder,  as  due  to  the  mismatch  of  the  zeros  of  the  semi- 
classical  and  the  quantum  traces.  We  expect  the  absolute  value  of  the  errors  to  fair  better. 
Indeed  when  N  is  small,  one  observes  an  increase  of  accuracy  with  k,  and  this  is  a  power 
law  for  small  N.  Thus  we  show  in  figure  5(b)  the  logarithm  of  the  errors  with  the 
logarithm  of  k,  and  we  observe  the  rule  that  e  =  Cfc~7,  with  7  ~  0.25,  for  the  extreme 
quantum  case  N  =  2.  Here  the  k  go  in  steps  of  unity  with  their  fractional  part  begin 
uniformly  0.5. 

However  as  N  is  increased  we  observe  that  this  is  no  longer  true,  and  that  the  errors 
tend  to  oscillate  with  k  in  a  fairly  complex  manner.  The  overall  tendency  seems  to  be  in 
fact  a  deterioration  of  the  accuracy  with  k.  Figure  6  shows  the  errors  as  a  function  of  k  in 
a  large  range.  The  interesting  aspect  is  that  the  oscillation  period  seems  to  increase 
geometrically.  Therefore  there  is  a  possibility  that  the  errors  scale  with  &,  but  in  a 
complex  manner.  However  increasing  k  even  higher  seems  to  render  the  errors  an  almost 
monotonic  growth,  and  we  cannot  comment  on  the  many  details  until  further  analytic 
estimates  are  made  and  computer  generated  errors  are  under  control. 

An  important  difference  between  the  semiclassics  of  the  sawtooth  map  and  the 
standard  map  is  that  because  of  the  piecewise  linear  nature  of  the  sawtooth  map  there  was 
no  need  to  truncate  Taylor  expansions  about  the  periodic  orbits,  while  in  the  case  of  the 
standard  map  this  truncation  is  done  up  to  second  order.  Higher  order  corrections  can  be 
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worked  out  and  have  formed  the  subject  of  recent  works  [15,  16],  it  should  be  interesting 
to  compare  the  errors  when  such  higher  order  terms  are  included. 

4.  Summary 

Using  maps  as  models  of  quantum  chaos  we  have  studied  the  simplest  traces  from  the 
point  of  view  of  semiclassical  approximations,  hoping  that  this  will  reflect  some  more 
general  aspects  of  errors  involved  in  the  Gutzwiller's  periodic  orbit  sum  or  trace  formula. 
We  found  that  indeed  the  maps  on  the  cylinder  had  controllable  relative  errors.  We  found 
that  the  smooth  standard  map  faired  better  than  the  sawtooth  map  in  the  order  of 
magnitude  of  the  errors.  We  also  noted  that  the  mismatch  of  zeros  between  the 
semiclassical  and  the  quantum  can  lead  to  deterioration  of  relative  errors.  In  the  case  of 
toral  maps  we  displayed  a  rather  remarkable  fact  that  even  in  the  presence  of  fixed  points 
on  discontinuities  the  trace  formula  may  be  exact  at  the  level  of  the  time  one  trace. 
Otherwise  we  showed  that  more  generally  there  is  no  log  (ft)  divergences  in  the  sawtooth 
map,  unlike  the  case  of  the  quantum  bakers  map.  We  also  noted  that  the  error  may 
actually  decrease  with  increasing  chaos,  a  rather  counter  intuitive  fact,  which  has 
quantitative  evidence  in  the  case  of  the  sawtooth  map.  However  we  caution  that  this 
cannot  be  a  general  principle  as  a  study  of  the  standard  map  has  shown  a  much  more 
complex  behaviour  of  the  error.  It  is  indeed  a  matter  of  importance  to  study  and  contrast 
the  two  maps,  and  understand  why  the  errors  behave  qualitatively  differently. 

We  have  restricted  our  study  to  the  time  one  trace  and  indeed  it  is  natural  and  necessary 
that  we  study  the  higher  traces.  For  instance  to  study  the  errors  involved  in  the 
semiclassical  evaluation  of  Tr(t/2)  leads  us  into  the  problem  of  enumerating  period  two 
orbits,  which  can  be  fairly  easily  computed  in  the  cases  studied,  but  which  are  much 
larger  in  number  than  the  fixed  points.  Preliminary  results  in  this  direction  indicate  that 
for  the  sawtooth  map  the  error  behaviour  of  the  time  two  trace  is  similar  to  that  of  the 
trace,  although  the  errors  are  larger.  We  have  also  displayed  some  remarkable  curves  akin 
to  curlicues  that  arise  out  of  a  study  of  the  trace  of  the  standard  map  on  the  torus. 

Appendix 

Consider  the  following  map  defined  on  the  unit  torus: 

qj+pj-ij 

(Al) 
Pj  -  v'(<lj+i}  ~  mj> 

where  V(q)  is  a  periodic  "kicking"  potential  with  unit  periodicity,  the  prime  indicates 
derivative,  and  the  integers  m/  and  //  are  such  that  the  map  is  restricted  to  the  unit  torus 
(winding  numbers).  They  are  the  "modulo  one"  operations.  As  is  well  known  the  above 
map  can  be  derived  from  a  time  dependent  Hamiltonian  system.  Also  there  exists  a 
generating  function  for  the  map  on  the  torus  given  by 


(A2) 
from  which  the  map  can  be  derived  as  PJ  =  -dS/dqj  andpj+1  =  dS/dqj+i. 
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Accuracy  of  trace  formulas 

The  quantum  map  corresponding  to  the  (Al)  is  given  by  the  finite  dimensional  unitary 
matrix 

_-«r/4  /  :„  {  n\\ 

{n|f7|n')=-^-exp[-(W-n')2-27r.wUJJ.  (A3) 

This  takes  on  the  function  of  the  propagator  as  it  connects  states  after  consecutive  kicks. 
Here  the  representation  is  in  the  position  basis  and  n  and  n'  may  take  the  integer  values 
from  0  to  N  —  1.  A''  is  itself  the  dimensionality  of  the  Hilbert  space  and  is  also  the  inverse 
Planck  constant,  and  is  restricted  to  be  even  integers  for  preservation  of  toral  boundary 
conditions.  Semiclassics  means  the  study  of  the  unitary  operator  as  N  —  »  oo. 

The  object  of  primary  interest  below  is  the  trace  of  the  powers  of  the  propagator  from 
which  the  spectrum  may  be  derived  by  a  Fourier  transform.  This  can  be  written  as 


with 

=  n\. 


It  is  implied  in  the  above  that  the  outer  sum  is  over  the  T  variables  «.,-,  z  =  1,  2,  .  .  .  ,  T. 
Using  the  Poisson  summation  formula  we  can  rewrite  the  above  as 


,*,-  +  xf  -  Xjxj+l  -  V(Xj]       , 


(A5) 
V         ;=i   \  // 

with 

=  x\     and    fa+i  =  k\  . 


TTie  stationary  phase  approximation,  assuming  that  N  is  large,  gives  the  following  T 
conditions 

Xj+i  =2xj-Xj-i-V'(xj)+kj;     j  =  1,  .  .  .  ,  T,     *0  =  XT,     xi  =  XT+I,       (A6) 
with  the  further  condition  that 
0<^-<  1. 

These  are  precisely  the  equations  that  determine  the  period  T  orbits  of  the  map.  The 
integers  kj  are  related  to  the  integers  lj  and  m/  of  (Al)  by  the  relation 
kj  =  lj-.]  -  lj  -  m/_i.  With  the  assumption  that  they  uniquely  determine  a  periodic  orbit 
of  period  an  integer  fraction  of  T,  we  restrict  the  infinite  sum  to  only  those  that  relate  to 
such  a  periodic  orbit  labeled  below  by  (3.  This  is  the  first  approximation  and  the  second 
one  is  that  we  will  extend  the  ranges  of  integration  to  the  whole  real  line.  These 
approximations  together  are  exact  in  the  case  of  the  cat  maps  [9,  11,  12],  and  the  third 


retaining  up  to  the  quadratic  terms.  This  approximation  is  unnecessary  for  the  piecewise 
linear  sawtooth  maps  [10],  but  is  necessary  in  general. 

Therefore  we  define  new  variables  y_/  such  that  Xj  =  qj  +  yj  and  qj  is  the  central 
periodic  orbit. 


The  sum 


V"(qj}/2}.     (A7) 


(A8) 


can  be  identified  with  the  action  of  the  periodic  orbit,  as  from  (A2)  one  gets 


/  -  qqj+i  ~  V(qj)  +  qjkj  +  lJ/2), 


(A9) 


which  is  essentially  the  sum  in  (A8).  We  can  neglect  the  term  lf/2,  from  the  Lagrangian; 
this  is  legitimate  as  the  action  occurs  as  e2l"ws,  and  since  N  is  an  even  integer  these  terms 
do  not  matter.  Note  that  the  trace  formula  at  T  =  1  is  easily  obtained  from  the  above  and 
has  been  used  in  the  main  body  of  this  paper. 
The  integral  is  evaluated  by  standard  methods. 


(A10) 


Here  y  denotes  the  T  component  vector  {yj}  and  /  is  its  transpose.  The  real  symmetric 
matrix  A/j  is  given  by 


Afl  = 


/2-V'te,)  -1  0      .. 

-1  2-V"(q2]     -1     0 


\     -] 


-1     2~V"(qT)J 


(AH) 


f/j  is  a  "Maslov  like"  index,  and  is  the  number  of  negative  eigenvalues  of  the  matrix  A/j. 

The  nontrivial  problem  of  evaluating  the  determinant  is  already  solved  and  the  results 
are  weD  known  [14].  The  determinant  can  be  related  to  the  stability  of  the  periodic  orbit. 
If  the  residue  of  the  periodic  orbit,  which  is  simply  related  to  the  trace  of  the  stability 
matrix,  is  Rp,  then  Rp  =  -|det(A0).  or  if  we  denote  the  stability  matrix  eigenvalues  by 
A?  and  A£,  det(A0)  =  A^  +  A^  -  2. 

Thus  finally  we  can  write  the  asymptotic  periodic  orbit  sum,  or  the  trace  formula,  for 
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the  toral  maps  as 


The  sum  is  over  periodic  orbits  whose  periods  are  integer  fractions  of  T  are  labeled  by  /3. 
This  is  of  the  canonical  Gutzwiller-Tabor  form  having  in  the  exponent  the  action  of  the 
period  T  orbits,  and  the  prefactor  explicitly  depending  on  the  stability  of  these  orbits. 
The  piecewise  linear  maps  such  as  the  baker's  map,  or  the  sawtooth  map  are  such  that  the 
Maslov  like  phases  are  zero  and  the  only  type  of  periodic  orbits  are  those  of  the  direct 
hyperbolic  kind.  Now  however  we  can  study  the  vast  class  of  mixed  systems,  such  as  the 
well  known  standard  map.  It  is  then  of  interest  to  ask  questions  such  as  how  long  the 
semiclassical  approximations  are  valid  in  such  systems  which  one  knows  are  generic.  The 
quantum  mechanics  on  the  torus  is  exact  and  thus  we  expect  that  the  derivation  above  will 
facilitate  the  investigation  of  the  periodic  orbit  sum  for  mixed  systems.  In  this  note  we 
have  used  the  above  formula  only  in  the  special  case  of  T  =  1 . 
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Abstract  We,  offer  an  alternative  interpretation  of  the  Riemann  zeta  function  £(s)  as  a  scattering 
amplitude  and  its  nontrivial  zeros  as  the  resonances  in  the  scattering  amplitude.  We  also  look  at 
several  different  facets  of  the  phase  of  the  £  function.  For  example,  we  show  that  the  smooth  part  of 
the  C  function  along  the  line  of  the  zeros  is  related  to  the  quantum  density  of  states  of  an  inverted 
oscillator.  On  the  other  hand,  for  3fo  >  1/2,  we  show  that  the  memory  of  the  zeros  fades  only 
gradually  through  a  Lorentzian  smoothing  of  the  delta  functions.  The  corresponding  trace  formula 
for  3fa  »  1  is  shown  to  be  of  the  same  form  as  generated  by  a  one-dimensional  harmonic  oscillator 
in  one  direction  along  with  an  inverted  oscillator  in  the  transverse  direction.  Quite  remarkably  for 
this  simple  model,  the  Gutzwiller  trace  formula  can  be  obtained  analytically  and  is  found  to  agree 
with  the  quantum  result. 
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1.  Introduction 

The  Riemann  zeta  function  £(j)  of  the  complex  variable  s  =  a  +  it  is  defined  for  a  >  I 
by  [1] 

CD 


n=l  peprimes  ^          r      1 

and  for  a  <  1  by  analytic  continuation.  As  is  well  known,  £(s)  has  so  called  trivial 
zeros  at  s  =  —In  with  n  =  1,2, 3 —  All  other  zeros  are  at  complex  values  of  s. 
According  to  Riemann's  celebrated  hypothesis  made  in  1856,  the  nontrivial  zeros  of  £ 
all  lie  symmetrically  on  the  line  cr  =  1/2,  i.e.  (Q±itn)  =  0.  Riemann's  hypothesis 
is  supported  by  numerical  tests  up  to  very  large  values  of  tn  but  mathematicians  are 
still  unable  to  prove  or  disprove  it  rigorously.  It  is  also  well  known  that  there  are 
an  infinite  number  of  zeros  on  the  half-line  a=  1/2  [2].  These  so-called  nontrivial 
zeros  are  of  great  interest  to  mathematicians  from  a  number-theoretic  point  of  view 
and  to  physicists  interested  in  quantum  chaos  and  periodic  orbit  theory  [3].  In  parti- 
cular, the  so-called  nontrivial  zeros  of  C,(s)  exhibit  an  intrinsically  random  distribution 
of  the  GUE  type.  Further,  assuming  that  the  famous  Riemann  hypothesis  is  correct, 
the  density  of  zeros  can  be  shown  to  obey  a  sum  rule  which  is  analogous  to  the 
famous  Gutzwiller  formula  for  the  level  density.  All  this  is  taken  to  suggest  that  there 
may  be  a  classical  chaotic  dynamical  system  without  time-reversal  invariance  and  tn 
are  the  eigenvalues  of  the  quantum  Hamiltonian  which  is  obtained  by  quantizing  this 


found. 

We  have  offered  an  alternative  interpretation  of  the  £  function  as  a  scattering  amplitude 
and  its  nontrivial  zeros  as  resonances  in  the  scattering  amplitude  [7,8].  As  a  bonus,  this 
interpretation  directly  leads  us  to  an  approximate  rule  for  the  location  of  the  zeros  with  an 
error  which  is  at  most  3  per  cent. 

We  have  also  looked  in  detail  at  several  different  facets  of  the  phase  of  the  Riemann  ( 
function.  For  example,  we  show  that  the  smooth  part  of  the  £  function  along  the  line  of 
the  zeros  (i.e.  a  =  1/2)  is  related  to  the  quantum  density  of  states  of  an  inverted 
oscillator.  On  the  other  hand,  for  a  >  1/2,  we  show  that  the  memory  of  the  zeros  fades 
only  gradually  through  a  Lorentzian  smoothing  of  the  delta  functions.  The  corresponding 
trace  formula  for  a  »  1  is  shown  to  be  of  the  same  form  as  generated  by  a  one- 
dimensional  harmonic  oscillator  in  one  direction  along  with  an  inverted  oscillator  in  the 
transverse  direction.  Quite  remarkably,  for  this  simple  model,  the  Gutzwiller  trace 
formula  can  be  obtained  analytically  and  is  found  to  agree  with  the  quantum  result.  As  far 
as  I  am  aware  this  is  the  first  instance  of  a  scattering  system,  for  which  the  Gutzwiller 
formula  is  exact.  Before  I  come  to  these  issues,  I  briefly  review  some  properties  of  the  £ 
function. 


2.  Some  Properties  of  the  Riemann  Zeta  function 

The  Riemann  zeta  function  as  defined  hi  Eq.  (1)  can  also  be  written  as 

p(-/3En)  (2) 


n=0 


where  /?  =  ^,  s  =  £{3  and  En  =  £ln(n  +  1),  with  the  constant  £  setting  the  energy  scale. 
In  this  form  it  is  clearly  the  canonical  partition  function  of  a  quantum  system  with  an 
energy  spectrum  En  as  given  above.  Without  any  loss  of  generality,  we  set  £  =  1 
throughout  this  article. 

Actually,  £  can  also  be  looked  upon  as  a  grand  partition  function;  since  any  integer  n  is 
a  unique  product  of  primes  pt  (ln(n)  )  =  £\  at  ln(p,-)  where  on  are  a  set  of  positive  integers 
or  zero).  Hence  the  zeta  function  may  also  be  written  as 

0) 


Written  in  this  form,  it  can  clearly  be  regarded  as  a  bosonic  grand  partition  function  with 
chemical  potential  \i  —  0.  In  this  article  however  we  shall  make  use  of  the  canonical 
partition  function  interpretation  of  the  £. 

The  quantum  density  of  states  of  the  system  as  given  by  (2)  with  the  energy  spectrum 
En  =  ln(n  +  1)  is  defined  as 


(4) 

n=0  n=0 

Following  Jennings  [9],  let  us  derive  an  expression  for  the  smooth  and  oscillating  parts  of 
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Riemann  zeta  function 
p(E]  for  a  nondegenerate  spectrum.  Using  the  fact  that 

oo  oo 

(x  -  n)  =  1  +  2]T  cos(27rmjc)  (5) 


n=0  m=l 

we  obtain 


m=l 


—     H-2Vcos(27rmG(E))  (6) 

dE  ' 


Here  G(£)  satisfies  the  inverse  relation  n  =  G(E).  For  the  spectrum,  En  —  ln(n  4- 1),  we 
have  G(E]  =  (&E  —  1)  and  hence  the  density  of  states  can  be  written  as 


cos(27rme£) 


m=l 


(7) 


which  is  exponentially  rising!  For  an  exponentially  rising  density  of  states  like  the  one 
here,  the  entropy  S(E)  =  E,  and  the  limiting  inverse  temperature(which  is  the  analogue  of 
the  Hagedorn  temperature  [10]  in  high  energy  physics)  is  given  by  (3C  =  dS/dE  =  1.  This 
is  not  surprising  in  view  of  the  composite  nature  of  the  system  and  the  fact  that  the 
number  of  elementary  constituents  (primes)  tend  to  increase  with  increasing  E 
(corresponding  to  large  n).  A  similar  situation  is  encountered  in  high  energy  physics 
[10]  where  the  hadrons  may  be  considered  to  be  the  composite  of  more  elementary 
constituents  whose  number  also  increases  with  energy. 

Let  us  now  mention  some  standard  results  for  the  Riemann  zeta  Function.  The  most 
important  is  the  functional  equation  satisfied  by  it  [1] 

C(s)  =  2~57r*"1  sin(s7r/2)r(l  -  j)C(l  -  0  (8) 


I  would  like  to  point  out  the  analogy  [11]  between  this  functional  equation  and  the 
Kramers-Wannier  duality  of  the  canonical  partition  function  per  site  for  the  "infinite" 
planar  square  Ising  model  [12]  as  given  by 

Z(Ks)=Z(K's)/stih(2K's)  (9) 

with 

sinh(2J&)  sinh(2K/j)  =  1  (10) 

It  is  worth  emphasizing  that  the  Kramers-Wannier  duality  is  rather  special  and  the 
fixed  point  of  the  duality  equation  turns  out  to  be  the  critical  temperature.  Also  note 
that  both  C  and  Z  are  real  for  real  s  and  their  sets  of  zeros  are  invariant  under  both 
duality  and  complex  conjugation.  I  would  also  like  to  emphasize  the  striking 
similarity  between  the  result  of  Fisher  [13]  on  the  location  of  all  the  zeros  of  Z  on 
the  circle  |  sinh(2JCs)  =  1,  with  the  Riemann  hypothesis  that  all  nontrivial  zeros  are 
on  the  fixed  line  a  =  \.  In  last  two  years,  duality  has  played  a  very  profound  role 
in  supersymmetric  gauge  field  theories  as  well  as  string  theories  [14]  and  it 
would  be  worthwhile  to  enquire  if  some  of  those  ideas  could  also  be  useful  in  the  case 
of  C  function. 


Avinash  Khare 

Using  the  fundamental  functional  relationship  between  £(s)  and  £(1  -  s)  as  given  by 
Eq.  (8),  it  is  easy  to  show  that 


where  F(z)  denotes  the  gamma  function  of  the  argument  z.  We  may  further  write 


where  Z(t)  is  real,  and  0(i)  is  the  phase  angle,  with  the  convention  that  0(0)  =  TT.  Using 
Eqs.  (1)  and  (2),  it  follows  that 

V  (1-4-  it/  2\ 

*_  .        .  (13) 

•L  (4     «/A/ 

The  phase  0,  as  defined  above,  is  smooth  in  the  sense  that  it  does  not  include  the  jumps  by 
TT  due  to  the  zeros  of  Z(t).  Nevertheless,  the  number  of  zeros  between  0  and  t  on  the 
cr  =  1/2  line  is  counted  fairly  accurately  by  0(t),  as  will  become  clear  from  the  Argand 
diagram.  Note  that 


2 
which  satisfies  the  condition  that  0(0)  =.TT.  The  density  of  zeros  is  given  by 

id0     i  r         <JTA    . 


,10 

,  (15) 

27r[  L    V4      2 

where  the  digamma  function  is  defined  as  \I>(z)  =  F'(z)/F(z).  From  the  above,  the 
asymptotic  expression  for  6(t)  may  be  obtained  immediately  by  making  asymptotic 
expansion  of  the  F  functions.  We  denote  this  by  Q(t),  and  it  is  given  by 


8 


3.  Argand  diagram  and  analogy  with  the  scattering  amplitude 

To  bring  out  some  characteristics  of  the  function  £(1/2  +  it],  we  plot  its  Argand  diagram 
in  figure  l(a)  in  the  range  t  =  9  to  t  =  50.  This  shows  a  sequence  of  closed  loops,  one  for 
every  zero  of  the  zeta  function.  At  a  zero  of  £(1/2  +  it),  both  its  real  and  imaginary  parts 
are  zero  at  the  same  value  of  t,  and  therefore  every  loop  converges  at  the  origin.  The 
intercepts  on  the  real  axis  are  the  so-called  "Gram  points"  where  only  imaginary  part  of 
C(s)  is  zero  due  to  the  phase  angle  0(r)  =  n?r.  With  infrequent  exceptions,  there  is  one 
Gram  point  between  two  consecutive  zeros  of  the  £  function.  The  first  two  exceptions  to 
this  rule  occur  for  the  126th  and  the  134th  zeros  at  t  =  282.455  and  295.584  respectively 
[1].  In  figures  l(b)  and  l(c),  Argand  diagrams  are  drawn  away  from  the  1/2-axis,  for 
or  =  0.6  and  cr  =  1  respectively.  These  clearly  show  the  defocussing  at  the  origin  due  to 
the  absence  of  the  zeros  in  the  Zeta  function.  Moreover,  the  number  of  intercepts  along 
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Figure  1.  The  Argand  diagrams  for  £(er  +  it]  for  increasing  ?  and  fixed  a.  (a) 
C(l/2  +  it)  in  the  range  t  =  9  —  50.  The  lower  limit  for  f  is  chosen  so  as  not  to  miss 
the  Gram  point  flanking  the  lowest  zero  at  t  =  14.13.  (b)  ((0.6  +  if)  for  t  =  9  -  50  to 
show  the  defocusing  at  the  origin,  (c)  £(1  +  it)  for  t  =  9  —  50.  Note  the  pronounced 
shift  of  the  diagram  away  from  the  origin  in  this  case. 


the  real-axis  in  the  Argand  diagrams  now  show  a  large  increase  compared  to  the  a  =  1/2 
case,  whereas  the  intercepts  on  the  imaginary  axis  are  few  or  nonexistant.  This  is  a 
reflection  of  the  change  in  the  behaviour  of  the  phase  9(t)  away  from  the  a  =  1/2  line.  In 
figure  2(a),  the  phase  angle  0(i),  as  determined  by  eq.  (14),  is  plotted  as  a  function  of  t  on 
the  1/2-axis.  This  phase  angle  is  a  smooth  function  of  t  because  the  jumps  by  TT  at  every 
zero  (due  to  the  change  in  the  sign  of  the  ^-function)  is  not  registered  by  it.  These 
discontinuities  are  shown  separately  in  figure  2(b).  The  smooth  phase  keeps  increasing 
monotonically  with  /,  since  the  curve  in  the  complex  plane  passes  through  the  origin  at 
every  zero  . 

Finally,  in  figures  3(a)  and  3(b),  the  Argand  diagrams  of  the  Zeta  function  are  drawn 
for  a  much  larger  range  of  f,  from  1  to  500,  on  and  off  the  1/2-axis.  Note  that  the  scale  for 
a  =  1  is  expanded  compared  to  that  for  a  =  1/2.  Borrowing  from  the  terminology  of  the 
motion  of  a  particle  in  phase  space,  it  is  as  if  there  is  an  "attractor"  at  the  origin  for 
0-=  1/2  (figure  3(a)),  which  is  absent  from  the  more  disorderly  tracks  of  figure  3(b), 
which  is  drawn  along  the  a  =  1  line.  The  latter  figure  also  shpws  that  the  real  part  of  the  ( 
function  is  always  positive  for  a  =  1  for  this  entire  range  of  t. 
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Figure  2.  (a)  The  smooth  phase  angle  of  C(l /2  +  it),  as  defined  by  Eq.  14,  is  plotted 
as  a  function  of  t  =  0  -  50.  (b)  The  discontinuities  of  TT  in  the  phase  angle  d(t)  at 
or  =  0.5  at  the  position  of  the  zeros  are  shown.  The  total  phase  is  given  by  the 
superposition  of  (b)  on  (a). 

The  loop  structure  of  the  £  function  at  cr=  1/2,  with  some  near-circular  shapes,  is 
reminiscent  of  the  Argand  plots  for  the  scattering  amplitudes  of  different  partial  waves  in 
the  analysis  of  resonances,  for  example  in  pion-nucleon  scattering  [15].  Consider  the 
partial  wave  amplitude  //(fc),  defined  in  terms  of  the  partial  wave  phase-shift  <$/(£)  and  the 
inelasticity  parameter  r/j(Jk), 

fi(k]  =  (Tfc«p(2W,)  -  l)/2ft.  (17) 

Here  /  refers  to  the  angular  momentum,  and  k  the  wave  number.  Note  that  3tfi(k)  is  never 
negative,  since  the  inelasticity  parameter  r#  is  always  less  than  one.  One  generally  plots 
an  Argand  diagram  with  2*9$  (*)  along  the  y-axis  and  2fc  &//(*)  along  the  x-axis  for 
various  values  of  k.  For  the  case  of  no  inelasticity  (T#  =  1)  and  a  single  resonance,  the 
Argand  diagram  is  a  perfect  circle  with  unit  radius,  with  the  center  on  the  imaginary  axis 
at  1.  By  comparing  this  with  figure  l(a)  at  a  =  1/2,  we  see  that  the  real  and  the 
imaginary  parts  are  interchanged  in  the  latter,  but  otherwise  there  is  a  strong  similarity, 
with  many  of  the  loops  having  inelasticity.  This  analogy  is  flawed,  however,  since 
&C(l/2  +  it]  does  become  negative  in  small  islands  of  t.  Nevertheless,  if  these  islands  are 
ignored,  then  the  phase  shift  Si  may  be  identified  with  the  phase  angle  0  +  n/2,  with  each 
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Figure  3.    The  Argand  diagrams  of  the  Riemann  zeta  function  for  the  wider  range  of 
t- values,  from  t  =  9-  500.  (a)  £(1/2  +  it),  and  (b)  C(l  +  it}. 


closed  loop  in  figure  l(a)  being  regarded  as  in  isolated  resonance.  In  this  approximation, 
the  Gram  points  occur  as  before  for  sin  0  =  0,  while  the  zeros  of  £(1/2  +  it)  are  given  by 
the  condition 


cos0  =  0,     0=(m+l/2)7r,     TO  =1,2,.... 


(18) 


This  condition  for  the  location  of  the  zeros  was  also  obtained  by  Berry  [5]  from  the  first 
term  in  his  approximate  formula.  Eq.  (18)  has  roots  that  yield  the  zeros  on  the  1/2-axis 
with  an  error  of  at  most  3  per  cent. 


4.  The  inverted  harmonic  oscillator 


On  the  a  =  1/2  line,  our  analogy  with  the  scattering  amplitude  suggests  that  the  phase 
angle  B(t]  is  related  to  a  scattering  phase  shift.  We  now  demonstrate  that  the  scattering  of 
a  nonrelativistic  particle  by  an  inverted  harmonic  oscillator  with  a  hard  wall  at  the  origin 
generates  a  phase  shift  that  is  closely  related  to  0(i).  Indeed,  we  show  that  the  quantum 
density  of  states  for  this  problem  is  essentially  the  same  as  Eq.  (14)  for  the  density  of  the 
zeros.  Consider  the  Schrodinger  equation  for  x  >  0, 

h2  d2 

2/wdjc^        2  I  •"•-*/ 

and  impose  the  boundary  condition  that  the  wave  function  $  vanishes  at  the  origin. 
Putting  x2  =  y,  $  =  y?0,  it  becomes 


—  0 _0         0  +  ^0  =  0<  pQj 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  II) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  543 


In  the  above  equation, 

1       ,      mw 


This  is  effectively  a  three  dimensional  Schrodinger  equation  for  a  repulsive  Coulomb 
potential  in  the  variable  y.  To  obtain  the  phase  shift,  we  write  the  asymptotic  solution  of 
the  above  equation  as 

#y)  -  sin  (ky  -  '-\*(2ky)  -~  +  'n\  (22) 

\  L  L  J 

where  rji  is  the  phase  shift  with  respect  to  the  distorted  Coulomb  wave,  given  by 
arg  r(/+  1  +  it/  2}.  For  our  one-dimensional  problem,  only  Z  =  -1/4  is  relevant.  For 
this  case,  omitting  the  subscript  /,  the  phase  shift  77  is 


(23) 

(|  -  zy)  =      .     ^.  .       .  (24) 

4  coshTry  +  ismhTry 

the  number  of  quantum  states  n(t)t  between  0  and  t,  is  then  given  by 


In  the  above  equation,  C  is  a  smooth  function  given  by 

C  =  ~  tan~!  (cosech  Tit)  .  (26) 

Note  from  above  that  the  expression  for  rj(t}  is  not  quite  identical  to  6(t)  as  defined  in 
Eq.  (14).  However,  their  derivatives,  the  quantum  density  of  states,  only  differ  by  a 
constant  and  an  exponentially  small  term.  It  should  also  be  pointed  out  that  even  if  we 
had  started  with  a  full  inverted  harmonic  oscillator  (rather  than  the  half-oscillator),  the 
same  conclusion  would  be  reached,  even  though  some  nonuniqueness  may  arise  in  the 
choice  of  the  boundary  condition.  The  inverted  harmonic  oscillator  problem  has  been 
studied  by  a  number  of  authors  in  the  past  in  relation  to  time-delay  [16]  and  string  theory 
[17].  No  connection,  however,  was  made  to  the  phase  of  C(l/2  +  it}- 


5.  The  phase  of  the  Zeta  function  for  a  >  \ 

We  define  the  phase  of  C(s)  along  the  imaginary  axis  t  for  a  fixed  a  to  be  6a(t),  given  by 

:0ff(0).  (27) 


When  a  =1/2,  we  shall  simply  drop  the  subscript,  and  denote  the  phase  by  0. 
Since  |£(.y)|  is  always  positive,  the  discontinuous  jumps  due  to  the  sign  changes  are 
included  in  the  phase  0.  This  is  different  from  the  definition  of  the  phase  given  in 
Eq.  (12),  where  B  denoted  the  smooth  part  only.  From  the  above  definition  (27),  it 
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follows  that 


It  is  the  derivative  of  the  phase  6  with  respect  to  t  that  contains  the  information  about  the 
density  of  the  zeros  on  the  line  s  =  1/2  +  it.  We  wish  to  examine  the  same  derivative  for 
cr  >  1/2.  To  this  end,  it  is  convenient  to  define  [1]  the  entire  function  £(s)  which  has  the 
same  zeros  on  the  complex  plane  as  C(s), 

«'>-  (29) 


Because  £(j)  is  an  entire  function,  it  may  be  expressed  as  [1] 


where  sn  are  the  zeros  of  £(s)  on  the  complex  plane.  Some  straight-forward  algebra  then 
yields 

"  (3!) 


In  the  above,  we  have  assumed  the  Riemann  hypothesis,  that  the  only  zeros  of  £(s}  are 
at  s  =  sn  =  1/2  +  itn.  Noting  the  representation  of  the  Dirac  delta  function 

*(f-fa)  =  lim(^ 


we  immediately  see  that  the  derivative  of  the  phase  changes  from  delta  function  spikes  to 
Lorentzian  as  we  move  away  from  the  a  =  1/2  line,  with  a  width  7  =  (a  —  1/2).  Let  us 
denote  the  density  of  the  zeros  of  £(1/2  +  it)  f  or  t  >  0  by 


-'»)•  (32) 

Then  the  Lorentz  smoothed  density  is  expressed  as 


The  complete  expression  for  the  phase  derivative  A0a/dt  can  now  be  obtained  by  using 
the  asymptotic  Stirling's  formula  for  the  large  arguments  of  the  T  function.  The  final 
expression,  after  some  algebra,  is  given  by 

WO  ~  ArWl  =  -^(0,  (34) 


TroV       ^"     TT  dr 
where 
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Figure  4.  The  derivative  of  the  phase  Oa  of  the  Lorentz-smoothed  Riemann  zeta 
function,  as  in  Eq.  (34),  is  obtained  from  the  trace  formula,  Eq.  (38).  It  is  plotted  as  a 
function  of  t  for  a  —  1.1  in  the  range  0  <  t  <  50  (upper  panel)  and  1000  <  t  <  1050 
(lower  panel).  The  dashed  line  is  calculated  by  using  the  first  ten  primes  and 
truncating  the  sum  over  the  repetitions  k  at  k^  =  10.  The  solid  line  includes  the  first 
100  prime  numbers,  and  km^  —  100.  The  exact  position  of  the  Riemann  zeros  on  the 
a  =  1  /2  axis  are  shown  by  open  circles. 

and^/v  is  given  by  Eq.  (33).  In  the  above  equation,  we  have  neglected  terms  of  order  t~2 
in  pff(i).  For  a  =  1/2,  Eq.  (34)  reduces  to 


(36) 


This  sharply  discontinuous  function,  according  to  Eqs.  (33)  and  (34),  gets  Lorentz- 
smoothed  by  a  width  (a-  1/2)  as  the  derivative  of  the  phase  is  computed  along  the 
imaginary  axis  at  a  >  1/2.  Nevertheless,  the  position  of  the  Riemann  zeros  are  still 
remembered,  with  the  memory  gradually  fading  with  increasing  cr,  as  shown  in  figure  4. 
We  next  consider  the  Euler  product  form  for  £(j)  with  a  view  to  construct  a  dynamical 
model  for  large  cr.  This  is  given  by  [1] 


(37) 
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where  the  product  is  over  all  the  primes  p.  For  cr  >  1,  the  above  expression  is  convergent. 
Following  [4],  one  may  then  obtain  the  convergent  trace  formula 

(38) 


(P) 

For  a  >  1,  only  the  p  =  2  term  dominates  in  the  above  trace-formula,  and  the 
contribution  of  the  higher  harmonics  (k  >  1)  may  be  neglected.  Then  Eq.  (38)  may  be 
written  as  [5] 


U\  Sim(7TUJff  /UJ\  ) 

where 

2?r 


We  shall  now  present  a  dynamical  toy  model  which  will  generate  a  semiclassical  trace 
formula,  which  for  its  lowest  harmonic  k  =  I  is  the  same  as  Eq.  (39).  Moreover,  in  this 
situation,  since  only  one  term  survives  in  the  Euler  product  formula  (37),  the  Argand 
diagram  for  £(s)  is  a  circle.  The  analogy  with  the  Argand  diagram  for  the  scattering 
amplitude  is  now  vivid,  with  one  elastic  resonance  only. 


6.  The  density  of  states  for  a  Toy-model 

6.1  The  Gutzwiller  trace  formula 

In  this  section,  we  first  derive  semiclassically  the  trace  formula  for  a  particle  executing 
unstable  simple  harmonic  motion  in  one  direction,  perched  on  the  edge  of  an  inverted 
harmonic  oscillator  in  a  transverse  direction.We  also  perform  the  quantum-mechanical 
calculation  to  test  the  semiclassical  formula.  Consider  a  particle  described  by  the 
Hamiltonian  (m  =  1) 

ff  =  1(^+^+^-0^).  (41) 

In  the  present  problem  the  situation  is  rather  simple,  since  there  is  only  one  primitive 
orbit  along  x.  The  trace  formula  is  of  the  form 


Spa  =  k.  -„.  (42) 

n  v     ' 


Here  T\  =  2ir/u\  is  the  period  of  the  isolated  orbit  along  x,  and  Si  (E)  =  2-jrE/uJi  is  the 
corresponding  action.  Aff  is  the  2  x  2-monodromy  matrix,  and  a\  the  Maslov  index.  The 
monodromy  calculation  is  straight-forward,  and  confirms  that  the  orbit  is  unstable 
(hyperbolic). 

To  establish  the  notation  for  this,  and  the  subsequent  Maslov  index  calculation  we 
follow  Creagh  et  al  [18].  We  start  with  a  periodic  orbit  with  coordinates  given  by 
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Q  =  (x,y)>  anu  momenta  p  =  (px,py).  ine  stability  ot  the  orbit  is  determined  oy  tne 
propagation  of  small  perturbations  <5q(f),  8p(t)  away  from  the  periodic  solution.  The  time 
evolution  of  these  perturbations  using  the  linearized  equations  of  motion  may  be  written      _ 
in  terms  of  a  4  x  4  matrix  (called  the  matrizant)  X(t):  \ 

(43)       i 

with  the  initial  condition  X(Q)  =  1.  The  value  of  the  matrizant  after  one  period  (along  the 
x-axis  TI)  is  the  full  monodromy  matrix  MI  : 

Afi=X(ri).  (44) 

Two  of  the  eigenvalues  of  M\  are  unity,  and  the  4  x  4  matrix  M  may  be  written  in 
uncoupled  blocks  of  a  2  x  2  monodromy  matrix  M\  and  a  2  x  2  unit  matrix.  It  is  this 
reduced  matrix  M\  that  is  given  above  in  Eq.  (42),  the  superscript  k  denoting  the  matrix 
for  fc-cycles.  The  monodromy  matrix  after  one  complete  period  is  given  by  (we  drop  the        } 
superscript  k  =  1) 


The   eigenvalues    are    obtained   from   det  \M\  -  XI\  =  0   and   are    found    to  be 

exp(±2?ra;2/a;i).  Therefore 


-l)l  =  2  sinh  .  (46) 


We  next  calculate  the  Maslov  index  cr\  occuring  in  Eq.  (42).  For  an  unstable 
(hyperbolic)  orbit,  the  Maslov  index  for  the  £m  harmonic  is  simply  ktn.  For  our  case,  <r\ 
can  be  calculated  analytically  and  one  can  show  that  the  Maslov  index  is  simply  2k. 

The  semiclassical  trace  formula  given  by  Eq.  (42)  now  reduces  to 


Spd(E)  =        y(_1}i  (47) 

fiwi  fef         sinh(7rJtw2/o;1)  ' 

For  W2/o>i  >  1,  the  higher  harmonics  are  severely  damped,  and  the  above  equation  is 
of  the  same  form  as  given  by  Eq.  (39).  Note  from  Eq.  (38),  however,  that  there  is 
an  overall  negative  sign  in  the  trace  formula  for  the  zeros  of  the  Riemann  Zeta 
function,  indicating  that  the  Maslov  index  for  every  periodic  orbit,  irrespective  of  the 
harmonic,  is  TT.  Our  dynamical  formula  (47),  on  the  other  hand,  has  alternating  sign  with 
the  harmonic  number  k,  since  our  Maslov  index  is  Jbr.  This  anomaly  in  the  Maslov 
index  has  been  noted  in  the  literature  [19].  Berry  has  speculated  that  it  may  not  be 
possible  to  overcome  this  difficulty  with  a  Hamiltonian  that  is  separable  into  kinetic  and 
potential  terms. 

Our  next  task  is  to  perform  a  quantum-mechanical  calculation  of  6p(E)  to  check  the 
validity  of  the  semiclassical  result 
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6.2  The  quantum  density  of  states 

We  seek  to  obtain  the  quantum  density  of  states  for  the  two-dimensional  system  given 
by  Eq.  (41).  Since  there  is  no  interaction  coupling  the  two  degrees  of  freedom,  the 
desired  density  of  states  may  be  readily  obtained  by  convoluting  the  quantum  density  of 
the  harmonic  oscillator  in  the  jc-direction  with  that  of  the  inverted  oscillator  in  the  v- 
direction.  The  density  of  states  of  the  harmonic  oscillator  with  oscillator  constant  uj\  is 
given  by 


-  (n  +  l/2)«wi),  (48) 

«=o 

where  the  energy  E  >  0.  The  quantum  density  of  states  of  an  inverted  harmonic  oscillator 
has  been  examined  carefully  by  Barton  [12]  and  is  given  by 

P2(E)  =  -  J_K*(l/2  +  iE/fiu*),  (49) 


where  \I/(z)  =d/dzlnF(z)  is  the  digamma  function.  In  the  above  equation,  we  have 
omitted  a  constant  term  2/ir  In  L  that  arises  from  confining  the  system  in  a  large  box  of 
length  L.  To  compare  with  the  semiclassical  density  of  states  given  by  Eq.  (47),  we 
convolute  the  quantum  densities  (48)  and  (49)  for  a  given  energy  E  >  0.  This  total  energy 
E  is  shared  between  the  two  degrees  of  freedom  of  this  conservative  system.  The 
convoluted  density  p(E]  of  the  system  is 

p(E}=   I     Pl(E-E'}p2(E'}dEf.  (50) 

J  —  CO 

Note  that  the  lower  limit  in  the  integration  is  —  oo,  since  a  negative  energy  E'  of  the 
inverted  oscillator  may  add  up  with  the  energy  (E  —  E'}  of  the  harmonic  oscillator  to  still 
yield  the  energy  E  >  0  of  the  total  system.  Substituting  from  Eqs.  (48)  and  (49)  above, 
we  get 


We  proceed  to  examine  to  what  extent  the  semiclassical  formula,  given  by  Eq.  (47), 
agrees  with  the  quantum  result,  Eq.  (51). 

6.3  Comparison  of  the  semiclassical  and  quantum  formulae 

The  quantum  formula  given  above  is  for  the  full  density  of  states,  comprising  both  the 
smooth  and  the  oscillating  parts,  whereas  the  semiclassical  result  (47)  is  only  for  the 
oscillating  part  8p(E).  It  is  therefore  necessary  to  subtract  off  the  smooth  part  of  the 
density  of  states  from  the  quantum  result  before  a  comparison  is  made.  The  smooth  part  is 
most  readily  obtained  numerically  by  performing  a  Strutinsky-type  Gaussian  smoothing 
[20]  on  the  quantum  density.  The  method  consists  of  smoothing  the  quantum  density  of 
states,  given  by  Eq.  (51),  by  a  Gaussian  of  width  7,  and  modulated  by  a  curvature 
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Figure  5.  The  quantum  density  of  states  (wavy  curve),  given  by  Eq.  (51),  and  the 
Strutinsky-smoothed  density,  Eq.  (52)  (dashed  line),  are  plotted  as  a  function  of  energy 
E.  Only  the  representative  case  LJ\  =  u>2  =  10  is  shown.  The  smoothing  width  7  is 
chosen  to  be  20.  The  level  density  is  negative  because  a  large  positive  constant  term, 
2/7rln  L,  where  L  »  E,  has  been  omitted. 


function  Pn[(E-E'}/j}: 


fff  E-/N 

P       IL~E       }dp> 

rn  \  | uc  . 

7 


(52) 


where  the  smoothing  width  7  is  larger  than  the  spacing  u\  of  the  oscillator  shells.  The 
curvature  function  Pn  for  Gaussian  smoothing  is  given  by  the  associated  Laguerre 
polynomial  of  order  2n:  - 

pn(x}  =  L1/2^).  (53) 

The  curvature  function  Pn(x)  ensures  the  internal  consistency  of  the  smoothing 
procedure  if  p  is  a  polynomial  of  order  (2n  +1).  In  our  calculation,  a  curvature 
function  with  2n  =  2  is  sufficient  to  yield  an  accurate  p(£).  The  smoothing  width  7  was 
taken  to  be  20  and  the  oscillator  spacing  u>i  was  kept  fixed  at  10  (for  h  =  1).  The 
oscillating1' part  of  the  density  of  states  Sp(E)  is  obtained  by  taking  p(E)  -  p(E).  This  is 
compared  with  the  semiclassical  form  (47)  as  a  function  of  E  for  various  values  of  the 
ratio  W2/wi.  The  results  of  the  calculation  are  displayed  in  figures  5  and  6.  The 
agreement  between  the  quantum  and  semiclassical  densities  is  excellent,  and  the 
difference  is  almost  negligible. 
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Figure  6.  Comparison  of  the  quantum  and  semiclassical  oscillating  density  of  states 
8p(E),  shown  by  the  continuous  and  the  dashed  curves,  for  different  ratios  of  u>i[(jj\ 
given  in  each  panel.  The  quantum  result  is  obtained  by  subtracting  the  Strutinsky- 
smoothed  density,  Eq.  (52),  from  p(E],  given  by  Eq.  (51),  with  7  =  20  and  huj\  =  10. 
The  semiclassical  density  8pci  is  obtained  from  Eq.  (47)  with  kmax  —  100. 


Hence  we  may  write,  following  Bogomolny  [21] 


An  instructive  way  to  analyze  the  semiclassical  formula  (47)  is  to  construct  a 
dynamical  (or  Selberg)  zeta  function  £5(,s)  from  it,  and  quantize  its  energies  by  setting  £$ 
to  zero.  Let  us  define 

Ms).  (54) 


(55) 


where  8pci(E]  is  given  by  Eq.  (47).  Note  that  the  sign  of  6pci  in  the  above  equation  is 
opposite  to  that  of  the  analogous  Eq.  (34)  for  the  Riemann  zeta  function.  In  the  latter,  as 
was  discussed  in  the  earlier  section,  there  is  an  overall  minus  sign  in  the  trace  formula 
(12)  for  8p  ,  which  may  be  regarded  as  arising  from  a  Maslov  index  of  TT  common  to  every 
periodic  orbit  [19].  This  extra  negative  sign  is  disregarded  in  (55),  since  such  a  common 
Maslov  index  does  not  arise  in  the  dynamical  case. 
From  Eqs.  (47)  and  (55),  we  deduce  that 


2 


^     ' 


(57) 


2smh(*/2) 
and  summing  the  series  over  k,  we  obtain 

9S(E]  =  -9  fl  +expE  +  i/+  (58) 


From  Eq.  (54),  0$  is  the  same  as  -Of  In  &,  so  we  obtain  for  the  Selberg  zeta  function  the 
expression 

/  n-  \    \ 

1+exP          (£  +  ^+1/2)^2       .  (59) 


=0  v^i 

The  quantized  energies  are  obtained  by  setting  this  equal  to  zero,  giving 

E=(n  +  l/2)Rwi  -  i(l  +  l/2)hu2,  (60) 


where  /,  n  take  on  positive  integral  values  including  zero.  Note  that  in  our  case,  the 
quantized  energies  E  are  complex.  It  is  gratifying  to  see  that  the  semiclassical  Gutzwiller 
trace  formula  is  so  successful  in  describing  the  oscillating  density  of  states  in  the 
continuum.  To  our  knowledge,  such  an  application  has  not  been  made  before.  It  is  all  the 
more  interesting  because  this  toy-model  allows  us  to  construct  a  convergent  dynamical 
zeta  function.  Furthermore,  the  model  itself  has  some  relevance,  for  asymptotically  large 
a,  to  the  Riemann  zeta  function. 
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Exotic  spectra,  wavefunctions  and  transport  in 
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Abstract.  We  present  numerical  results  on  a  range  of  related  issues  for  a  number  of 
incommensurate  TBM's,  each  of  which  shows  a  metal-insulator  type  transition  as  a  binding-to- 
hopping  ratio  is  made  to  increase  through  some  limiting  value.  These  supplement  a  series  of  similar 
results  on  a  couple  of  ID  lattices  in  a  number  of  recent  works  (see  below).  A  brief  review  pertaining 
to  spectral  properties  and  wavefunctions  in  incommensurate  lattices  is  followed  by  results  on  the 
above  TBM's  relating  to  an  interesting  correlation  between  the  gross  features  of  wavefunctions  and 
the  energies  arranged  in  a  particular  sequence  termed  the  lattice-ordered  sequence,  and  also 
between  the  lattice-ordered  energies  and  the  on-site  potentials.  We  present  a  qualitative  explanation 
of  these  correlations  on  the  basis  of  perturbation  theory.  Basic  results  on  dynamics  of  wavepackets 
in  relation  to  spectral  characteristics  of  incommensurate  TBM's  are  also  reviewed.  Features  of 
lattice-ordered  energies  and  wavefunctions  for  the  TBM's  under  study  are  used  in  the  framework  of 
the  so-called  Maryland  construction,  leading  to  a  qualitative  prediction  of  criteria  for  recurrent  and 
non-recurrent  wavepacket  dynamics  in  these  lattices,  and  these  predictions  are  checked  against 
numerical  iterations  of  the  relevant  'quantum  maps'.  Closely  related  to  the  dynamics  of 
wavepackets  are  the  transport  properties  of  these  lattices.  Results  are  available  to  indicate  that  the 
unusual  spectral  characteristics  of  pseudorandom  lattices  lead  to  novel  features  in  transport 
properties  of  these  systems.  In  this  context,  low  temperature  a.c  conductivity  in  these  lattices  is  a 
good  probe  for  the  spectral  characteristics  and  wavefunctions.  However,  not  much  is  known  about 
the  a.c  conductivity,  excepting  a  set  of  early  results  pertaining  to  the  low  frequency  regime, 
principally  because  of  the  fact  that  the  a.c  conductivity  depends  on  global  characteristics  of  the 
spectrum  and  the  entire  set  of  wavefunctions.  We  present  a  simple  model  whereby  the  gross 
structure  of  variation  of  the  a.c  conductivity  with  frequency  can  be  obtained  from  a  knowledge  of 
the  spectrum  alone  for  the  set  of  TBM's  under  consideration.  Numerical  computations  show  that 
despite  its  simplicity,  the  model  leads  to  results  in  good  agreement  with  those  from  the  Kubo- 
Greenwood  formula  for  a.c  conductivity. 

Keywords.  Tight  binding  model;  aperiodic  lattice;  localization;  m-i  transition;  lattice  ordered 
sequence;  transport  properties;  ac  conductivity;  wave  packet  dynamics. 

PACS  Nos    71.50;  05.30;  71.55;  71.30;  05.45;  72.15 

1.  Introduction 

Deterministic  aperiodic  lattices  are  interesting  objects  from  both  theoretical  and  applied 
viewpoints.  Fabrication  of  structures  of  various  descriptions,  intermediate  between 
amorphous  and  periodic  ones,  are  now  within  the  realm  of  technical  possibility.  The  first 
metastable  quasicrystals  were  grown  by  Schechtman  et  al  [I].  Modulated  2D  structures 
were  first  prepared  by  Weiss  et  al  [2]  by  implanting  arrays  of  scatterers  in  a 
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semiconductor  heterojunction.  Techniques  are  now  available  to  grow  superlattices,  i.e., 
periodically  modulated  structures  with  large  lattice  constants,  in  heterojunctions.  Under 
the  influence  of  imposed  magnetic  fields,  electrons  in  these  superlattices  behave  as  if  they 
were  in  a  quasiperiodic  environment. 

Even  ID  deterministic  aperiodic  lattice  structures  are  now  at  hand,  following  the 
successful  fabrication  of  semiconducting  superstructures  grown  epitaxially  in  accordance 
with  the  rules  of  the  Thue-Morse  and  Fibonacci  sequences  [3].  A  large  variety  of 
experimental  data  is  presently  available  on  epitaxially  grown  superlattice  structures  on 
the  basis  of  X-ray  and  neutron  diffraction,  Raman  scattering,  and  similar  other  techniques 
(see,  e.g.,  [3,  4]  for  an  introduction).  Concurrently,  one  finds  extensive  literature  relating 
to  theoretical  studies  on  ID  quasirandom  lattices  perfomed  during  the  last  decade  and  a 
half.  A  large  fraction  of  these  studies  pertain  to  simulations  of  electronic  states  resulting 
from  tight  binding  models  (TBM's),  the  initial  lead  in  this  direction  having  been  provided 
by  Hofstadter  [5],  Aubry  and  Andre  [6],  Kohmoto  et  al  [7]  and  Ostlund  et  al  [8].  These 
early  works  already  established  exotic  features  of  solutions  of  the  Schrodinger  equation 
on  a  quasiperiodic  lattice,  namely   Cantor  structures   of  spectra,   and  the  critical, 
algebraically  localised,  nature  of  wavefunctions.  In  this  context,  most  studies  on  ID 
incommensurate  lattices  relate  to  Anderson-Mott  localisation  in  random  lattices  on  the 
one  hand,  and  to  Bloch-wave  solutions  in  periodic  lattices  on  the  other.  One  finds  that  a 
subtle  interplay  between  contrary  features  of  localised  and  extended  states,  or  those  of 
pure  point  and  continuous  spectra,  indeed  lends  a  rich  variety  to  the  physics  of  excitations 
on  incommensurate  lattices. 

Lateral  surface  superlattices  and  ID  semiconductor  heterostructures  offer  the  welcome 
opportunity  of  experimentally  studying  the  classical-  to-quantum  cross-over  in  the  time- 
evolution  of  electronic  states.  This  has  caused  a  resurgence  in  theoretical  and  numerical 
investigations  on  quantum  dynamics  in  situations  where  the  underlying  classical  motion 
is  irregular  or  chaotic.  This  is  the  area  of  'dynamical  quantum  chaos1  which  initially 
received  attention  through  studies  on  the  kicked  rotator  [9].  Ideas  of  Anderson-Mott 
localization,  when  applied  to  results  of  early  investigations  in  this  area,  led  to  the  concept 
of  'dynamic  localisation'  [10],  and  interest  in  this  field  was  about  to  dry  up  when  the 
whole  question  was  reopened  with  investigations  on  quantum  dynamics  on  deterministic 
aperiodic  structures,  and  with  the  observation  that  there  is  no  direct  correlation  between 
classical  chaos  and  quantum  diffusion  (see,  e.g.,[ll])  in  these  structures. 

Closely  related  to  the  exotic  features  of  excitations  and  wavepacket  dynamics  in 
quasirandom  systems  is  the  question  of  transport  properties  in  deterministic  aperiodic 
media.  In  this  context,  novel  transport  properties  appear  to  be  the  order  of  the  day,  not 
only  in  relation  to  electronic  states,  but  to  vibrational  and  electromagnetic  waves  as  well. 
For  instance,  interwoven  regimes  involving  phonons  and  localised  vibrational  modes 
(fractons)  lead  to  interesting  effects  [12,  13].  The  mode  of  attenuation  of  a  wave 
transmitted  through  a  finite  ID  chain  depends  sensitively  on  the  type  of  eigenstates 
supported  by  the  chain  [14, 15].  All  these  observations  point  to  interesting  possibilities  in 
fabrication  of  materials  having  new  and  useful  properties. 

A  ^  ^^  mterest  SK  to®  ac  t™™port  properties  of  deterministic  aperiodic  chains. 
Apart  from  the  possible  future  use  of  novelties  in  a.c  transport  in  incommensurate  or 
quasirandom  systems,  there  is  great  theoretical  interest  in  the  study  of  a.c  transport 
properties  [16,  17].  Such  a  study,  on  the  other  hand,  is  beset  with  difficulties  of  quite 
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enormous  magnitude  since  the  a.c  transport  depends  on  global,  as  opposed  to  local, 
features  of  the  excitation  spectrum  and  eigenfunctions  relevant  to  the  lattice  under 
consideration. 

We  address  in  this  paper  a  range  of  questions  relating  to  the  above  context.  In  §  2  we 
point  out  an  approximate  correlation  between  energies  and  localised  eigenfunctions  in  a 
number  of  incommensurate  TBM's  showing  metal-insulator  (m-i)  type  transitions.  This 
correlation  will  form  the  basis  of  all  subsequent  discussion  in  the  paper.  Section  3 
includes  a  short  review  of  a  few  interesting  results  on  quantum  wavepacket  dynamics  in 
incommensurate  systems  and  its  correlation  with  the  fractal  properties  of  underlying 
spectra.  We  then  present  in  this  section  a  brief  study  on  wavepacket  dynamics  under 
periodic  kicking  in  the  incommensurate  TBM's  considered  in  §  2.  Section  4  addresses  the 
question  as  to  how  far  the  gross  features  of  variation  of  the  a.c  conductivity  in  an 
incommensurate  lattice  can  be  inferred  from  a  knowledge  of  the  spectrum  alone.  As  we 
shall  see,  the  correlation  between  energies  and  eigenfunctions  presented  in  §  2  will  enable 
us  to  infer  that  this  can  indeed  be  done  to  a  certain  extent,  thus  paving  the  way  towards  a 
better  future  understanding  of  this  as  yet  largely  uncharted  territory.  Section  5  includes 
summary  and  concluding  remarks. 

The  work  contained  in  the  present  paper  is,  in  the  main,  based  on  a  number  of  previous 
papers  [18-22].  The  first  of  these  papers  contained  preliminary  results  concerning  the 
spectrum,  density  of  states,  and  localisation  features  in  a  self-similar  TBM  based  on  a 
universal  recursive  sequence  encountered  in  Feigenbaum's  theory  of  period  doubling 
bifurcations  [23].  The  subsequent  papers  in  the  series  presented  results  relating  to  a 
simpler  self-similar  TBM  termed  the  'Cantor  lattice',  and  to  the  Aubry  model.  We  present 
in  this  paper  evidence  relating  to  a  number  of  other  lattices  admitting  of  m-i  type 
transitions,  to  establish  that  the  results  arrived  at  in  the  earlier  papers  are  not  peculiar  to 
the  particular  lattices  considered  therein. 


2.  Spectra  and  wavefunctions  in  ID  incommensurate  TBM's 

2.1  Metal-insulator  type  transitions  in  ID  TBM's 

TBM's  have  been  widely  studied  in  the  literature  as  useful  objects  mimicing  real  life 
aperiodic  and  incommensurate  systems.  In  particular,  despite  their  extreme  artificiality, 
ID  TBM's  have  led  to  important  insights  based  on  exact  results  and  a  wealth  of  numerical 
findings.  They  have  been  fruitfully  employed  to  explore  the  whole  spectrum  of  structures 
ranging  from  periodic  to  random  ones. 

A  ID  TBM  with  nearest  neighbour  hopping  is  typically  represented  by  an  equation  of 
the  form 

rVn^n  +  X(ljJn+l  +  V>n_l)  =  E^n,  (1) 

where  {Vn}  is  some  specified  sequence  of  on-site  potentials,  r  a  binding  strength,  and  A 
the  nearest  neighbour  hopping  strength  (for  introductory  background  on  the  tight  binding 
method  in  solid  state  theory  see,  e.g.,  [24];  properties  of  ID  TBM's  with  random  on-site 
potential  sequences  are  presented  in  [25-27]).  In  what  follows,  the  binding-to-hopping 
ratio  (r/A)  will  be  denoted  by  z/. 
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smtaoie  sequences  \vn$,  it  admits  of  a  set  of  essentially  complex  features  like  fractal  or 
singular-continuous  spectra  (see  Simon  and  Reed  [28]  for  an  explanation  of  the  term,  this 
reference  establishes  the  absence  of  singular  continuous  spectra  in  a  large  class  of 
quantum  mechanical  problems),  self-similar  and  critical  wavefunctions,  coexisting 
spectra  of  pure  point,  singular-continuous  and  absolutely  continuous  varieties,  and  m-i 
type  transitions. 

An  idea  of  the  complexities  can  be  had  from  rigorous  results  on  the  TBM  with  a 
quasiperiodic  sequence  of  on-site  potentials 

(2) 


where  a  is  an  irrational  number  and  9  is  a  phase  parameter.  For  instance,  with  v  >  2,  the 
spectrum  has  been  shown  to  be  singular-continuous  when  a  is  a  Liouville  number,  the 
integrated  density  of  states  being  a  Devil's  staircase.  More  generally,  the  singular 
continuous  spectrum  has  been  shown  to  have  a  Cantor  set  structure  for  a  dense  set  of  pairs 
(v,6)  [29-31].  Aubry  and  Andre  [6]  established  that,  for  irrational  a,  a  transition  from 
extended  to  localised  wavefunctions  occurs  at  the  value  v  —  vc  =  2  (the  'self-dual' 
situation)  which  is  thus  a  critical  parameter  in  a  m-i  type  transition. 

A  TBM  with  quasiperiodic  on-site  potentials  (2)  has  been  termed  the  Aubry  model  in 
the  literature.  An  alternative  nomenclature  for  the  incommensurate  lattice,  used  in 
refs  [19-22],  is  the  quasiperiodic  lattice  (QPL).  For  the  self-dual  situation  v  —  2,  it  is  also 
termed  Harper's  model,  which  represents  the  Schrodinger  equation  for  an  electron  in  a 
periodic  potential  and  in  a  competing  magnetic  field,  and  some  of  whose  exotic  features 
were  first  pointed  out  by  Hofstadter  [5]. 

While  the  wavefunctions  in  the  QPL  with  v  ^  2  are  either  all  exponentially  localised 
or  all  extended,  coexisting  states  are  encountered  in  some  other  models,  e.g.,  one 
investigated  by  Soukoulis  and  Economou  [32].  In  particular,  a  class  of  'slowly  varying' 
potentials  (see  below)  show  mobility  edges  in  their  spectra  as  a  common  feature  for  a 
range  of  hopping-to-binding  ratios  and,  at  the  same  time,  yield  a  m-i  type  transition. 

2.2  'Slowfy  varying'  potentials  and  m-i  type  transitions 

Interesting  spectral  features  are  obtained  for  a  number  of  TBM's  with  on-site  energies 
possessing  self-similar  and  hierarchical  structures.  Roy  and  Lahiri  [18]  investigated  a 
TBM  with  a  self-similar  'Feigenbaum'  structure  in  which  the  sequence  of  on-site  energies 
is  given  by 


(3) 

where  g(x)  is  a  well-known  universal  function  encountered  in  the  scaring  theory  of 
period-doubling  bifurcations  and  gW  represents  the  n-times  iterate  of  g.  The 
deterministic  aperiodic  lattice  represented  by  this  TBM  will  be  termed  the  Feigenbaum 
lattice  (FL). 

A  simpler  model  with  a  self-similar  structure  is  the  so-called  Cantor  lattice  (CL) 
considered  in  refs  [19-22],  where  the  on-site  potentials  are  generated  by  the  algorithm 

Vi=0;     ^-1  =  ^/3;      V*  =  V7n.l  +2/3.  (4a,  b,  c) 
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odic  with  periods  2*  for  all  integers  k  >  0,  the  periodicity  being  more 
larger  and  larger  k. 

CL  both  have  a  hierarchical,  self-similar  structure  and,  as  indicated 
litatively  similar  results  relating  to  spectra  and  wavefunctions.  Other 
milar,  hierarchical,  and  algorithm-generated  structures  have  also  been 
literature  (see,  e.g.,  refs  [15],  [33]).  Vibrational  properties  of  a  binary 
Dr  chain  involve  interesting  phonon  and  fracton  spectra  [12-13].  While 
:utional  chains  (e.g.,  the  Fibonacci  chain  and  the  Thue-Morse  chain) 
discussed  in  the  literature,  we  shall  not  refer  to  such  lattices  in  the 

i  lattice  and  the  Cantor  lattice  both  show  a  m-i  type  transition,  but 
radual  and  more  complex  than  that  met  with  in  the  QPL.  For  either 
at  low  values  of  the  binding-to-hopping  ratio  (z/),  the  spectrum  is 
ided  states,  but  one  finds  in  the  spectrum  sparsely  distributed  localised 
ith  increasing  v  the  measure  of  localised  states  increases  and,  for 
>  above  a  certain  limiting  value  (/>c)  all  states  become  localized.  The 
low  which  a  hierarchy  of  mobility  edges  are  encountered  in  the 
to  lie  somewhere  between  5  and  6  for  the  FL  and  between  3.0  and  3.5 

'L  and  the  CL  have  qualitatively  similar  structures  and  correspondingly 
tics,  the  latter  is,  in  a  sense,  simpler  than  the  former  in  that  the  middle 
las  a  single  scaling  over  the  entire  interval  [0,  1]  and  is  globally  self- 
set  of  points  g^(0)  is  only  locally  self-similar  and  has  numerous  (in 
ny)  scale  factors  varying  over  the  real  line  (see  [18]  and  references 
details). 

mce  concerning  the  m-i  type  transition  in  the  FL  is  to  be  found  in  [18] 
.,  has  been  briefly  indicated  in  [19].  The  on-site  potential  sequences  in 

share  with  the  QPL  the  common  feature  of  being  'slowly  varying' 
tance,  the  sequence  for  the  CL  can  be  looked  upon  as  a  long-range 

periodic  potential  of  period  2k  for  sufficiently  large  k. 
ig  potential  of  the  form 

:os27r(cw5  +  0),     (a, 6     as  in  (2),  6  <  1),  (5) 

y  Das  Sharma  et  al  [34]  (this  being  essentially  the  so-called  'cosine 
dered  by  Griniasty  and  Fishman  [35]  and  Brenner  and  Fishman  [36])  in 
•  a  WKB  type  analysis,  which  was  shown  to  yield  correct  results 
-i  type  transition  observed  numerically  in  the  model:  for  v  <  2  there 
•bility  edges  separating  a  band  of  Bloch  wave  type  extended  states  from 
onentially  localised  states;  as  v  increases  towards  the  value  vc  =  2,  the 
ressively,  till  it  disappears  and  all  states  become  exponentially  localised 
this  paper  we  shall  also  consider  a  TBM  based  on  the  'slowly  varying' 
equence 

fcos27r(an  +  0)+icos27r(2cm  +  20),  (6) 
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0  =  0  and  a  =  (\/5  -  l)/2  in  (2),  (5)  and  (6)).  This  model  also  shows  mobility  edges  in 
the  spectrum  and  a  ni-i  type  transition  as  v  is  increased  beyond  a  limiting  value  vc  slightly 
above  1.5. 

In  the  following,  the  ID  TBM's  with  on-site  potentials  given  by  (5),  (6),  will  be  termed 
the  cosine  lattice  (CSL)  and  the  Soukoulis-Economou  lattice  (SEL)  respectively.  Results 
presented  in  this  paper  will,  in  the  main,  pertain  to  the  FL,  CSL,  and  the  SEL,  while  a  few 
results  on  the  CL  and  the  QPL  will  be  quoted  for  the  sake  of  comparison. 

Figures  l(a,b,c)  indicate  the  m-i  transitions  in  the  FL,  SEL,  and  the  CSL  respectively 
by  showing,  in  each  case,  the  variation  of  the  inverse  localisation  lengths  (7)  computed 
from  the  Thouless  formula  (see  (38)  in  §4  below),  against  energy  eigenvalues  computed 
numerically,  for  a  number  of  values  of  the  parameter  v.  One  observes  for  the  FL,  for 
instance,  that  for  v  =  5.0,  there  appear  to  be  a  few  extended  states  at  one  edge  of  the 
spectrum  while,  for  v  =  6.0,  all  states  appear  to  be  localised,  with  vc  apparently  lying 
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Figure  1.  Inverse  localization  length  (7)  against  energy  (E1)  for  (a)  the  Feigenbaum 
lattice  (FL)  with  (from  bottom  to  top)  v  =  5.0,5.5,6.0,  and  6.5  respectively,  (b)  the 
Soukoulis-Economou  lattice  (SEL)  with  (from  bottom  to  top)  v  —  1.5,3.0,4.5,  and 
6.0  respectively,  (c)  the  cosine  lattice  (CSL)  with  (from  bottom  to  top) 
v  —  1.0,2.0,3.0,  and  4.0  respectively.  In  each  of  these  lattices  the  lowermost  curve 
includes  extended  states  in  the  spectrum,  the  next  curve  is  for  a  value  of  v  close  to  vc 
(exactly  i/c  for  the  CSL),  while  the  other  two  correspond  to  the  localised  regime.  Here, 
as  in  all  subsequent  figures,  the  lattice  size  is  N  =  1023  and  we  choose  A  =  1.0  (see 
(1)).  The  energies  have  been  obtained  by  direct  diagonalization  while  the  inverse 
localization  lengths  have  been  computed  from  the  Thouless  formula. 


close  to  5.5.  For  the  SEL  one  observes  narrow  bands  of  extended  states  at  v  =  1.5,  which 
disappear  rapidly  as  v  is  made  to  increase  further  so  that  with  v  =  3.0,  for  instance,  there 
appear  to  be  no  extended  states  in  the  spectrum.  For  the  CSL,  on  the  other  hand,  the 
transition  is  sharp  and  occurs  at  z/c  =  2.0. 


2.3  Peaks  of  localised  wavefunctions:  Lattice-ordered  energy  sequence 

Of  especial  interest  in  this  paper  will  be  a  set  of  results,  reported  in  refs  [19-22]  for  some 
of  these  models,  for  values  of  v  beyond  the  limiting  value  i^c,  when  all  the  states  are 
exponentially  localized.  We  summarize  the  principal  features  of  these  results  as  follows: 

(i)  for  sufficiently  large  v  above  i/c,  the  spectrum  of  eigenvalues  is  remarkably  similar  to 

the  set  of  on-site  potentials  {VM}; 
(ii)  there  exists  a  certain  ordering  of  the  eigenvalues,  called  lattice -ordering,  for  which 

the  locations  of  the  peaks  of  the  corresponding  wavefunctions  are  strongly  correlated 

with  the  energies. 

Thus,  the  density  of  states  resembles  very  closely  the  density  of  distribution  of  the  on- 
site  energies.  For  instance,  figures  2(a,b)  present  the  density  of  states  and  the  density  of 
on-site  potentials  for  the  Feigenbaum  lattice  with  v  =  7.0,  wherefrom  the  resemblance  in 
self-similarity  between  the  sets  of  eigenvalues  and  of  the  on-site  potentials  is  quite 
apparent.  Significantly,  the  value  of  v  used  here  is  not  very  much  above  the  limiting 
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Figure  2.  (a)  Density  of  states  /?(£)  as  a  function  of  energy  E  and  (b)  density  of  on- 
site  potentials  p(V)  as  a  function  of  V  for  the  FL  with  v  =  7.0.  Both  the  densities 
appear  to  be  similar.  Both  densities  are  normalized  to  maximum  value  unity. 

value,  which  means  that  one  does  not  have  to  move  far  beyond  the  m-i  transition  for  the 
above  interesting  features  to  be  evident.  As  another  instance  of  the  similarity  being 
re  erred  to  arranging  the  set  of  eigenvalues  in  an  ascending  sequence  {en},  and 
calculating  (mmn(en+2  -  ~enl}/  mmn(en+l  -  gB)),  for  the  CL  with  v  =  6.0  and  with  even 

srTrn  1  r/,"2  '  °ne  finds  a  ratio  ^  dose  to  ™>  confirming  that  the 
spectrum  has  mdeed  the  same  scaling  property  as  the  Cantor  set  of  on-site  potentials.  In 
pamcular,  the  entire  spectrum  is  singular-continuous  in  this  case  as  also  for  the  FL  above 

^to^TT  ^1S  respecti  the  situation  is  similar  to  *** in  case  °f  Ae  QpL  with 

v  >  2.0  and  with  a  a  Liouville  number. 
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«fe*     *    » -~Ii          7"  "'  Ult  *  ^  Dually  involves  an  uncountable  number  of 
sites  and,  strictly  speaking,  the  same  is  true  for  incommensurate  lattices  like  the 
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-,  CSL,  and  SEL.  However,  in  the  present  paper  we  consider,  in  each  case,  some 
iciently  large  finite  truncation  of  the  lattice  under  consideration.  While  the  orderings 
rred  to  in  the  following  will  pertain  to  these  finite  truncations,  corresponding 
wrings  on  the  infinite  lattices  also  exist,  though  they  require  infinite  numbers  of 
ger-valued  indices  for  their  enumeration.] 

fot  only  do  the  eigenvalues  resemble  the  on-site  energies  as  point  sets  but  the 
mblance  goes  even  further:  in  particular,  the  two  are  remarkably  similar  when 
sidered  as  sequences,  following  a  particular  ordering  procedure.  Thus,  together  with 
sequence  {Vn}  of  on-site  potentials  describing  the  lattice  under  consideration,  we 
5ider  the  sequence  {Vn}  of  the  same  on-site  potentials,  but  this  time  arranged  in  an 
inding  order.  Let  the  integer  valued  function  /j,(n)  of  the  integer  variable  n  give  the 
iping  between  these  two  sequences  in  the  sense  that 

Vn  =  V,(n}.  (7a) 

further  the  energy  eigenvalues,  arranged  in  the  ascending  sequence,  be  denoted  by 
[.  Then  the  required  ordering  of  eigenvalues,  to  be  called  the  lattice-ordered 
lence,  is  the  sequence  {<?„}  defined  by 

en  =  ^(«)'  (7b) 

ther  words,  the  same  shuffling  that  yields  {Vn}  from  {Vn}  is  used  to  generate  {<?„} 

i  {«„}. 

ks  already  stated,  the  two  sequences  {en}  and  {Vn}  resemble  each  other  strongly. 
s,  for  the  QPL,  the  variation  of  the  lattice-ordered  energies  {en}  with  n  is  itself  found 
e  quasiperiodic  (see,  e.g.,  [20],  figure  2),  exactly  similar  to  the  variation  of  the  on-site 
mtials. 

/hat  is  more  important,  features  of  the  wavef unctions  corresponding  to  the  energies 
nged  in  the  lattice-ordered  sequence  are  strongly  correlated  to  the  energies 
nselves.  One  of  the  more  unusual  features  of  incommensurate  systems  is  that  the 
'efunctions  vary  extremely  erratically  as  one  moves  through  the  energy  scale.  In 
icular,  gross  features  of  the  wavefunction  like  the  locations  of  the  peaks  or  their 
ilisation  lengths  have  little  correlation  with  the  energies  when  the  latter  are  arranged 
te  ascending  or  any  other  arbitrarily  chosen  order.  For  instance,  figure  3  is  a  plot  of 
itions  N(H)  of  peaks  of  the  wavefunctions  correponding  to  energies  en  arranged  in  the 
ending  sequence  against  the  sequential  order  n  of  the  energies,  for  the  QPL  in  the 
ilized  regime  with  z/  =  4.,  which  show  that  N(H}  is  an  extremely  violently  fluctuating 
;tion  of  n. 

L  correlation,  however,  is  most  strikingly  revealed  when  one  observes  that  the  peak  of 
wavefunction  if)^  corresponding  to  the  energy  en  arranged  in  the  lattice-ordered 
ience  is  precisely  at  the  lattice  site  n,  with  only  few  exceptions.  In  other  words, 
oting  the  location  of  the  peak  of  t/>W  by  J\f(n)  one  has,  to  a  high  degree  of  accuracy, 

JV(n)  =  n.  (8) 

ares  4(a,b)  are  plots  of  A/"(n)  vs.  n  for  the  FL  and  the  SEL,  both  at  values  of  v  above 
respective  limiting  values  vc.  While  the  correlation  expressed  by  (8)  is  found  to  hold 
larkably  well  in  these  cases,  figure  5 (a)  is  the  same  plot  for  the  CSL,  where  one  finds 
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Figure  3.  N(H]  vs.  n  plot  for  the  QPL  with  v  =  4.0.  Here  N(U]  stands  for  the  peak  of 
the  wavefunction  corresponding  to  energy  en,  where  {en}  denotes  the  sequence  of 
energy  eigenvalues  arranged  in  the  ascending  order.  The  points  appear  to  be  scattered 
throughout  the  plot,  indicating  an  erratic  variation  of  N  with  n  implying  that  there  is  no 
correlation  between  the  energies  arranged  in  the  ascending  sequence  and  features  or 
the  corresponding  wavefunctions.  The  wavefunctions  have  been  obtained  by 
numerical  computation  of  eigenvectors. 

that  the  correlation  (8)  holds  only  rather  weakly.  Finally,  figure  5(b)  is  the  N(n)  vs.   n 
plot  for  a  random  lattice,  where  the  correlation  is  found  to  be   altogether  absent. 

A  first-hand  explanation  of  the  correlations  noted  above  can  be  had  from  a  simple 
perturbation  theoretic  argument.  Thus,  a  TBM  Hamiltonian  can  be  written  as 

H  =  T(HV  +  V-IHI],  '     (9) 

where  HQ  is  diagonal  with  diagonal  elements  forming  the  sequence  {  Vn},  and  HI  is  given 
in  the  corresponding  basis  by 

H^jn+lXnl  +  lnXn  +  ll.  (10) 

When  H  is  written  in  the  form  (9),  v~l  can  be  used  as  a  convenient  perturbation 
parameter,  and  for  z/"1  =  0  (infinitely  strong  binding),  the  eigenvalues  of  H  are  just  the 
Vn's,  with  the  \  rc)'s  being  the  corresponding  eigenvectors.  In  other  words,  in  this  limit, 
the  lattice-ordered  sequence  of  eigenvalues  is  trivially  the  same  as  the  sequence  of  on-site 
potentials,  and  i/ff  is  just  6^,  so  that  (8)  is  also  trivially  satisfied.  For  small  and  finite  v~l 
one  has  now  to  see  the  effect  of  the  perturbation  v~lH\. 

The  first  observation  is  that  the  perturbation  theory  is  applicable  only  in  the  regime  of 
localized  states  since  the  uncorrected  states  are  all  localized,  and  that  v~lH\  cannot  at  all 
be  looked  upon  as  a  perturbation  unless  the  corrections  as  predicted  by  the  perturbation 
series  (see,  e.g.,  [38])  are  actually  small.  Thus,  noting  that  the  first  order  correction  to  the 
energies  is  zero,  and  considering  the  second  order  correction,  since  (n\H\  \  m}  is  non- 
zero only  when  m  =  n±l,  the  validity  of  perturbation  theory  depends  on  how  the 
successive  uncorrected  energies  in  the  lattice-ordered  sequence,  i.e.,  the  V'ns,  are  clustered 
together.  It  is  precisely  here  that  the  incommensurate  lattices  we  are  considering  in  this 
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Figure  4.  M(n)  vs.  n  plot  for  (a)  the  FL  with  v  =  7.0  and  (b)  the  SEL  with  v  =  6.0 
(see  §2  (8)  for  explanation).  Plots  show  a  strong  correlation  between  the  energies 
arranged  in  the  lattice-ordered  sequence  and  the  location  of  peaks  of  corresponding 
wavefunctions.  Of  the  total  number  of  data  points  in  (a),  85.7%  fall  within  the  band 
N  =  n  ±  5,  while  the  corresponding  figure  for  (b)  is  60.2%. 

paper  differ  radically  from  the  random  lattice.  Thus,  for  the  Cantor  lattice,  there  is  a 
recurrence  in  the  VM's  only  at  large  separations,  and  successive  VVs  differ  considerably 
from  one  another,  so  that  the  second  order  perturbation  correction  is  always  small, 
whereas  in  the  case  of  the  random  lattice,  there  is  always  a  considerable  fraction  of 
neighbouring  Vn's  differing  by  small  margins. 

This  conclusion  is  not  modified  when  one  goes  over  to  higher  order  perturbation 
corrections.  A  typical  term  in  the  perturbation  series  has  a  denominator  involving 
differences  of  the  form  (Vn  —  Vm)  which,  for  the  CL,  is  small  only  when  |  m  —  n  |~  2k, 
for  k  large.  However,  every  such  small  denominator  is  offset  by  an  immensely  smaller 
numerator  of  the  form  (v~1}2  ,  so  that  the  perturbation  expansion  is  never  vitiated.  The 
situation  is  entirely  different  for  the  random  lattice,  for  which  there  is  no  long  range 
correlation  among  the  Vn's,  and  the  perturbation  series  makes  no  sense. 

Thus,  for  the  incommensurate  lattices  we  are  considering,  the  perturbation  theory  is 
applicable  in  the  regime  of  localised  states  and  so,  the  sequence  of  lattice-ordered 
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Figure  5.  JV(n)  vs.  n  plot  for  (a)  the  CSL  with  v  =  6.0,  and  (b)  a  random  lattici 
with  i>  =  5.0.  In  (a)  the  correlation  between  the  energies  in  the  lattice-ordere< 
sequence  and  the  location  of  the  peaks  of  the  corresponding  wavefunctions  is  found  t< 
be  weaker  compared  to  correlations  for  the  FL,  the  QPL  and  the  SEL  found  ii 
figure  5,  only  24.8%  of  data  points  lying  within  the  band  A/"  =  n  ±  5,  while  in  (b)  th 
correlation  appears  to  be  altogether  non-existent.  The  random  lattice  was  generate 
with  a  programme  yielding  numbers  randomly  in  the  interval  (0,1). 

energies,  which  arise  from  the  sequence  of  on-site  potentials  continuously  and  withoi 
level  crossings,  can  indeed  be  expected  to  be  qualitatively  similar  to  the  latter  sequenc 
itself. 

An  essentially  similar  argument  applies  to  the  case  of  the  eigenfunctions.  Here  agai 
the  uncorrected  functions  given  by  iffi  =  ^m  serve  as  good  starting  points  for  the  actui 
eigenfunctions  for  the  incommensurate  lattices  but  not  for  the  random  lattice.  Since  th 
effect  of  the  perturbation  is  continuous  in  the  strength  of  perturbation  and  since  tr 
location,  of  the  peak  is  an  integer  valued  functional  on  the  wavefunction,  it  is  real] 
exceptionally  stable  against  the  perturbation,  and  this  is  what  we  find  in  figures  4  and  ; 

We  close  this  section  with  the  remark  that  the  differences  in  degree  to  which  the  abo> 
correlations  are  found  to  hold  for  the  different  incommensurate  lattices  considered  in  th 
paper  (and  in  the  previous  series  of  papers,  refs  [19-22])  are  related  to  the  differences 
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the  extent  to  which  the  values  of  the  on-site  potentials  Vn  are  clustered  together.  More 
detailed  numerical  results  in  this  respect  will  be  presented  in  a  future  publication. 
These  correlations  will  form  the  basis  of  the  results  and  observations  to  follow. 


3.  Dynamics  of  wavepackets  and  quantum  non-recurrence 

3.1  Recurrent  and  non-recurrent  quantum  dynamics 

The  comparison  and  correspondence  between  classical  and  quantum  time-evolutions  has 
always  been  a  keenly  studied  subject.  While  'most'  classical  Hamiltonians  are  non- 
integrable,  and  chaotic  trajectories  are  encountered  generically  in  classical  motions, 
quantum  dynamics  governed  by  the  Schrodinger  equation  is,  by  contrast,  integrable  [39, 
40].  However,  this  'simplicity'  in  quantum  dynamics  is  offset  by  the  fact  that  the  state 
space  is  infinite  dimensional.  As  will  be  seen  below,  this  gives  rise  to  an  essential 
complication  in  quantum  time  evolution  even  in  a  time-independent  problem,  as  soon  as 
one  tries  to  look  into  the  evolution  of  a  wave-packet  composed  of  stationary  states. 

An  essential  characteristic  of  classical  chaos  is  that  it  is  manifested  only  as  an 
asymptotic  feature  of  the  motion  in  which  a  trajectory  explores  the  phase  space  on  a  finer 
and  finer  scale  in  progressively  larger  time  intervals.  Quantum  interference  rules  out  a 
corresponding  occurrence  as  soon  as  a  non-zero  n  appears  in  the  theory.  This  is  usually 
interpreted  as  the  'absence  of  quantum  chaos'.  The  presence  of  chaos  in  the  classical 
regime  and  its  absence  -  in  the  above  sense  -  in  the  quantum  regime  do  not  contradict  the 
quantum-classical  correspondence  principle  because  of  the  fact  that  the  limits  H  —>  0  and 
t  ->  oo  do  not  commute  with  each  other  (see  e.g.,  [40]). 

However,  even  though  an  infinitely  detailed  exploration  of  the  phase  space  is  ruled  out 
in  the  quantum  regime,  there  is  nothing  in  principle  to  rule  out  a  similarly  detailed 
exploration  of  the  Hilbert  space,  and  this  may  quite  possibly  lead  to  a  complicated 
dynamics  peculiar  to  the  quantum  regime.  In  any  case,  the  phase  space  as  such  is  not  a 
well-defined  or  useful  concept  in  the  quantum  regime  and  as  we  shall  see  below, 
wavepackets  do  indeed  evolve  in  certain  situations  in  a  diffusive  manner,  which  may  be 
looked  upon  as  some  sort  of  'chaotic'  dynamics,  especially  so  since  it  entails  a  decay  of 
the  relevant  auto-correlation  function.  Such  a  diffusive  quantal  time  evolution  occurs 
when  there  are  accumulation  points  in  the  underlying  spectrum  and  involves 
progressively  detailed  and  fine  exploration  of  the  spectrum  with  the  passage  of  time.  It 
is  here  that  the  infinite  dimensionality  of  the  Hilbert  space  plays  a  crucial  role.  Recent 
years  have  witnessed  a  shift  towards  this  area  of  discourse  in  investigations  on -dynamic 
quantum  chaos.  Thus,  while  earlier  studies  looked  principally  into  possible  signatures  of 
classical  chaos  in  quantum  time  evolution,  more  recent  studies  are  based  on  a  recognition 
of  the  distinctness  of  the  two  regimes  and  of  essentially  independent  criteria,  involving 
distinct  time  scales,  governing  classical  and  quantum  chaos. 

The  two  traditional  areas  of  investigation  in  'quantum  chaos'  have  been  the  'static'  and 
the  'dynamic'  problems  respectively  -  the  former  dealing  with  signatures  of  classical 
chaos  in  quantum  spectra  and  stationary  wavefunctions  and  the  latter  with  the  possible 
existence  of  chaos  in  quantal  time  evolution.  While  we  have  singled  out  the  latter  area  in 
the  above  discussion  and  will,  in  the  remainder  of  this  section,  continue  to  dwell  mainly 
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set  oi  parameter  values  characterizing  the  Harmltonian.  As  already  mentioned,  most 
results  in  these  investigations  pointed  to  what  is  now  known  as  quantal  suppression 
of  classical  chaos  or,  equivalently,  'dynamic  localization'.  This  essentially  means  that 
asymptotically,  i.e.,  for  large  times,  a  wavepacket  evolves  in  an  effectively  quasiperiodic 
manner.  This  is  the  hall-mark  of  integrable  dynamics,  and  is  reminiscent  of  KAM-type 
time-evolution  in  classical  phase  space.  More  precisely,  a  typical  wavepacket  first 
undergoes  an  apparently  diffusive  spread  reflecting  the  chaos  in  the  underlying  classical 
dynamics  but,  after  the  lapse  of  a  characteristic  time  scale,  there  occurs  a  cross-over 
to  a  typically  quasiperiodic  regime.  The  problem  of  quantal  time-evolution  can  be 
mapped  to  an  equivalent  stationary  problem  on  what  may  be  termed  the  'energy  lattice', 
and  can  then  be  discussed  in  the  framework  of  Anderson-Mott  theory  of  localization. 
If  this  equivalent  problem  yields  localized  states  on  the  energy  lattice  then  the  time- 
evolution  of  the  wavepacket  in  the  original  dynamical  problem  may  be  taken  to 
be  quasiperiodic.  It  is  in  this  sense  that  one  talks  of  'dynamic  localization'  in  quantum 
time  evolution.  In  this  case  the  set  of  stationary  states  contributing  to  the  evolving 
wavepacket  fluctuates  with  time  within  limits,  as  a  result  of  which  the  wavepacket  sort  of 
reassembles  itself,  though  the  process  of  re-assembly  is  quasiperiodic  rather   than 
periodic.  We  term  such  a  time-evolution  'recurrent',  as  opposed  to  'non-recurrent'  time- 
evolution  where  the  wavepacket  does  not  re-assemble  itself  even  approximately  at  any 
point  of  time.  Non-recurrent  dynamics  can  in  turn  be  of  essentially  two  distinct  types, 
namely  diffusive  and  ballistic,  the  latter  resembling  a  freely  propagating  wavepacket 
while  the  former  resembles  what  is  known  as  'mixing'  motion  in  classical  dynamics,  the 
autocorrelation  function  of  the  evolving  wavepacket  decaying  more  slowly  than  in  the 
case  of  ballistic  motion. 

As  we  shall  indicate  in  this  section,  these  different  types  of  time-evolution  are 
intimately  related  to  characteristics  of  the  underlying  energy  or  quasi-energy  (see  below) 
spectra,  and  it  is  in  this  context  that  dynamics  in  pseudorandom  systems  acquires  a 
crucial  significance.  In  particular,  the  possibility  of  singular-continuous  spectra  and  co- 
existing spectral  measures  of  distinct  types  makes  the  problem  of  dynamics  in  such 
systems  a  highly  non-trivial  one,  and  one  finds  that  the  negative  results  (i.e.,  the  ones 
implying  the  absence  of  quantum  non-reccurence)  encountered  in  the  context  of  the 
kicked  rotator  are  by  no  means  of  a  representative  nature.  A  number  of  studies  on  ID 
incommensurate  lattices  have  already  established  that  subtle  novelties  are  in  store  in  this 
area.  Moreover,  the  technical  possibility  of  actually  fabricating  ID  and  2D  superlattices- 
instances  of  the  so-called  mesoscopic  systems-makes  the  classical-quantum  cross-over 
regime  an  experimentally  accessible  one.  All  this  has  made  the  problem  of  quantum  time 
evolution  in  incommensurate  systems  an  altogether  interesting  study. 

3.2  Spectral  characteristics  and  time  evolution:  The  quasi-energy  spectrum 

The  simplest  of  quantum  time-evolutions  is  that  of  a  stationary  state  of  a   time- 
independent  system.  In  principle,  the  time  evolution  of  an  arbitrary  state  or  a  wavepacket 
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for  a  system  described  by  a  time-independent  Hamiltonian  is  a  superposition  of 
harmonically  evolving  terms;  but  whether  this  superposition  leads  to  a  recurrent  or  non- 
recurrent time  evolution  depends  on  the  set  of  frequencies  involved,  i.e.,  on  the  energy 
spectrum.  We  confine  our  discussion  to  quantum  dynamics  in  random  and  pseudorandom 
systems,  referring,  in  particular  to  ID  TBM's  for  illustration. 

A  pure  point  or  discrete  spectrum  results  in  a  recurrent  time  evolution  where  the 
wavepacket  pulsates  regularly  and  its  mean  squared  spread  does  not  increase 
unboundedly  with  time.  This  is  the  situation  in  a  ID  random  lattice:  the  wavepacket 
does  not  diffuse  and  if  the  wavepacket  is  initially  localised  on  a  particular  site,  then  the 
probability  of  finding  it  at  the  same  site  at  a  later  time  t  does  not  go  to  zero  as  t  — »  oo  (see 
e.g.,  [26]). 

On  the  other  hand,  the  presence  of  a  singular  continuous  or  absolutely  continuous  part 
in  the  spectrum  complicates  matters.  If  the  initial  wavepacket  belongs  to  the  subspace 
built  up  from  the  absolutely  continuous  and  singular-continuous  parts  of  the  spectrum 
then  it  becomes  a  scattering  state,  i.e.,  ultimately  escapes  to  infinity  [29].  This  is  the  case 
of  nonrecurrent  dynamics.  It  includes,  in  turn,  a  variety  of  different  possible  types  of 
motion  differing  from  one  another  in  subtle  details. 

For  instance,  a  periodic  lattice  with  an  absolutely  continuous  spectrum  forming  a  band 
yields  a  ballistic  motion,  with  the  mean  squared  spread 

A2(f)  =  £(tt-no)2|^(f)|2,  (11) 

n 

of  the  wavepacket  varying  asymptotically  as  the  square  of  the  time  elapsed  :  A2(f)  oc  t2 
(here  «o  is  the  mean  position  of  the  wavepacket  and  ijjn  denotes  the  component  of  the 
wavepacket  on  lattice  site  n). 

The  role  of  singular-continuous  spectra  in  leading  to  non-recurrent  motions  and  the 
correlation  between  multifractal  properties  of  these  spectra  and  various  characteristics  of 
the  non-recurrent  motions  have  drawn  intense  attention  in  recent  years.  Guameri  [41,  42] 
related  the  dynamics  of  wavepacket  spreading  to  the  fractal  properties  of  the  spectrum 
and  showed  that  for  unbounded  diffusion  of  the  wavepacket  to  take  place  it  is  sufficient 
that  the  information  dimension  of  the  spectrum  be  larger  than  0.5.  Geisel,  Ketzmerick  and 
Petschel  [43]  made  a  numerical  investigation  of  wavepacket  spreading  in  Harper's  model 
which  is  known  to  possess  a  singular-continuous  spectrum  and  found  that 

A2ocr*',  (12) 

with  xi  dose  to  1  for  a  =  'golden  mean'  and  other  strongly  irrational  values,  in  close 
conformity  with  the  earlier  results  of  Hiramoto  and  Abe  [44].  The  spectrum  in  Harper's 
model  exhibits  what  Geisel  et  al  call  'level  clustering',  with  the  probability  density  p(s] 
of  nearest  neighbour  level  spacings  (s)  behaving  as  p(s)  ~  s~P,/3  —  3/2.  Such  level 
clustering  is  typical  of  singular-continuous  spectra,  as  opposed  to  'level  repulsion'  in 
random  matrix  ensembles.  The  exponent  (3  is  related  to  the  Hausdroff  dimension  of  the 
spectrum  as  DH  =  /5  -  1,  and  Geisel  et  al  conjectured  that 

Xi  =  2DH,  (13) 

which  implies,  for  Harper's  model,  a  spreading  of  wavepackets  through  what  is  known  as 
'normal'  diffusion.  Since  the  information  dimension  is  bounded  from  above  by  the 
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bound  established  by  Guarneri. 

While  typical  wavepackets  in  the  Harper  model  spread  through  normal  diffusion,  other 
ID  pseudorandom  lattices  exhibit  'anomalous'  diffusion  with  various  different  exponents 
Xi(<  1).  Thus,  in  the  Fibonacci  lattice,  it  varies  with  the  binding-to-hopping  ratio  v, 
decreasing  from  a  value  close  to  1  (normal  diffusion)  for  low  v  to  xi  ~  0-25  for  v  w  10 
[45].  For  the  Thue-Morse  chain,  on  the  other  hand,  there  is  a  preponderance  of  extended 
states  in  the  spectrum  and  the  spreading  is  'superdiffusive'  in  nature,  the  exponent  x\ 
decreasing  from  a  value  close  to  2  (ballistic  motion)  to  close  to  3/2  for  large  v  (Katsanos 
et  al  [46]). 

Apart  from  the  exponent  x  characterising  the  spread  of  the  wavepacket,  another 
quantity  characteristic  of  the  nature  of  quantum  time  evolution  is  the  large  time  behaviour 
of  the  temporal  autocorrelation  C(t)  of  the  wavefunction  if)(t): 


While  pure  point  spectra  are  distinguished  by  non-decaying  autocorrelations,  the  situation 
is  more  complex  for  singular-continuous  and  absolutely  continuous  spectra.  A  decay  of 
the  autocorrelation  faster  than  any  power  law  can  be  ascribed  to  an  absolutely  continuous 
spectrum,  but  the  converse  does  not  appear  to  be  necessarily  true:  there  are  absolutely 
continuous  spectra  for  which  the  decay  follows  a  power  law.  Consequently,  Avron  and 
Simon  [47]  distinguished  between  two  types  of  absolutely  continuous  spectra-the  so- 
called  transient  and  recurrent  varieties.  For  instance,  the  Aubry  model  with  v  <  2  is 
characterised  by  a  decay  of  the  autocorrelation  of  the  form 

C(t]  oc  r»,  (15) 

where  the  exponent  xi  is  seen  to  be  nearly  0.84,  and  this  slow  decay  has  been  ascribed  to 
a  recurrent  absolutely  continuous  spectrum.  For  v  =  2,  on  the  other  hand  (Harper's 
model),  X2  —  0.14,  pointing  to  a  singular-continuous  spectrum  [45]. 

The  classical  analog  of  Harper's  model  is  known  to  be  integrable  with  regular  KAM- 
like  motion  and  so,  the  diffusive  spread  of  wavepackets  in  the  Harper  model,  with  its 
attendant  decay  of  the  autocorrelation  function,  is  already  sort  of  a  surprise.  It  shows  that 
there  is  really  no  simple  tie-up  between  recurrence/non-recurrence  of  quantum  time 
evolution  and  regularity  /chaos  in  the  underlying  classical  motion. 

A  more  realistic  model  of  an  electron  in  a  lateral  surface  superlattice,  subjected  to  a 
magnetic  field,  leads  to  a  classical  analog  that  is  non-integrable  and  is  characterized  by 
chaotic  orbits  [48,49].  It  is  thus  a  matter  of  considerable  interest  to  look  into  how  the 
onset  of  classical  chaos  affects  level  clustering  and  the  characteristics  of  wavepacket 
evolution,  such  as  the  exponents  xi  and  XT.  introduced  above. 

The  most  widely  studied  system  in  this  context  is  the  periodically  kicked  Harper  model 
(KHM),  which  also  has  a  classically  chaotic  phase  space  [50]. 

Here  the  Hamiltonian  has  a  periodic  time-dependence  and  the  quantum  time  evolution 
is  governed  by  what  is  known  as  the  quasi-energy  spectrum.  Briefly,  if  T  be  the  time 
period  of  kicking,  then  the  evolution  operator  U(t  +  T,  t]  propagating  the  state  through  a 
time  interval  T  contains  all  the  relevant  information  concerning  asymptotic  time 
evolution.  This  is  the  so-called  Floquet  operator  which  is  unitary  and  whose  eigenvalues 
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Incommensurate  lattices 

are  all  of  the  form  exp(— zeT)  (we  set  h  =  1).  The  real  numbers  e  constitute  the  quasi- 
energy  spectrum  of  the  problem.  There  are  special  evolving  states  with  quasiperiodic  time 
evolution  given  by 

ijj(t)  =  exp(— iet)4>(t),  (16) 

where  c/)(t)  is  periodic  with  period  7".  These  are  known  as  quasi-stationary  states  and  are 
the  generalisations  of  stationary  states  relevant  for  a  periodically  driven  system. 

The  role  of  quasi-energy  spectra  in  determining  the  nature  of  time  evolution  of 
wavepackets  for  a  periodically  driven  system  is  similar  to  that  of  the  energy  spectra  in 
case  of  time  independent  systems.  Thus,  pure  point  quasi-energy  spectra  lead  to  recurrent 
dynamics  manifested  through  both  the  mean  squared  spread  of  the  wavepacket  and  the 
fluctuations  of  the  autocorrelation  (see  [51,  52]),  while  the  time  evolution  is  non-recurrent 
for  absolutely  continuous  and  singular-continuous  quasi-energy  spectra  [53]. 

Studies  on  the  KHM  show  that  a  rich  variety  of  time  evolution  results  for  different 
possible  choices  of  the  parameters  characterizing  the  model  and  that  unbounded  diffusion 
in  the  classical  regime  is  quite  different  compared  to  that  in  the  quantum  regime,  the 
latter  arising  due  to  the  hierarchical  level  clustering  in  the  spectrum.  Artuso  et  al  [54] 
have  confirmed  that  the  relation  (13)  between  the  exponent  xi  an^  tne  fractal 
dimension  DH  remains  valid  for  the  KHM.  The  'phase  diagram'  in  the  parameter  space 
demarcating  different  types  of  wavepacket  evolution  has  been  studied  in  Artuso  et  al 
[1 1]).  One  feels  that  a  definitive  understanding  of  the  KHM  has  not  yet  been  arrived  at  in 
the  literature. 

3.3  Periodic  kicking  in  TBM's:  Wavepacket  dynamics  in  energy  space 

Thus,  an  understanding  of  the  nature  of  wavepacket  dynamics  for  a  periodically  driven 
system  requires  a  knowledge  of  its  quasienergy  spectrum.  This  can  be  had  by  making  use 
of  the  so-called  Maryland  construction  [55, 56],  whereby  a  problem  with  periodic  delta 
function  kicking  is  mapped  to  an  equivalent  stationary  problem  on  an  'energy  lattice' (see 
[20]).  We  illustrate  this  in  the  context  of  a  tight  binding  Hamiltonian  with  periodic 
kicking  of  the  form 

00 

H(f)  =HQ  +  V  ^  6(t-  sT],  (17) 

s=— oo 

where  HQ  is  a  time  independent  TBM  Hamiltonian  with  energy  eigenvalues  En  and 
corresponding  eigenvectors  |  n): 

,\n)(n\.  (18) 

In  this  notation,  the  index  n  no  longer  represents  site  in  the  actual  'real  space'  lattice.  A 
typical  quasistationary  state  with  quasi-energy  u  (in  units  of  K)  can  be  written  as 

1  ill    •  t\  —  p~iwr   I  i/    -A  /"1Qa^ 

I  (rW) '/  —  c          |  uui  i/t  ^li/aj 

where 


and  the  Hermitian  operators  V,  H0  through  the  relations 

exp(-iVO  =  (/  +  iV)(/-ry)-1,  (21a) 

exp(-iHo)  =  (/  +  iHo}(I  -  iffo)"1  •  (21b) 

Then,  further  defining  j  u)  through 

|«H>s(/  +  iff0)|a),  (22) 

and  expanding  |  u)  on  the  energy  lattice 

n,  (23) 


one  arrives  at  an  equivalent  description 

(cn  +  (n  |  V  |  n))«n  =  -  £(n  |  V  |  m)Mm  ,  (24a) 

m-^n 

where 

cn=tan[T(u-En)/2].  (24b) 

If  V  is  chosen  such  that  V  has  matrix  elements  Vmn  only  for  m  =  n  ±  1  and,  further,  all 
matrix  elements  between  neighbouring  states  are  chosen  to  be  same  (say,  v)  we  get 

cnun  +  v(un+l  +  MB_I)  =  0,  (25) 

which  is  an  equivalent  stationary  TBM  with  on-site  potentials  ca  given  by  (24b)  (termed 
the  'tangent  potential').  A  quasienergy  cj  is  determined  implicitly  by  the  requirement  that 
this  equivalent  TBM  possesses  a  zero  eigenvalue.  The  wn's  determined  by  the 
corresponding  eigenvector  of  the  equivalent  TBM  are  essentially  the  components  of 
the  quasi-stationary  states  along  the  energy  eigenstates  |  n}  of  the  original  TBM. 

For  large  T  the  tangent  potential  is  unbounded  and  quasi-stationary  states  are  localised 
in  the  energy  lattice.  For  smaller  values  of  T  on  the  other  hand,  the  spectrum  and 
eigenfunctions  of  the  equivalent  'tangent'  lattice  are  determined  by  the  parameter  v  since 
the  effective  binding-to-hopping  ratio  is  v  =  1/v. 

The  above  construction  was  employed  in  [20]  to  the  case  of  an  electron  in  a  CL  or  a 
QPL  derived  periodically  by  6-function  impulses  in  order  to  arrive  at  a  useful  scheme  for 
determining  the  nature  of  time  evolution  of  wavepackets.  The  TBM  one  starts  with  can  be 
written  as 


Vn\un}(un\  +  A(KXMn+1|  +  KXu^l),  (26) 


where  we  assume  that  boundary  effects  can  be  ignored  and  where  the   fwn)'s    are 
basis  vectors  defining  the  lattice  (to  be  designated  the  'real  space  lattice'  in  the  present 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  U) 
572  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


.lenoung  me  component  01  \  un)  ai  me  muice  sue  m. 
this  TBM  Hamiltonian  we  add  a  periodic  kicking  term  so  as  to  arrive  at 


(28) 


V  =  ^v(|wn)(Mn+i|  +  |Mn)(MH_i|),  (29) 

n 

ioting  the  strength  of  kicking. 

t  now  \u}n)  denote  the  eigenvector  of  HO  belonging  to  the  eigenvalues 
=  1,2,...),  [en]  being  the  lattice-ordered  sequence  of  eigenvectors.  For  the 
[  lattices  of  the  slowly  varying  type  introduced  in  §  2,  we  already  know  from  the 
nations  in  §  2  that  for  large  enough  v  (above  the  relevant  limiting  value  vc)  |u>H)  is 
2d  at  site  n  for  almost  every  n  and,  the  localization  lengths  being  small  enough  for 
v,  we  can  effectively  replace  wn)'s  by  |wn)'s  in  (29).  Since  HQ  is  diagonal  in  the 
s,  the  Hamiltonian  with  the  kicking  can  now  be  written  as 


v(M(un+l\  +  \LJn)(un-i\)6(t-sT).  (30) 

n  n,s 

choice  of  the  sequence  of  eigenvalues  en  and  the  replacement  of  the  |  wn)'s  by  the 
's,  based  on  the  results  of  §  2,  is  crucial  to  the  scheme  being  considered  here.  It 
ies  that  we  are  in  the  insulator  regime  v  >  vc,  where  we  can  now  employ  the 
fland  construction  as  in  the  case  of  (17).  In  the  present  situation  V  does  possess 
;st  neighbour  matrix  elements  only,  and  the  matrix  element,  is  independent  of  the  site 
B  energy  lattice.  Since  V  is  a  regular  function  of  V,  we  can  assume  similar  properties 
rnm  as  well.  What  is  essential  here  is  that  Vnm  should  not  vary  irregularly  with  n  and 
id  that  it  should  not  have  significantly  large  values  for  large  n  —  m,  both  of  which 
be  seen  to  follow  as  consequences  of  (30).  In  fact,  to  a  good  degree  of 
oximation,  we  arrive  at  the  equivalent  tangent  potential  model 

(31) 


A  =--!-/     d0cos0tan(^cos0Y  (32) 

27T  J0  \2  / 

the  on-site  tangent  potentials  are  given  by  (24b),  with  En's  replaced  by  the  lattice 
red  en's.  The  binding-to-hopping  ratio  in  this  equivalent  TBM  is  A"1  which  decreases 
increasing  v.  For  small  enough  T  the  sequence  {cn}  is  essentially  similar  to  {en}  and 
idicated  in  §  2,  this  in  turn  is  similar  to  {Vn}.  Hence,  for  small  T,  a  m-i  type  transition 
irs  in  the  tangent  potential  TBM  in  an  essentially  similar  manner  as  in  the  original 
f  as  v,  the  strength  of  kicking,  is  made  to  vary.  In  particular,  for  small  enough  v,  all  the 
nstates  in  this  equivalent  lattice  are  localised  while,  for  larger  v,  extended  states 
inate  the  spectrum.  This  means  that  for  small  v,  the  quasistationary  states  of  the  time 
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Figure  6.  Root  mean  squared  spread  (A)  of  wavepacket  in  the  energy  lattice  against 
number  of  kicks  (t)  for  the  kicked  FL  for  v  —  7.0  with  kicking  parameters  (a)  T  =  0.1, 
v  =  0.1  and  (b)  T  =  0.1,  v  =  2.0.  In  each  case  the  initial  wavefunction  was  assumed  to 
be  concentrated  at  the  centre  of  the  lattice  and  the  quantum  map  of  (34)  was 
numerically  iterated,  retaining  Bessel  function  coefficients  Jk  up  to  |  fe  |=  6.  The 
quasiperiodic  nature  of  the  time  evolution  in  (a)  is  evident  while  in  (b)  the  spread  of 
the  wavefunction  appears  to  be  ballistic. 

dependent  problem  (28)  are  localised  in  the  energy  lattice  while,  for  larger  v,  there  appear 
quasistationary  states  extended  in  the  energy  lattice. 

This  enables  us  to  make  predictions,  in  the  context  of  the  periodically  driven 
Hamiltonian  (28),  about  the  time  evolution  of  wavepackets  in  the  energy  lattice.  For 
instance,  for  sufficiently  small  T  and  large  enough  v,  the  presence  of  extended  quasi- 
energy  states  with  associated  absolutely  continuous  parts  in  the  spectrum  implies  that 
there  occurs  a  ballistic  spread  of  wavepackets  in  the  energy  lattice,  while  for  smaller 
values  of  v  one  expects  a  recurrent  time  evolution. 

The  time  evolution  of  a  wavepacket 

.(Ok),  (33) 
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Figure  7.  Root  mean  squared  spread  (A)  of  wavepacket  in  the  energy  lattice 
against  number  of  kicks  (t)  for  the  kicked  SEL  for  v  =  6.0  with  kicking  parameters 
(a)  T  =  0.1,  v  =  0.1,  and  (b)  T  =  0.1,  v  =  2.0.  See  legend  to  figure  6. 


governed  by  the  Hamiltonian  (30)  is  given  by  the  quantum  map 


(34) 


where  psn  =  lim£^o+  (sT  +  e)  and  Jk  is  the  Bessel  function  of  order  k.  This  map  can  be 
iterated  numerically  for  given  v,  T.  For  the  sake  of  convenience  we  take  for  s  =  0 

rf  =  «- 

for  some  given  ra  (i.e.,  an  eigenstate  \ujm)  as  the  initial  |-0}),  and  on  iterating  through  s 
impulses  compute 

Pnf  (35a) 
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Figure  8.  Root  mean  squared  spread  (A)  of  wavepacket  in  the  energy  lattice 
against  number  of  kicks  (t)  for  the  kicked  CSL  for  v  =  6.0  with  kicking  parameters 
(a)  T  =  0.1,  v  =  0.1,  and  (b)  T  =  0.1,  v  =  2.0.  See  legend  to  figure  6. 


(35b) 


which  are  respectively  the  mean  position  and  mean  squared  spread  of  the  wavepacket  on 
the  energy  lattice  after  s  impulses.  While  the  right  hand  side  of  (34)  involves  an  infinite 
summation,  in  practice  one  can  truncate  after  a  few  terms  since  the  Bessel  functions 
decrease  with  increasing  \k\. 

Figures  6(a,b)  present  results  of  numerical  iteration  of  the  quantum  map  (34)  for  the 
Feigenbaum  lattice  (with  v  >  i/c)  for  two'  different  sets  of  driving  parameters.  The 
agreement  with  the  predictions  of  the  above  theory  is  immediately  apparent.  Thus,  as  the 
kicking  strength  v  is  increased  from  0.1  to  2.0,  the  quasiperiodic  fluctuations  in  the  mean 
squared  spread  of  the  wavepacket  are  seen  to  give  way  to  a  unbounded  ballistic  growth. 

Similar  results  are  encountered  with  for  the  SEL  and  the  CSL  as  well,  as  seen  from 
figures  7(a,b)  and  8(a,b)  respectively.  In  other  words,  the  theory  outlined  above  does 
indeed  give  us  correct  predictions  about  the  quasienergy  spectra  and  wavepacket 
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dynamics  in  the  energy  lattice  for  the  TBM's  with  'slowly  varying'  on-site  potentials 
being  considered  in  this  paper.  It  should  be  mentioned  that  motion  in  the  energy  lattice  is 
not  quite  the  same  thing  as  that  in  the  real  space  lattice,  and  so  the  findings  obtained  here 
cannot  be  immediately  compared  with  the  corresponding  ones  on  the  KHM.  Nevertheless 
the  above  theory  is  evidently  useful  in  leading  to  a  formulation  of  criteria  for  recurrent 
and  not-recurrent  dynamics  for  at  least  a  class  of  mesoscopic  systems.  Since  the  theory 
does  not  refer  to  the  classical  phase  space,  these  may  be  looked  upon  as  purely  quantum 
criteria  independent  of  criteria  for  classical  chaos.  Such  criteria  appear  to  be  necessary  in 
the  context  of  findings  on  the  kicked  Harper  model. 

[Remark.  It  is  neccessary  at  this  point  to  set  right  a  computational  error  in  [20,  21]  where 
numerical  iteration  of  the  quantum  map  for  the  CL  and  the  QPL  in  presence  of  kicking 
appeared  to  give  rise  to  normal  diffusion  rather  than  ballistic  spread  for  large  values  of 
kicking  strength.  This,  however,  was  an  artefact  caused  by  the  numerical  programme 
which  retained  too  few  terms  in  the  right  hand  side  of  (34),  thus  truncating  away  a 
number  of  terms  with  significant  values.  In  the  present  series  of  results,  on  the  other  hand, 
we  have  retained  Bessel  functions  Jfc  with  \k\  <  fcmax,  where  fcmax  has  been  chosen 
sufficiently  large.  As  indicated  above,  a  ballistic  spread  for  large  v  is  actually  more  in 
keeping  with  the  prediction  from  the  theory  we  have  outlined.] 

4.  AC  Conductivity  of  incommensurate  lattices 

The  a.c  electrical  conductivity  a((J)  at  low  temperatures  (71)  in  incommensurate  lattices  is 
a  good  probe  for  the  nature  of  spectra  and  wavefunctions.  While  the  d.c  limit  gives  us 
only  local  information  on  individual  states  near  the  Fermi  level,  the  w-dependence  of  the 
a.c  conductivity  is  a  vastly  richer  source  of  information,  involving  the  global  structure  of 
the  spectrum  and  of  the  wavefunctions.  Even  the  small  u  regime  at  T  — >  0  gives  us  new 
insights,  especially  concerning  extremely  distant  but  resonating  localized  states.  The  uj- 
dependence  in  this  regime  already  shows  subtle  variations  with  the  arithmetic  nature  of 
the  incommensuration,  and  differs  from  corresponding  results  on  completely  disordered 
lattices  [57].  From  the  applied  point  of  view,  finite-u;  a.c  conductivity  in  incommensurate 
lattices  provides  the  context  for  developing  materials  with  interesting  frequency-selective 
transport  properties,  differing  from  ordered  structures  on  the  one  hand,  and  from 
completely  disordered  ones  on  the  other. 

However,  precisely  because  of  the  fact  that  the  a.c  conductivity  depends  on  the  global 
structure  of  the  spectrum  and  of  the  wavefunctions,  its  theoretical  calculation  or 
numerical  computation  is  beset  with  formidable  difficulties.  In  particular,  the  knowledge 
of  all  the  stationary  state  wavefunctions,  which  is  necessary  for  the  computation  of  cr(u), 
is  a  severe  pre-requisite  for  large  systems.  The  principal  difficulty  lies  in  the  fact, 
indicated  in  §  2,  that  wavefunctions  vary  extremely  erratically  with  energy.  Thus,  as  seen 
from  figure  3,  wavefunctions  with  energies  close  to  one  another  have  their  peaks  at 
widely  different  locations. 

In  [22],  the  numerical  observations  of  §2  were  used  in  the  context  of  the  CL  and  the 
QPL  to  establish  that  gross  features  of  the  uj-dependence  of  the  a.c  conductivity  can  be 
accounted  for  quite  satisfactorily  in  terms  of  the  energy  spectra  alone,  without  direct 
reference  to  the  wavefunctions. 
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A  Lahiri  et  al 

A.  simple  model  based  on  the  correlations  outlined  in  §2  was  proposed  for  the 
computation  of  the  gross  features  in  the  variation  of  cr(u;)  and  was  found  to  compare 
favourably  with  numerical  computations  of  <T(OJ)  in  accordance  with  the  Kubo- 
Greenwood  formula. 

Since  we  have  seen  that  these  correlations  hold  for  other  incommensurate  lattices 
admitting  of  m-i  type  transitions,  in  particular  the  FL,  the  SEL  and  the  CSL,  the  above 
simple  model  should  in  principle  work  for  these  lattices  as  well. 

In  this  section  we  summarise  relevant  background  from  [22]  and  then  show  that  the 
domain  of  applicability  of  the  simple  model  does  indeed  include  these  other  lattices  as 
well.  We  point  out  in  the  next  section  a  few  features  of  a(ui}  for  the  lattices  considered,  by 
way  of  concluding  remarks. 


4.1  Resonance  peaks  in  a.c  conductivity 

At  T  — >  0  there  are  no  phonons  in  a  lattice  assisting  the  conductivity  and,  in  the  single 
particle  approximation,  the  a.c  conductivity  as  a  function  of  the  frequency  u  is  given  (up 
to  constants,  in  units  of  e2/h)  by  the  Kubo-Greenwood  formula  (see,  e.g.,  [16]) 

-E»-  ^)'  (36) 

which  has  been  written  in  a  form  suitable  for  a  system  with  localised  states.  Here  Q, 
stands  for  the  system  size,  the  Em's  are  the  energy  eigenvalues,  the  fm's  are  the  Fermi 
occupation  probabilities,  and  xmn  stands  for  the  matrix  element 

*™  =  £^im)^n),  (37) 

/ 

where  f  is  the  component  at  lattice  site  /  of  the  normalised  wavefunction 
corresponding  to  energy  Em.  In  (36),  the  indices  m,n  appear  under  the  summation,  and 
hence  the  order  hi  which  the  energy  eigenvalues  and  the  corresponding  eigenfunctions  are 
arranged  does  not  matter. 

Equation  (36)  tells  us  that,  apart  from  an  over-all  factor  w,  the  variation  of  a  with  tu 
involves  a  series  of  resonance  peaks,  a  peak  at  any  given  frequency  uj  arising  when  there 
exists  a  pair  of  states,  one  below  and  the  other  above  the  Fermi  level,  such  that  the  energy 
interval  between  the  two  is  equal  to  u  (in  units  of  K)  and  the  co-ordinate  matrix  element 
between  the  two  states  is  appreciable.  Such  a  pair  of  states  may  be  termed  a  'resonating' 
pair.  It  may  be  noted  that,  in  general,  a  resonance  peak  in  o-(u]  may  involve  more  than 
one  pair  of  resonating  states.  Indeed,  experimentally  it  is  not  <j(u;),  but  an  average  taken 
over  some  small  oj-interval  that  is  usually  relevant.  In  this  averaged  a,  each  resonance 
peak  will  more  often  than  not  actually  involve  more  than  one  pair  of  resonating  states. 
Since  all  states  are  involved  in  the  summation  (36),  and  since  there  may  be  resonating 
states  distributed  throughout  the  spectrum,  it  is  apparent  that  the  calculation  of  cr(u) 
requires  the  knowledge  of  the  entire  spectrum  and  the  complete  set  of  wavefunctions. 

Early  computations  of  low  temperature  a.c  conductivity  for  ID  random  lattices  were  by 
Penchina  and  Mitchell  [58],  and  by  Albers  and  Gubernatis  [59]  (see  [17]).  The  resonances 
are  dense  in  a  random  lattice,  with  strong  fluctuations  in  the  finite  u  regime.  Mott  (see, 
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e.g.,  [60])  argued  that,  in  the  small  ui  regime,  a  has  a  a;2  behaviour,  with  a  logarithmic  in- 
dependence thrown  in  due  to  the  so-called  multiple!  formation  by  extremely  distant 
resonating  states.  Prange,  Grempel,  and  Fishman  [57]  calculated  a(uS)  in  an  exactly 
solvable  incommensurate  TBM,  and  pointed  out  that  numerous  possible  types  of 
behaviour  in  the  small  u;  regime  might  be  possible  for  different  types  of  incommensura- 
tion,  which  are  clearly  distinct  from  the  u  dependence  of  a  random  lattice. 

However,  large  w  regimes  are  largely  outside  the  purview  of  these  earlier  studies,  since 
no  systematic  computation  of  the  resonances  could  be  possible  without  a  knowledge  of  all 
the  wavefunctions  ip^  and  since,  as  already  seen  in  the  small  ui  regime,  resonance 
involves  apparently  uncorrelated  states  for  random  and  quasirandom  lattices. 

4.2  Gross  structure  of  o~(ui):  Locating  the  resonant  peaks 

In  this  context  the  results  presented  in  §  2  shed  a  new  light  on  the  problem.  Indeed,  the 
calculation  of  a(ui)  becomes  immediately  simpler  as  soon  as  one  takes  the  sequence  {£„} 
to  be  the  lattice-ordered  sequence  of  energies  since  we  already  know  that  in  this  ordering 
the  gross  structures  of  the  corresponding  wavefunctions  are  strongly  correlated  with  the 
energies  themselves.  This  enables  us  to  compute  the  locations  of  a  considerable  fraction 
of  the  resonant  peaks  from  a  knowledge  of  the  spectrum  alone,  using  the  lattice-ordered 
sequence  of  energies  to  supply  the  relevant  gross  features  of  the  wavefunctions.  What 
cannot  be  captured  in  this  approach  is  the  fine  structure  in  the  variation  of  a,  namely  the 
actual  sizes  of  these  resonant  peaks,  since  this  does  in  fact  demand  a  detailed  knowledge 
of  the  wavefunctions. 

Thus,  we  make  use  of  the  fact  that  when  the  energies  are  reckoned  in  the  lattice- 
ordered  sequence,  the  wavefunctions  ip^  and  -0M  involved  in  the  matrix  element  xmn  are 
peaked  at  lattice  sites  m,  n  respectively  and  this  matrix  element  can  have  a  substantial 
magnitude  only  when  the  wavefunctions  have  a  considerable  overlap.  Remembering  that 
the  wavefunctions  themselves  are  exponentially  localized  with  certain  localization 
widths,  it  is  apparent  that  the  matrix  element  xmn  can  be  appreciable  only  when  the 
combined  widths  of  the  wavefunctions  involved  is  less  than  the  distance  between  their 
peaks.  In  this  context  we  use  the  Thouless  formula  [61]  for  the-  inverse  localization 
lengths  in  a  ID  chain: 

(38) 

This  formula  relates  the  inverse  localization  lengths  with  the  spectrum.  So,  the  spectrum 
alone  supplies  all  the  information  required  in  the  calculation  of  the  gross  structure  of  a(ui) 
if  we  adopt  the  following  working  hypothesis:  two  states  tfj^  and  ijj^  at  an  energy  gap  o> 
constitute  a  pair  of  resonating  states  only  if\m  —  n\  is  less  than  ^m~l  +  1n~l- 

This  working  hypothesis  implies  that  the  matrix  element  xmn  is  appreciable  only  when 
1  m  -  n  |  is  small  enough,  and  is  based  on  the  correlation  between  the  peaks  of  the 
wavefunctions  and  the  energies  in  the  lattice-ordered  sequence.  As  such,  it  is  expected  to 
hold  for  the  incommensurate  TBM's  we  are  considering,  and  not  for  a  random  lattice. 
This  was  corroborated  numerically  for  the  CL  and  QPL  vis  a  vis  the  random  lattice  in  ref. 
[22].  In  figure  9(a,b)  we  plot  the  magnitude  of  the  matrix  element,  calculated  for  a  fixed 
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Figure  9.  Average  overlap  u(k]  =  N~l  £m  |  Jcm,m+Jt  |,  N  =  lattice  size  (for  notations, 
see  text),  against  separation,  k  between  peaks  for  (a)  the  FL  with  v  =  7.0,  .and  (b)  a 
random  lattice  with  v  =  5.0.  In  the  plots,  max  u(k)  has  been  normalised  to  1. 


separation  between  the  peaks  and  averaged  over  the  lattice,  as  a  function  of  the  separation 
for  the  FL  and  the  random  lattice  respectively.  It  is  apparent  from  these  plots  that  the 
above  working  hypothesis  is  indeed  tenable  for  the  FL  while,  for  the  random  lattice,  the 
matrix  element  fluctuates  without  any  appreciable  decimation  as  the  distance  between  the 
peaks  is  made  to  increase. 

With  this  justification  for  our  working  hypothesis  and  following  [22],  we  compute  the 
locations  of  the  resonances  as  follows:  in  the  Kubo  formula  (36),  we  substitute  xmn  =  1 
whenever  |  m  -  n  |<  ^m~l  +  ^n~lt  and  zero  otherwise.  We  do  not  distinguish  between 
the  magnitudes  of  the  non-zero  matrix  elements  since  we  are  interested  primarily  in  the 
locations  of  the  resonances. 

Numerical  results  of  [22]  indicate  that  this  approach,  simple  as  it  is,  is  surprisingly 
successful  in  reproducing  correctly  the  gross  features  of  the  variation  of  a  with  u>  for  the 
CL  and  the  QPL. 
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We  present  below  corresponding  numerical  evidence  for  the  other  TBM's  introduced  in 
this  paper,  namely  the  FL,  the  SEL,  and  the  CSL. 

Variations  of  the  zero  temperature  a.c  conductivity  with  frequency,  as  computed  from 
the  Kubo  formula  (36)  with  the  help  of  numerically  computed  wavefunctions,  are  shown 
in  figures  10(a,b,c)  for  the  FL,  the  SEL  and  the  CSL  respectively.  One  finds  that  while  the 
resonant  peaks  are  clustered  in  band-like  groups  for  the  latter  two  lattices,  they  are 
relatively  sparsely  distributed  in  the  self-similar  Feigenbaum  lattice.  In  this  respect  the  FL 
resembles  the  CL  since,  in  both  these  lattices,  nearby  energies  in  the  lattice  ordered 
sequence  are  clustered  together  into  closely  packed  groups  along  the  energy  scale. 

In  order  to  compare  the  gross  features  of  this  variation,  computed  from  the  Kubo 
formula,  with  the  results  of  our  above  simplified  computation  of  the  resonance  peaks,  we 
first  ignore  those  resonant  peaks  obtained  from  the  Kubo  formula  whose  magnitudes  are 
less  than  some  small  suitably  chosen  fraction  (n )  of  the  most  pronounced  peak,  thereby 
eliminating  the  minor  peaks  from  the  comparison,  and  then  set  a  =  1  for  all  the 
remaining  peaks  resulting  from  the  Kubo  formula  and  display  them  as  upward  spikes  in  a 
<7  —  w  plot.  While  computing  these  resonant  peaks,  the  relevant  w-interval  (depending  on 
the  width  of  the  spectrum  in  the  energy  scale)  was  sampled  by  a  number  of  points 
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Figure  lOa-b.    (Continued) 
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Figure  10.  Plot  of  <r(u)  vs  u  for  (a)  the  FL  with  v  =  7.0,  (b)  the  SEL  with  v  =  6.0, 
and  (c)  the  CSL  with  v  =  4.0.  Each  spike  represents  an  average  of  a  for  a  small 
interval  (0.025)  of  frequency;  data  points  are  at  w-intervals  0.05.  Fermi  levels,  to 
which  the  plots  are  not  particularly  sensitive,  were  chosen  at  energies  0.7509  •  •  •, 
-1.3505  •  •  -,  and  -0.0815  •  •  •  respectively. 


distributed  uniformly,  and  at  each  sampling  point  a  coarse-grained  a  was  calculated  by 
including  all  pairs  of  resonating  states  whose  energy  separation  lies  within  an  interval  u 
to  u>  +  6u,  for  some  suitably  chosen  small  6u.  Likewise,  the  resonances  were  computed 
from  the  simplified  model  for  the  same  set  of  sampling  points  and  with  the  same  coarse- 
graining,  and  now  a  is  set  at  —1  at  all  these  resonances,  superposing  these  in  the  same 
a  —  u)  plot  as  downward  spikes.  Analogous  to  the  use  of  the  fraction  r\  in  eliminating  the 
relatively  minor  upward  spikes,  a  fraction  r-i  has  been  used  for  the  downward  spikes  and, 
in  some  instances,  r2  has  been  chosen  different  from  ri  for  facility  of  comparison.  In 
chosing  this  mode  of  presentation,  we  ignore  the  variations  in  the  magnitude  of  the  a.c 
conductivity  at  the  resonant  peaks,  instead  concentrating  on  the  locations  of  the  more 
major  peaks. 

Figures  11,  12,  and  13  are  results  for  the  FL,  the  SEL  and  the  CSL  respectively.  One 
finds  in  each  case  that,  considering  the  simplicity  of  the  model,  a  surprisingly  large 
fraction  of  actual  resonances  are  reproduced  without  explicit  use  of  the  wavefunctions 
and  what  is  more,  there  appears  to  be  quite  a  substantial  agreement  even  for  values  of  v 
below  the  limiting  value  vc  where  extended  states  exist  in  the  spectrum  dominated  by 
localized  states.  Even  though  the  correlation  expressed  by  (8)  does  not  make  sense  for  all 
the  wavefunctions  for  v  <  vc,  till  it  does  continue  to  hold  good  for  a  considerable  fraction 
of  the  localized  states;  and  the  contribution  of  these  states  to  a(uS]  makes  our  simplified 
model  useful  even  before  the  m-i  transition  is  completed. 

For  instance,  the  coarse-grained  <r's  computed  in  the  two  approaches  agree  at  a  fraction 
0.926  of  the  total  number  of  sampling  points  for  the  FL  with  v  =  5.0  (figure  11  (a);  most 
of  these  data  points  correspond  to  a  =  0),  and  95.4%  of  the  upward  spikes  are  actually 
reproduced  by  the  downward  ones.  As  expected,  the  agreement  improves  as  y  is  made  to 
increase  beyond  the  limiting  value  into  the  insulator  regime.  Thus,  for  the  FL  with 
v  -  7.0  (figure  (13(b)),  a  fraction  0.99  of  the  coarse-grained  cr's  calculated  in  the  two 
approaches  agree  (most  of  which,  once  again,  correspond  to  a  =  0)  and  all  the  upward 
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Figure  11.  Locations  of  major  resonance  peaks  in  a.c  conductivity  in  accordance 
with  Kubo  formula  with  numerically  obtained  wavefunctions  (upward  spikes) 
compared  with  the  locations  obtained  from  the  simplified  model  of  the  overlap  xmn 
(downward  spikes)  as  explained  in  the  text,  for  the  FL.  Notations  :  r\ ,  r2  as  explained 
in  the  text;  eF,  Fermi  level,  (a)  v  =  5.0,  r\  =  0.02,  r2  =  0.02,  eF  =  0.3364  •  •  •,  (b) 
v  =  7.0,  n  =  0.04,  r2  =  0.01,  eF  =  0.7509  ••-.  Calculated  values  of  a  have  been 
averaged  over  w-intervals  of  width  0.025,  and  data  points  are  at  id-intervals  0.05  as  in 
figure  10. 

spikes  are  reproduced  by  downward  ones,  even  while  our  model  yields  some  downward 
spikes  of  its  own,  not  actually  there  according  to  the  Kubo  formula.  Similarly,  for  the 
Soukoulis-Economou  model,  the  two  computations  agree  at  a  fraction  0.893  of  the 
sampling  points,  and  92%  of  upward  spikes  are  reproduced  by  downward  ones  even  at 
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Figure  12.  Comparison  of  locations  of  peaks  in  a.c  conductivity,  for  the 
(see  legend  to  figure  11;  same  notations),  (a)  i/=1.5,  n=0.03,  r2  = 
ff  =  -0.145  -  •  -,  (b)  v  =  6.0,  r,  =  0.02,  r2  =  0.02,  eF  =  -13505  •  •  -. 

v-\.5  (figure  12(a)),  while  for  i/  =  6.0  (figure  12(b))  the  corresponding  figures  are  0.966 
and  97.7%  respectively.  The  agreement  is  comparatively  poorer  for  the  cosine  model, 
where  the  <r's  computed  from  the  two  approaches  agree  at  a  fraction  0.815  of  the  data 
points  and  77.7%  of  the  upward  spikes  are  reproduced  in  the  set  of  downward  ones 
(figure  13(a))  for  v  ~  1.5  which  is  less  than  the  limiting  value.  The  corresponding  figures 
for  i/  =  4.0-well  inside  the  insulator  regime  -  are  0.985  and  84.6%  respectively 
(figure  1 3(b)).  Moreover,  in  the  CSL  with  v  <  vc  (figure  (13(a)),  our  simple  model  appears 
to  pick  up  numerous  spurious  resonant  peaks  at  the  high  frequency  end  of  the  spectrum. 
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Figure  13.  Comparison  of  locations  of  peaks  in  a.c  conductivity,  for  the  CSL 
(see  legend  to  figure  11;  same  notations),  (a)  v  =  1.5,  r\  =  0.03,^2  =  0.01, 
€P  =  -0.2308  •  •  -,  (b)  v  =  4.0,  n  =  0.02,  r2  =  0.02,  eF  =  -0.0815  •  •  -. 


This  discernible  discrepancy  in  the  CSL  means  that  the  perturbation  series  for  the  energies 
and  wavefunctions  in  the  perturbation  scheme  indicated  in  §  2  does  not  work  so  well  in  this 
case.  However,  even  here  the  level  of  agreement  is  not  one  to  be  dismissed  lightly. 

5.  Summary  and  concluding  remarks 

In  this  paper  we  have  reviewed  a  set  of  known  results  on  unusual  spectral  properties  and 
related  features  of  wavefunctions  in  ID  incommensurate  TBM's  as  well  as  on  features 
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reiaung  10  recurrent  ana  non-recurrent  quantum  dynamics  m  tnese  systems,  we  nave 
presented  numerical  results  on  a  range  of  questions  relating  to  a  set  of  TBM's 
characterised  by  what  may  be  looked  upon  as  'slowly  varying'  on-site  potentials  and  by 
m-i  type  transitions  in  their  spectra.  These  results  supplement  corresponding  findings  on  a 
couple  of  TBM's  of  the  above  type,  presented  in  a  series  of  earlier  papers.  All  these 
results  are  based  on  a  set  of  interesting  correlations  concerning  their  spectra  and 
wavefunctions  that  are  seen  to  hold  in  the  localized  regime,  beyond  the  limiting  value  of 
the  binding-to-hopping  ratio,  and  we  have  presented  a  qualitative  explanation  of  these 
correlations  on  the  basis  of  perturbation  theory. 

We  have  also  reviewed  a  set  of  basic  results  on  the  quantum  dynamics  of  wavepackets 
in  mesoscopic  systems  and,  in  this  context,  have  made  use  of  the  above  set  of  correlations 
for  the  TBM's  under  study  to  arrive  at  a  model,  based  on  the  so-called  Maryland 
construction,  that  enables  us  to  compute  the  quasienergy  spectra  for  problems  involving 
periodic  kicking  in  these  lattices.  While  we  have  not  actually  computed  the  quasienergy 
spectra  due  to  the  prohibitive  size  of  the  computations  involved,  we  have  tested 
qualitative  predictions  from  the  model  against  numerical  iterations  of  the  relevant 
quantum  maps. 

While  our  model  of  wavepacket  dynamics  can  be  used  to  compute  the  quasienergy 
spectra  from  the  equivalent  stationary  TBM  involving  the  'tangent  potential',  it  is  to  be 
noted  that  the  dynamics  of  wavepackets  on  this  equivalent  TBM  is  not  the  same  as 
wavepacket  dynamics  in  the  original  periodically  driven  problem  that  we  start  with 
because  the  equivalent  TBM  is  after  all  nothing  more  than  an  artefact.  Hence,  results  on 
correlation  between  diffusion  in  a  stationary  TBM  and  the  corresponding   spectral 
characteristics  are  not  directly  relevant  in  this  problem.  On  the  other  hand,  features  of  the 
wavefunctions  in  the  equivalent  TBM's  such  as  whether  these  are  extended  or  localised 
are,  to  a  certain  extent  relevant  and  we  have  made  use  of  these  features  to  arrive  at  the 
qualitative  predictions  which  do  agree  with  results  of  numerical  iterations  on  the  quantum 
maps.  However,  the  types  of  wavepacket  spreading  we  have  encountered  in  our  numerical 
iterations  cannot  be  said  to  be  exhaustive  in  the  sense  of  being  comparable  in  variety  to 
those  found  in,  say,  the  kicked  Harper's  model.  For  instance,  we  have  not  encountered  the 
diffusive  spread,  either  normal  or  'anomalous',  of  the  wavepackets.  It  is  not  immediately 
clear  whether  this  is  due  to  the  fact  that  the  wavepacket  dynamics  we  have  looked  into  is 
in  the  energy  lattice  rather  than  in  the  real  space  lattice.  A  more  precise  and  thorough 
analysis  of  the  wavepacket  dynamics  in  the  context  of  our  model  will  be  presented  in  a 
future  communication. 

Closely  related  to  the  dynamics  of  wavepackets  are  the  transport  properties  of  the 
lattices.  Transport  in  ID  incommensurate  lattices  is  characterized  by  numerous  novel 
features  due  to  exotic  spectral  characteristics.  Conversely,  experimentally  observed 
features  of  transport  in  pseudorandom  lattices  can  be  used  as  probes  to  gain  information 
on  spectral  features  and  wavefunctions.  In  particular,  a.c  conductivity  at  low  temperatures 
is  a  highly  sensitive  probe  in  this  respect  as  revealed  from  early  studies  in  the  low 
frequency  regime.  However,  the  theoretical  study  or  numerical  computation  of  the  a.c 
conductivity  in  the  finite  frequency  regime  is  beset  with  formidable  difficulties  because  it 
requires  a  knowledge  of  the  entire  spectrum  and  the  complete  set  of  wavefunctions.  We 
have  made  use  of  the  above  correlations  established  for  the  TBM's  under  study  to  arrive 
at  a  simple  model  for  predicting  the  gross  structure  of  the  variation  of  the  a.c  conductivity 
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with  frequency,  and  have  shown  that  this  simple  model  does  indeed  predict  the  gross 
structure  quite  successfully. 

The  a.c  conductivity  of  incommensurate  lattices  appears  to  be  of  special  interest  since 
the  variation  of  a.c  conductivity  with  frequency,  consisting  of  series  of  resonant  peaks,  is 
of  a  selective  nature  and  is  moreover  highly  specific  to  details  of  structure  of  the  lattice, 
unlike  the  dense  fluctuations  one  observes  in  random  lattices  or  the  featureless  variations 
in  periodic  lattices.  This  is  apparent  from  figures  10(a,b,c)  where  the  locations  of  the 
resonant  peaks  in  CT(OJ)  are  seen  to  follow  distinct  patterns  for  the  FL,  the  SEL,  and  the 
CSL.  For  instance,  most  of  the  resonances  fall  into  band-like  groups  in  the  SEL  and  the 
CSL,  while  they  are  less  clustered  in  the  FL,  even  though  there  appears  to  be  points  of 
accumulation  on  the  frequency  scale  in  this  case.  In  this  context,  our  simple  model 
leading  to  considerably  successful  predictions  on  the  locations  of  these  resonant  peaks 
acquires  significance. 
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Abstract.  Incommensurate  quantum  systems  with  two  competing  periodicities  exhibit  metallic 
(with  Bloch-type  extended  wave  functions),  insulating  (with  exponentially  localized  wave 
functions)  as  well  as  critical  (with  fractal  wave  functions)  phases.  An  exact  renormalization 
method,  which  takes  into  account  the  inherent  incommensurability,  is  used  to  obtain  the  phase 
diagram  of  various  quantum  models  for  the  two-dimensional  electron  gas  and  for  quantum  spin 
chains  in  a  magnetic  field.  In  this  approach,  the  scaling  properties  of  the  fractal  eigenstates  are 
characterized  by  a  fixed  point  or  a  strange  invariant  set  of  the  renormalization  flow.  One  of  our 
novel  results  is  the  existence  of  self-similar  fluctuations  in  the  localized  states  once  the 
exponentially  decaying  envelope  is  factorized  out.  In  almost  all  cases  under  investigation  here,  the 
universality  classes  can  be  broadly  classified  as  those  of  the  nearest-neighbor  square  or  triangular 
lattices. 

Keywords.    Incommensurate  systems;  localization;  fractal. 
PACS  Nos    72.15;  64.60 

1.  Introduction 

Incommensurate  or  quasiperiodic  (QP)  systems  with  two  irrationally  related  frequencies 
provide  a  class  of  aperiodic  models  which  have  been  of  great  interest  to  experimentalists, 
theoreticians  as  well  as  mathematicians.  Incommensurate  structures  appear  in  many 
physical  systems  such  as  quasicrystals,  two-dimensional  electron  systems,  magnetic 
superlattices,  spin  and  charge-density  waves  as  well  as  in  one-dimensional  ionic 
conductors.  [1]  The  mathematical  interest  in  QP  systems  is  due  to  the  small-divisor 
problems  [2]  which  arise  in  that  context.  Theoretically,  the  subject  is  of  great  interest 
because  these  systems  are  considered  to  be  in  between  periodic  and  random.  Periodic 
systems  exhibit  Bloch-type  extended  (E)  states  and  a  continuous  spectrum  while  for  a 
random  potential  all  states  are  localized  (L)  in  one  dimension.  However,  QP  systems 
exhibit  the  metal-insulator  (or  E-L)  transition  in  one  dimension  [1].  At  the  transition  point 
the  eigenfuntions  are  neither  extended  nor  exponentially  localized  and  are  referred  to  as 
critical  (C).  The  spectrum  in  QP  systems  can  be  either  absolutely  continuous,  point-like, 
singular  continuous,  or  a  mixture  of  these  [3]. 

In  this  paper  we  describe  the  results  of  our  new,  exact  nonperturbative  renormalization 
group  (RG)  to  study  the  exotic  fractal  characteristics  of  QP  systems  modelled  by  a 
discrete  Schrodinger  equation.  With  the  exception  of  some  results  in  §  §  4  and  5,  most  of 
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commonality  between  a  variety  of  systems  and  hence  puts  our  results  on  a  more  general 
perspective.   Furthermore,   it   proves   the   usefulness   of  our  RG   methodology  in 
quasiperiodic  systems.  We  hope  that  this  review  will  serve  as  a  useful  source  in  the 
literature  on  incommensurate  systems. 
We  will  describe  a  variety  of  systems  which  can  be  written  in  the  tight  binding  form 

=  0.  (1) 


Here,  •*/>/  is  the  fermion  wave  function  and  a,  b,  and  c  are  site-dependent  real  or  complex 
functions  determined  by  the  potential  and  die  eigenvalue.  We  assume  the  quasiperiodicity 
is  such  that  it  can  be  described  by  a  single  frequency  a  relative  to  the  underlying  lattice, 
i.e.,  a,  b,  and  c  are  essentially  functions  of  the  fractional  part  of  ia  where  a  is  irrational. 
Our  formulation  is  valid  also  for  the  case  where  ^  is  a  multi-component  vector  and  a,  b, 
and  c  are  matrices. 

Our  exact  decimation  scheme  takes  into  account  the  inherent  quasiperiodicity  of  the 
system.  The  resulting  formulation  provides  a  very  precise  criterion  to  distinguish  the  E,  C 
and  L  phases  and  hence  obtaining  the  phase  diagram.  Furthermore,  different  QP 
potentials  may  be  described  by  the  same  asymptotic  RG  flow  which  implies  universality 
for  the  scaling  of  the  fractal  spectra  and  wave  functions.  In  general,  the  scaling  of  the 
wave  functions  is  determined  by  the  asymptotic  RG  fixed  point  or  limit  cycle  whereas  the 
scaling  exponents  for  the  spectrum  are  obtained  from  the  eigenvalues  of  the  linearized 
RG  transformation.  The  existence  of  such  finite  RG  attractors  provides  a  new  and  very 
accurate  method  to  compute  e.g.,  eigenvalues  or  phase  boundaries.  As  we  will  show,  in 
some  cases  the  RG  flow  is  attracted  by  an  infinite  RG  strange  set.  Thus,  the  nonlinearity 
of  the  RG  transformation  opens  the  way  to  chaotic  dynamics  even  though  the  original 
tight  binding  model  (TBM)  is  linear. 

One  of  the  key  questions  mat  we  discuss  here  is  the  fluctuations  of  localized 
wavefunctions.  Localization  of  electrons  in  disordered  materials  has  been  one  of  the 
central  problems  in  condensed  matter  physics.  Various  recent  publications  [9]  have 
addressed  the  problem  of  scaling  of  localized  wave  functions  for  the  case  of  random 
disorder  and  have  speculated  that  exponentially  localized  wave  functions  may  exhibit 
fractal  fluctuations.  However,  the  issue  has  remained  controversial  as  these  numerical 
studies  have  been  plagued  by  finite  size  effects.  We  study  the  fluctuations  of  the  localized 
wave  functions  in  QP  systems  using  our  exact  RG  approach.  The  motivation  for  studying 
such  deterministic  disorder  is  that  in  the  case  of  random  disorder  additional  complexity 
arises  from  distinguishing  between  the  space  fluctuations  of  a  single  wave  function  and 
the  disorder  fluctuations  coming  from  various  different  realizations  of  the  randomness. 
Furthermore,  our  new  RG  scheme  for  QP  models  can  be  implemented  upto  machine 
precision  and  is  not  handicapped  by  finite  size  effects  encountered  in  the  previous  studies. 
In  this  paper,  we  concentrate  on  the  Harper  (10)  and  its  various  generalizations  which 
exhibit  metallic,  insulating,  and  critical  phases.  Our  main  focus  is  the  universal  scaling 
behavior  in  the  C  phase  as  well  as  on  the  universal  self-similar  fluctuations  of  the  L 
phase.  We  discuss  two  different  generalizations  of  the  Harper  equation:  one  is  the  TBM 
associated  with  two-dimensional   electron  gas  with  next  nearest  neighbor    (NNN) 
interaction  and  the  other  is  the  vector  TBM  associated  with  anisotropic  quantum  XY 
model  in  a  transverse  field.  In  these  generalized  models  the  critical  point  is  replaced  by  a 
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n  a  finite  parameter  interval  leading  to  additional  richness  and  various 
Masses. 

res  of  the  RG  method  are  summarized  in  §  2.  In  §  3,  we  briefly  review 
ion  and  describe  the  scaling  associated  with  critical  and  localized 
ection  4  deals  with  the  generalized  Harper  equation  in  the  two- 
•J  coupling  case.  The  resulting  TBM  can  be  either  real  or  complex 
isotropy  properties  of  the  NNN  couplings,  and  we  discuss  the  phase 
cases.  The  real  TBM  exhibits  a  fat  C  phase  described  by  a  strange 
G  flow.  On  the  other  hand,  the  complex  TBM  provides  an  interesting 
itrant  metallic  phase.  In  §  5,  we  discuss  the  vector  TBM  for  the  spinless 
tation  of  the  anisotropic  quantum  XY  model  in  an  inhomogeneous 
The  exchange  anisotropy  causes  this  model  to  deviate  from  the  Harper 
tens  the  critical  point  into  a  C  phase.  We  also  elaborate  on  the 
een  the  TBM  for  an  electron  on  the  triangular  lattice  and  the  quantum 
transverse  field.  Finally,  in  §  6  we  summarize  our  conclusions. 

icthod 

y  developed  a  new  exact  RG  approach  which  takes  into  account  the 
y  a  of  the  system.  For  details  of  the  RG  approach,  we  refer  the  readers  to 
s  [4-8].  Our  methodology  is  somewhat  similar  to  that  of  Ostlund  et  al 
es  the  scaling  properties  of  the  wave  functions  for  a  specific  eigenvalue. 
mod  of  Ostlund  et  al  which  was  based  on  transfer  matrices,  had  limited 
he  fact  that  at  the  localization  threshold  infinite  products  of  transfer 
d.  We  propose  a  scheme  where  instead  of  multiplying  transfer  matrices, 
is  decimated.  The  main  advantage  is  the  reduction  in  the  number  of 
I  to  carry  out  the  renormalization.  The  price  we  have  to  pay  is  that  our 
is  will  be  slighly  more  complicated.  However,  it  turns  out  that  with  fewer 
;  able  to  eliminate  directions  which  lead  to  divergences.  This  not  only 
;hing  the  RG  problem  but  also  provides  practical  means  of  calculating 
quantities  like  the  localization  threshold  and  eigenvalues. 
>n  method  can  be  implemented  for  any  irrational  a  but  here  we  consider 
e  where  u  is  given  by  the  inverse  golden  ratio  a  ~  (\/5  —  l)/2.  In  this 
•priate  to  decimate  out  all  sites  except  those  labelled  by  the  Fibonacci 
lich  are  the  best  rational  approximants  of  the  golden  ratio).  At  the  nth 
1,  the  TBM  is  expressed  in  the  form 

tl>(i  +  Fn+i)=ij>(i  +  Fn)+en(i)ij>(i).  (2) 

)perty  Fn+\  =  Fn  +  Fn-\  of  the  Fibonacci  numbers  provides  exact  recur- 
>r  the  decimation  functions  en  and  /„: 
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me  uerauon  or  me  recursion  relations  is  numerically  more  siaoie  man  uiai  01 
([TBM]). 

(2)  The  possible  self-similarity  observed  by  monitoring  the  behaviour  of  ipj  over  the 
range  i  G  [—  Fn,Fn]  with  increasing  n  can  be  captured  by  a  simple  asymptotic  limit 
cycle  for  the  decimation  functions  en  and/,,.  This  in  turn  determines  the  universal 
scaling  ratios 

0  -  lim  \il>(Fpn+J)/1>(0)\',     j  =  0,  .  .  .  ,p  -  1,  (5) 

It  —  *"OC 

where  p  is  the  cycle  length. 

(3)  An  unknown  or  inaccurate  parameter  in  the  TBM  and  the  asymptotic  limit  cycle  can 
be  simultaneously  determined  self-consistently.  This  is  based  on  using  the  secant 
method  to  calculate  the  parameter  value  for  which  e.g.,  /„(())  =/n_p(0)  for  some  very 
high  n.  In  other  words,  the  decimation  equations  themselves  provide  a  new  method  to 
compute  eigenvalues  or  phase  boundaries  up  to  machine  precision. 

3.  Harper  equation 

The  Harper  equation  [10],  which  describes  a  wide  variety  of  condensed  matter  problems 
including  the  two-dimensional  electron  gas  in  a  transverse  magnetic  field,  has  been  in  the 
forefront  of  theoretical  research  for  the  last  two  decades.  In  addition  to  various  numerical 
and  RG  studies  [1],  the  model  has  also  been  solved  using  the  Bethe-ansatz  [12], 
The  Harper  equation  has  the  simple  form 


V>i+i  +  Vi-i  +  2A  cos[27r(zcr  +  0)]^  =  E^.  (6) 

The  parameter  a  describing  the  magnetic  flux  in  the  two-dimensional  electron  problem 
introduces  quasiperiodicity  into  the  system  and  is  taken  as  the  inverse  golden  ratio 
a  -  (\/5  -  l)/2.  Although  the  phase  diagram  is  the  same  for  all  diophantine  cr,  the 
choice  of  the  golden  mean  simplifies  the  RG  analysis.  Furthermore,  it  is  assumed  that  the 
phase  0  is  chosen  so  that  the  main  peak  of  the  wave  function  &  is  at  i  =  0  [5,  11].  The 
variation  in  A  leads  to  the  metal-insulator  transition.  At  the  onset  of  transition  A  =  1  ,  the 
model  has  been  shown  to  be  self-dual  [13]  with  critical  states  and  a  fractal  butterfly 
spectrum  [14]. 

Figure  1  shows  the  wave  function  with  £  =  0  at  the  critical  point  A=  1.  The 
wavefunction  repeats  itself  at  every  third  Fibonacci  site.  In  fact,  this  repetition  of  the 
wave  function  takes  place  at  all  sites:  i.e.,  for  every  given  site,  there  exists  a  whole 
sequence  of  sites  displaced  by  Qk  (related  to  the  rational  approximants  of  cr3),  where  the 
wave  function  approaches  the  same  amplitude.  [5,  6]  This  type  of  translational  symmetry 
in  the  wave  function  on  a  Fibonacci-lattice  or  Qk-lattice  is  a  characteristic  feature  of  the 
QP  systems.  Therefore,  the  wave  function  does  not  vanish  asymptotically  but  there  are  an 
infinite  set  of  universal  numbers  C  (some  of  them  are  defined  in  5)  describing  the 
amplitude  of  the  wave  function  at  points  which  are  spaced  further  and  further  apart  from 
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Figure  1.    Wave  function  of  the  Harper  equation  at  the  critical  point  A  =  1  with 
E  =  0  and<£  =  1/4. 

the  central  peak.  In  addition,  the  structure  around  the  subpeaks  mimics  the  behavior  seen 
on  a  larger  length  scale  resulting  in  self-similarity  in  the  wave  function. 

In  the  RG  approach,  the  self-similarity  results  in  a  nontrivial  asymptotic  p-cycle  for  the 
decimation  functions  and  the  scaling  ratio.  In  the  E  phase,  the  RG  cycle  is  trivial  with 
£  =  1.  It  should  be  noted  that  eigenstates  with  different  energies  are  characterized  by 
different  asymptotic  behavior  of  the  RG  trajectories.  In  addition  to  E  =  0,  the  eigenstates 
at  the  band  edges  are  described  by  a  limit  cycle  while  almost  all  eigen  energies  are 
believed  to  correspond  to  a  strange  invariant  set. 

In  order  to  investigate  the  localized  states,  the  wave  function  •?/>,•  is  written  as 


=  e-^? 


(7) 


where  7  is  the  Lyapunov  exponent  which  vanishes  in  the  E  and  C  phase.  The  L  phase  is 
characterized  by  a  positive  Lyapunov  exponent  7  =  ln(A)  corresponding  to  an 
exponential  decay  of  the  wave  function  [AA].  Therefore,  the  function  r?/  describing  the 
fluctuations  of  the  localized  wave  function  satisfies  the  following  TBM  for  i  >  0: 

--?7M-i  +  A?7,'_i  +  2Acos[27r(z'cr  +  d})}rji  =  En.  (8) 

A 

We  study  this  TBM  using  the  RG  method.  Figure  2  shows  the  fluctuations  77,-  at  a  band 
maximum  (the  band  center  with  E  —  0  is  described  by  a  period-3  RG  cycle).  The 
fluctuations  for  A  >  1  possess  the  same  complexity  and  richness  as  the  critical  states. 
Furthermore,  it  turns  out  that  the  self-similar  fluctuations  are  universal  throughout  the  L 
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Figure  2.    Absolute  value  of  self-similar  universal  fluctuations  in  the  supercritical 
phase  of  the  Harper  equation. 

phase  and  are  characterized  by  a  unique  strong  coupling  RG  fixed  point  with 
C  =  0.172586  •  •  • .  In  particular,  after  writing  the  RG  transformation  and  the  decimation 
functions  in  the  form  where  the  discrete  lattice  index  i  is  replaced  by  a  continuous 
renormalized  variable,  we  are  able  to  solve  for  a  power  series  expansion  of  the  fixed  point 
by  the  Newton  method.  Details  of  the  RG  fixed  point  and  the  eigenvalue  analysis  are 
discussed  elsewhere.  [8]  It  should  be  noted  that  although  the  fluctuations  bear  a 
qualitative  resemblance  to  the  wave  functions  in  the  C  phase,  quantitatively  they  belong 
to  different  universality  classes. 

4.  Generalized  Harper  equation:  Bloch  electrons  with  NNN  interaction 

The  Harper  equation  models  a  two-dimensional  Bloch  electron  on  a  square  lattice  in  a 
transverse  magnetic  field  when  only  the  hopping  to  nearest-neighbor  (NN)  lattice  sites  is 
taken  into  account.  Including  the  hopping  also  to  the  NNN  sites  one  obtains  the  following 
IBM: 


cos[27r(fecr 


(9) 
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Figure  3.  Phase  diagram  of  the  two-dimensional  electron  gas  with  NNN  interaction. 
For  tab  =  ta-b,  the  region  bounded  by  the  AL  and  CE  lines  is  critical.  However,  as  soon 
as  these  two  NNN  couplings  become  different  from  each  other,  the  C  phase  is  replaced 
by  a  reentrant  E  phase.  The  solid  lines  BC,  AC,  and  CE  always  remain  critical.  The 
extension  of  the  line  AC  (line  CL)  separates  the  L  phase  into  two  different  regions 
with  different  scaling  properties. 


Here  ta  and  tb  are  the  NN  couplings  while  tab  and  ta-b  are  the  diagonal  NNN  couplings 
which  are  taken  to  be  zero  in  the  case  of  the  Harper  equation.  In  the  isotropic  limit 
tab  —  ta-b,  the  TBM  becomes  real  and  has  been  studied  in  various  recent  papers  [15,  7,  8]. 
The  two  cases  ta^  —  tai  and  tab  ^  tai  will  be  discussed  separately  below. 

Case  I.  tab  —  tab-  In  this  limit,  the  above  TBM  can  be  written  as 

{1  +  a  cos[27r(o-(i  +  i)  +  <£)]}^H-I  +  {1  +  a  cos[27r(o-(i  -  5)  +  0)]M-i  ' 
+  2A  cos[27r(<ri  -f  0)]V>;  =  Etjji .  (10) 

Here  A  =  tb/ta  and  a  =  2tab/ta.  The  Harper  equation  corresponds  to  the  limit  a  =  0 
where  the  NNN  coupling  term  is  "zero. 

Figure  3  shows  the  phase  diagram  of  the  model  in  the  A  —  a  plane  [15,  7].  By 
carrying  out  the  RG  analysis  at  the  band  minimum,  we  find  that  the  model  belongs 
to  the  universality  class  of  the  Harper  equation  for  a  <  I  [7].  Therefore,  along  the 
line  BC  (except  at  the  point  C),  the  universal  scaling  is  identical  to  that  of  the 
Harper  equation.  An  interesting  feature  of  the  phase  diagram  is  that  the  C  phase 
exists  in  a  finite  region  of  parameter  values.  Unlike  the  scaling  at  the  critical  point 
of  the  Harper  equation  given  by  limit  cycles  of  the  RG  equation,  the  scaling  in  this 
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Figure  4.  A  two-dimensional  projection  of  the  attractor  of  the  renormalization  flow 
in  the  strong  coupling  limit  A  — >•  oo  with  a  =  fi  >  A.  For  about  4000  different 
parameter  values,  decimation  equations  were  iterated  35  times  and  the  first  10  iterates 
were  ignored  as  transients. 


C  phase  was  found  to  be  characterized  by  an  infinite  strange  set  of  the  RG  flow 
[7].  Therefore,  the  fractal  wave  functions  do  not  exhibit  self-similarity  and  are 
described  by  an  ergodic  set  which  brings  out  the  order  underlying  the  chaotic  RG 
trajectories. 

It  turns  out  that  the  qualitative  behavior  of  the  RG  trajectories  describing  the 
fluctuations  in  the  L  phase  are  similar  to  that  of  the  C  phase  discussed  above.  The 
universality  class  for  the  fluctuations  in  the  L  phase  turns  out  to  be  unaltered  as  long  as 
a  <  1  [8].  Above  the  threshold  a  =  1  (above  the  line  CL),  the  situation  is  again  much 
more  complicated.  Almost  all  parameter  values  in  this  region  are  described  by  the  same 
infinite  ergodic  set  of  scaling  ratios.  In  analogy  with  the  C  phase  discussed  above,  the 
ergodic  set  of  scalings,  when  projected  on  two  dimensions,  exhibit  an  intriguing  order 
manifested  in  the  form  of  an  orchid-flower  (see  figure  4). 

Another  intriguing  feature  of  the  C  and  L  phase  is  that  there  appears  to  be  a  special  set 
of  parameter  values  a  =  1/|  cos(27rr)|  with  rational  r  where  the  RG  trajectories  in  both 
the  L  and  C  phase  converge  on  a  limit  cycle  of  a  high  period  (for  a  fixed  a,  the  same  limit 
cycle  is  observed  for  all  A  in  the  L  phase  whereas  in  the  C  phase  also  A  has  to  be  chosen 
carefully  to  obtain  a  finite  cycle).  We  conjecture  that  these  unstable  fixed  points  are  dense 
on  the  strange  attractor  of  the  RG  flow. 

Detailed  numerical  studies  of  RG  flow  show  that  the  lines  in  the  phase  diagram 
describing  the  boundaries  between  the  E,  C  and  L  phases  exhibit  fractal  wave  functions 
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which  differ  quantitatively  from  the  wave  functions  in  the  C  phase  discussed  above.  The 
lines  AC  (E-C  transition)  and  CE  (C-L  transition)  appear  to  correspond  to  their  own  RG 
strange  sets  [7].  The  attractor  associated  with  the  line  CE  seems  to  envelope  the  strange 
set  of  the  C  phase  enclosed  by  AC  and  CE. 

Case  II.  tab  i^tal-  fr  tms  case' me  TBM  is  complex  and  has  not  been  fully  investigated 
previously.  Using  the  duality  property  of  the  model,  Han  et  al  [15]  calculated  the 
Lyapunov  exponent  of  the  model  analytically  in  the  space  of  the  three  parameters 
A  =  tb/ta,  a  —  2tab/ta,  and  /3  =  2tai/ta.  They  concluded  that  the  system  is  localized  for 
A  >  1  if  (a  +  /3)/2  <  1  and  for  A  >  (a  +  (3}/2  otherwise.  Apart  from  the  existence  of 
the  metal-insulator  transition,  nothing  was  known  about  the  scaling  properties  of  the 
complex  model.  In  particular,  one  of  the  important  questions  is  how  the  inequality  in  the 
NNN  couplings  affects  the  scaling  properties  in  the  C  and  L  phases  described  by  the 
strange  invariant  sets. 

With  the  decimation  methodology,  we  found  that  the  phase  diagram  changed 
discontinuosly  as  a  —  ft  became  different  from  zero.  As  soon  as  a  and  J3  differ,  a.  —  ft 
was  found  to  be  an  irrelevant  parameter  and  the  phase  diagram  is  determined  solely  by 
two  parameters:  A  and  (a  4-  j3)/2. 

In  the  parameter  range  (a  +  ft}/2  <  1,  the  decimation  functions  become  asymptoti- 
cally real  approaching  the  same  universal  cycles  as  for  the  Harper  equation  a  =  ft  =  0. 
For  a  4-  /3  >  1,  a  ^  ft,  the  decimation  functions  stay  complex  also  asymptotically. 
Here  the  C  phase  of  the  case  a  =  ft  >  1  is  replaced  by  a  reentrant  E  phase.  Moreover, 
the  E-L  transition  belongs  to  the  universality  class  of  the  corresponding  transition 
in  the  Harper  equation  upto  a  complex  phase  factor  in  £.  The  invariant  strange  sets  of 
the  renormalization  describing  the  fluctuations  in  the  L  phase  for  the  limit  a  =  ft 
degenerates  into  the  strong  coupling  Harper  fixed  point  as  soon  as  a  and  ft  become 
different  [16]. 

On  the  line  (a  +  ft}/ 2  =  1,  the  model  can  be  explicitly  shown  to  be  related  to  the  TBM 
for  the  triangular  lattice,  upto  a  complex  phase  factor  [16].  The  TBM  describing  Bloch 
electrons  on  a  triangular  lattice  is  [17] 

cos [7^0-+  (0  +  ^)]^+!  +cos[7r^cr-{-  (0-0)]^-i 

+  A  cos[27r(icr  +  (j))]^i  =  E^i.  (11) 

Our  detailed  RG  analysis  shows  that  unlike  the  E-L  transition  of  the  TBM  describing  the 
square  lattice,  the  TBM  describing  the  triangular  lattice  exhibits  the  C-L  transition:  i.e., 
the  line  AC  remains  critical  (described  by  a  unique  fixed  point  except  at  the  point  C 
which  is  characterized  by  its  own  fixed  point)  while  the  wave  functions  along  the  line  CL 
are  localized.  The  fractal  characteristics  on  the  line  AC,  point  C  and  the  fluctuations  on 
the  line  CL  define  new  universality  classes  (of  the  triangular  lattice)  which  are 
quantitatively  different  from  those  of  the  square  lattice,  namely  the  Harper  universality 
^  classes.  The  reentrant  E  phase  can  be  understood  in  the  anisotropic  limit,  a  — »  oo, 

0/a  -»  0,  and  \/a  -+  0. 

The  resulting  TBM  at  a  site  k  is  related  to  the  weak  coupling  limit  of  the  Harper  model 
by  a  complex  phase  factor  exp(z27r</>&)  exp(/?rcr^2)  [16]. 
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We  next  describe  a  related  problem,  namely  the  XY  quantum  spin  chain,  described  by  a 
vector  TBM  which  can  be  viewed  as  a  perturbed  Harper  equation 

J(y,x)Vi+}  +  /(*,>)*,_!  +2h&i  =  £*,-,  (12) 

where  h{  -  A  cos  (27r(i<r  +  </>)),  J(x,y)  and  E  are  2x2  matrices  and  $  is  a  two- 
dimensional  vector  describing  the  coupled  system 


/^--  1  +  Zhfa  =  Erjh  (13) 

Jxrji+  1  +  Tyfy--  1  +  ZhiVi  =  Eifri  .  (14) 


The  above  model  is  the  fermion  representation  of  the  quantum  XY  spin  chain  in  a 
transverse  field  with  the  Hamiltonian  [18] 


Here  5  are  the  Pauli  matrices  describing  the  spin  1/2.  Jx  and  Jy  describe  ferromagnetic 
exchange  interactions.  The  model  can  also  be  written  as 

(Hh  -  2gHg)(Hh  +  2gHg}$  =  E2*,  (16) 

where  Hh  denotes  the  Harper  Hamiltonian  and  Hg  denotes  the  part  of  the  Hamiltonian 
which  exists  only  at  a  non-zero  value  of  the  anisotropy  g  defined  as  Jx  =  1  -  g  and 
Jy-l  +  g.  We  see  that  the  anisotropy  perturbs  the  "squared"  Harper  model. 

Figure  5  shows  the  phase  diagram  in  the  A  -  g  plane  [19,  20].  Whenever  A  and  one  of 
the  exchange  interactions  become  equal  either  the  E-C  or  C-L  transition  takes  place.  The 
E-C  transition  corresponds  always  to  a  smaller  absolute  value  of  A,  i.e.  to  the  exchange 
interaction  whose  absolute  value  is  smaller.  Whenever  Jx  and  Jy  differ  a  fat  C  phase  is 
observed  in  the  phase  diagram.  Therefore,  the  fattening  of  the  critical  point  is  due  to  the 
breaking  of  the  17(1)  symmetry  in  the  fermion  Hamiltonian  which  is  a  consequence  of  the 
broken  0(2)  symmetry  in  spin  space. 

An  interesting  consequence  of  the  spectral  and  magnetic  interplay  is  the  fact  that  the 
onset  of  the  C-L  transition  is  coincident  with  the  onset  of  the  magnetic  transition  to  the 
long  range  order  (LRO)  where  two-point  long  range  spin-spin  correlations  vanish. 
However,  the  significance  of  the  E-C  transition  on  the  magnetic  properties  of  the  quantum 
chain  remains  a  mystery. 

The  C  phase  of  the  model  is  found  to  be  described  by  three  different  universality 
classes  [6].  The  E-C  transition  is  characterized  by  a  different  universality  class  from  that 
of  the  C-L  transition  line.  The  critical  phase  sandwiched  between  the  E-C  and  C-L 
transitions  is  conjectured  to  form  its  own  universality  class  although  the  existence  of  a 
RG  cycle  has  been  confirmed  only  in  the  Ising  limit  7^  =  0  (see  the  next  section).  All 
these  three  universality  classes  are  different  from  that  of  the  Harper  critical  point. 

The  fluctuations  in  the  L  phase  are  believed  to  be  universally  characterized  by  a  strong 
coupling  fixed  point.  This  was  confirmed  in  the  Ising  limit  discussed  in  the  next  section. 
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Figure  5.  Phase  diagram  for  the  quasi-particle  excitations  of  the  quantum  spin  chain 
with  Jx  =  1  —  g  and  Jy  —  1  +  g.  For  g  =  ±1,  the  model  reduces  to  Ising  model  with  C 
and  L  phases.  The  dark  boundary  line  separating  C  and  L  phases  is  the  conformal  line 
describing  the  onset  to  magnetic  long  range  order. 


5.2  Relation  to  the  block  electrons 

It  is  rather  interesting  that  the  Bloch  electron  problem  and  the  spin  problem  are  related  in 
certain  limits.  The  TBM  for  the  triangular  lattice  (see  (11))  resembles  the  TBM  for  the 
Ising  limit  (Jy  =  0)  of  the  quantum  spin  chain  discussed  in  the  previous  section  [19,  20], 
Both  are  modeled  by  a  NN  TBM  where  the  diagonal  as  well  as  the  off-diagonal  terms  are 
modulating.  The  periodicity  of  the  diagonal  term  is  twice  the  periodicity  of  the  off- 
diagonal  term.  The  only  difference  between  these  two  models  is  in  the  relative  phase 
differences  between  the  diagonal  and  off-diagonal  terms.  As.  a  consequence,  the  wave 
functions  at  the  band  edges  have  exact  symmetry  in  the  triangular  model  but  only 
asymptotic  symmetry  in  the  quantum  Ising  model  (QM). 

The  phase  diagram  is  the  same  for  both  systems  with  C  phase  for  A  <  1  and  the  L 
phase  for  A  >  1  (see  figure  5  at  g  =  —  1).  There  are  three  RG  limit  cycles  which 
determine  the  asymptotic  scaling  of  the  wave  functions  at  the  upper  or  lower  band  edges 
in  both  systems  for  A  <  1,  A  >  1,  and  A  =  1,  respectively.  However,  an  exception  to  this 
rule  is  formed  by  the  lower  band  edge  of  the  QDVI  at  A  =  1  which  is  described  by  its  own 
RG  fixed  point.  The  corresponding  state  is  believed  to  be  conformally  invariant  mapping 
to  none  of  the  quantum  states  of  the  triangular  model  [5,  6]. 

The  fact  that  RG  analysis  singles  out. the  conformal  state  in  distinguishing  the  two 
models  that  bear  striking  mathematical  resemblance  (inspite  of  some  important  physical 
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oi  the  physics  underlying  the  QP  models  discussed  here. 

Another  interesting  feature  is  that  the  conformally  invariant  state  of  the  Ising  model 
also  describes  the  fluctuations  in  the  exponentially  localized  strong  coupling  limit  of  the 
Harper  equation  for  E  —  0  [8].  In  other  words,  the  self-similar  fluctuations  at  the  band 
center  of  the  supercritical  Harper  model  describe  the  self-similarity  of  the  quantum  state 
responsible  for  the  long  range  order  in  Ising  spin.  This  fact  can  be  understood  by  noticing 
that  at  the  conformal  point,  the  TBM  described  by  (13)  resembles  equation  (8)  describing 
the  fluctuations  in  the  localized  phase  of  the  Harper  equation.  Therefore,  the  excitations 
in  the  spin  model  are  related  to  the  fluctuations  in  the  localized  states  of  the  Harper 
equation  and  the  conformal  universality  class  of  the  spin  model  is  related  to  the  strong 
coupling  universality  of  the  Harper  equation. 

6.  Conclusions 

Our  study  of  a  variety  of  systems  with  a  common  feature  being  the  competing  length 
scales,  shows  the  existence  of  interesting  phase  diagrams  exhibiting  E-L,  E-C  and  C-L 
transitions.  Furthermore,  another  intriguing  possibility  is  the  reentrant  E  phase.  This 
richness  in  the  phase  diagram  manifests  itself  in  interesting  variations  in  the  transport 
properties  as  the  parameters  of  the  system  are  varied.  We  believe  that  our  results  can  be 
experimentally  realized  on  two-dimensional  mesoscopic  systems.  The  results  on  the 
quantum  spin  chain  may  be  relevant  in  magnetic  superlattices. 

Wave  functions  in  both  the  critical  and  localized  phases  exhibit  fractal  character.  The 
universal  scaling  properties  in  these  phases  can  be  studied  with  very  high  precision  using 
a  decimation  method.  The  decimation  scheme  unveils  some  interesting  relationships 
between  seemingly  unrelated  problems  and  limits  the  number  of  universality  classes.  One 
of  our  important  results  is  that  a  wide  class  of  systems  can  be  shown  to  belong  to  the 
universality  class  of  the  NN  square  lattice  (Harper  universality)  or  the  triangular  lattice. 
In  addition  to  determining  the  phase  diagram  and  scaling  characteristics,  our  method 
provides  a  new  tool  to  determine  eigenvalues,  phase  boundaries,  and  Lyapunov 
exponents.  This  is  because  unknown  parameters  in  the  TBM  can  be  determined  self- 
consistently  with  the  RG  limit  cycle  upto  machine  precision.  We  refer  the  readers  to  our 
earlier  papers  for  various  detailed  examples. 

Another  important  result  of  our  studies  that  the  L  phase  exhibit  the  same  richness  as  the 
C  phase  with  fractal  characteristics,  has  to  be  viewed  from  a  more  general  perspective. 
The  localization  phenomenon  in  QP  systems  is  analogous  to  the  Anderson  localization  of 
the  random  systems.  Therefore,  our  findings  strengthen  the  likelyhood  of  fractal 
characteristics  in  other  random  or  aperiodic  systems  exhibiting  localization. 

Finally,  we  would  like  to  point  out  that  there  are  many  other  systems  in  both  classical 
and  quantum  chaos  [24]  which  can  be  modeled  by  [TBM].  One  interesting  example  is  the 
study  of  phonon  modes  in  the  classical  Frenkel-Kontorova  model  exhibiting  the  pinning- 
depinning  transition  [22,23].  Our  very  recent  studies  have  shown  the  existence  of  a  new 
type  of  phonon  modes  with  eigenstates  represented  by  an  infinite  series  of  step  functions 
[24].  Another  important  problem  that  is  under  investigation  is  the  quantum  kicked  rotor 
which  has  been  mapped  to  a  TBM  [21].  Thus,  we  believe  that  there  are  many  more 
interesting  results  to  be  discovered  using  the  RG  methods. 
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Abstract.  Finite  clusters  of  atoms  or  molecules,  typically  composed  of  about  50  particles  (and 
often  as  few  as  13  or  even  less)  have  proved  to  be  useful  prototypes  of  systems  undergoing  phase 
transitions.  Analogues  of  the  solid-liquid  melting  transition,  surface  melting,  structural  phase 
transitions  and  the  glass  transition  have  been  observed  in  cluster  systems.  The  methods  of  nonlinear 
dynamics  can  be  applied  to  systems  of  this  size,  and  these  have  helped  elucidate  the  nature  of  the 
microscopic  dynamics,  which,  as  a  function  of  internal  energy  (or  'temperature')  can  be  in  a 
solidlike,  liquidlike,  or  even  gaseous  state.  The  Lyapunov  exponents  show  a  characteristic 
behaviour  as  a  function  of  energy,  and  provide  a  reliable  signature  of  the  solid-liquid  melting  phase 
transition.  The  behaviour  of  such  indices  at  other  phase  transitions  has  only  partially  been  explored. 
These  and  related  applications  are  reviewed  in  the  present  article. 

Keywords.    Clusters;  chaos;  phase  transitions;  Lyapunov  exponents. 
PACS  Nos    5.45;  64.70;  36.40 


1.  Introduction 

One  major  impetus  for  research  in  the  dynamics  of  nonintegrable  systems  has  been  the 
hope  of  understanding  the  basis  for  the  effectiveness  of  statistical  mechanical  ideas  in  the 
description  of  systems  with  several  degrees  of  freedom  [1].  On  the  one  hand,  while  the 
ergodic  hypothesis  is  presumed  to  be  valid  for  generic  many-body  Hamiltonians,  it  is 
difficult  to  prove  for  any  particular  system  since  a  realistic  dynamical  picture  of  a  multi- 
particle  system  is  not  easy  to  obtain  owing  to  the  extremely  large  number  of  degrees  of 
freedom.  On  the  other  hand,  the  application  of  ideas  of  statistical  mechanics  to  few-body 
systems  is  fraught  with  difficulty. 

A  regime  where  some  progress  can  be  made  has  been  provided  in  recent  years  by 
cluster  systems,  namely  aggregates  of  atoms  or  molecules  consisting  of  typically  between 
10  and  1000  particles  [2].  These  systems  are  small  enough  that  the  explicit  dynamics  can 
be  followed  for  reasonably  long  times.  They  are  also  large  enough  for  it  to  be  possible  to 
characterise  them  as  having  a  well-defined  temperature  or  being  in  a  particular  state  of 
matter.  Indeed,  in  recent  years,  a  major  interest  in  cluster  studies  is  the  elucidation  of 
simple  phase  transformations  [3,4].  The  study  of  the  dynamics  at  these  phase 
transformations  is  the  subject  of  this  article. 

Clusters,  being  intermediate  in  size  between  molecules  and  bulk  matter,  have  physical 
and  chemical  properties  which  can  sometimes  be  very  different  from  the  properties  of  a 
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classical  molecular  dynamics  (CMD)  techniques.  The  connection  between  the 
microscopic  dynamics  and  thermodynamics  or  statistical  mechanics  can  be  explored  in 
some  detail  by  studying  clusters  of  increasing  size  or  varying  other  parameters  of  the 
system  such  as  particle  mass  or  the  interaction  potential.  These  studies  also  play  an 
important  role  in  clarifying  the  spread  of  ergodic  behavior  in  many-body  systems. 

In  the  context  of  nonlinear  studies,  finite  clusters  are  important  as  they  provide  useful 
examples  of  micro-  and  mesoscopic  systems  where  the  methods  of  nonlinear  dynamics 
(which  were  essentially  developed  with  low-dimensional  systems  in  mind)   can  find 
application.  For  a  N-particle  cluster,  the  number  of  freedoms  is  3N,  which  is  quite  large 
for  even  the  smallest  (N  «  7)  systems  studied  in  this  context.  Given  the  highly  nonlinear 
nature  of  the  interparticle  interaction,  the  motion  is  unlikely  to  be  globally  regular  at 
finite  temperatures.  Thus,  chaotic  dynamics  may  provide  the  basic  mechanism  for 
inducing  statistical  behavior  in  cluster  systems  by  facilitating  the  rapid  mixing  among 
modes  and  consequent  redistribution  of  vibrational   energy.   In  general,   stochastic 
trajectories  are  presumed  to  be  the  main  cause  for  giving  rise  to  statistical  behavior,  when 
the  ergodic  hypothesis,  namely  that  the  time  averages  are  identical  to  the  phase  space 
averages,  is  expected  to  hold.  This  statistical  description  is  manifest  in  the  measured 
values  and  the  distributions  of  different  quantities  [9-11]. 

With  regard  to  simple  phase  transitions,  it  is  now  well-established  that   although 
the  numbers  of  atoms  or  molecules  present  is  small  compared  to  bulk  material  (~  1023), 
clusters  can  exhibit  structural  isomerization  [12-15]  and  related  phenomena.   Small 
clusters  go  from  a  solid-like  phase  to  a  liquid-like  phase  as  the  temperature  or  total 
internal  energy  is  increased  [16].  In  bulk  matter,  the  temperature  at  which  the  solid 
melts,  Tm,  is  same  as  the  temperature  at  which  the  corresponding  liquid    freezes, 
Tf.  However,  Tm  and  Tf  can  be  different  in  finite  clusters  [17],  when  there  can  be 
dynamical  coexistence  [18,  19]  of  both  of  these  phases  in  the  intermediate  temperature 
regime  (between  Tf  and  rm)  when  clusters  start  melting.  This  phenomenon  is  distinct 
from  the  phase  change  in  bulk  matter,  and  has  been  experimentally  verified;  extensive 
molecular  dynamics  and  Monte  Carlo  simulations   [16]   also   support  this    picture. 
Cluster  systems  can  show  yet  another  form  of  coexistence  in  surface  melting  where 
atoms  in  the  cluster  core  are  at  a  lower  effective  temperature  and  are  in  a  solid  state,  while 
atoms  on  the  surface  are  in  a  liquid-like  state  and  show  much  greater  diffusivity  [13,  20]. 
At  temperatures  below  the  melting  transitions,  there  can  also  be  structural   phase 
transitions  [21].  In  addition,  the  so-called  glass  transition  can  also  be  mimicked  by  cluster 
systems  [22]. 

The  methods  and  techniques  of  nonlinear  dynamics  of  relevance  to  the  study  of  atomic 
clusters  are  reviewed  in  this  article.  These  include  the  computation  of  Lyapunov 
exponents,  in  particular  the  largest  (maximal)  one  which  has  been  found  to  be  a  reliable 
phase-space  indicator  of  cluster  phase  change,  the  Kolmogorov  (K-)  entropy,  short  time 
distributions  of  local  K-entropy  and  Lyapunov  exponents  and  Lyapunov  spectra.  These 
prove  invaluable  m  understanding  phase  transitions  since  there  are  few  available  reliable 
measures  for  exploring  the  process  of  phase  change  in  finite  systems. 
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In  the  next  section  we  briefly  describe  the  classical  methodology  of  cluster  simulations, 
and  define  the  Lyapunov  exponents  and  the  K-entropy.  We  then  discuss  the  connection 
between  phase  transformations  (solid-to-liquid)  and  the  maximal  Lyapunov  exponent 
(MLE).  This  is  followed  by  a  review  of  some  of  the  other  techniques  that  have  been 
applied  in  cluster  dynamical  studies.  We  conclude  in  §  5  with  a  summary. 

2.  Cluster  dynamics,  Lyapunov  exponents  and  the  Koimogorov  entropy 

The  simulation  of  clusters  is  usually  performed  through  Monte  Carlo  (MC)  methods,  or 
through  molecular  dynamics  (MD)  [23],  namely  the  integration  of  the  relevant 
Hamilton's  or  Newton's  equations  of  motion.  MD  simulations  can  be  carried  out  either 
at  constant  energy,  when  the  microcanonical  ensemble  obtains,  or  at  constant 
temperature,  with  the  use  of  thermostats,  in  which  case  the  canonical  ensemble  results. 
For  dynamical  studies  the  most  relevant  are  the  MD  methods  although  some  progress 
is  being  made  in  the  extraction  of  dynamical  information  from  equilibrium  MC 
simulations  [24]. 

Details  of  the  molecular  dynamics  simulations  of  )V-particle  systems  have  been 
extensively  documented  [23],  and  the  computational  methodology  of  integrating 
Newton's  equation  of  motion  has  been  developed  in  considerable  detail.  Since  a  cluster 
is  an  isolated  group  of  particles,  the  boundary  conditions  are  simpler  to  incorporate  than 
in  bulk  or  liquid-state  simulations.  The  interaction  between  particles  is  normally  taken  to 
be  composed  of  pair-potentials,  so  that  the  total  potential  energy  V  for  a  configuration 
R=  n,  r2,...,rN  is 


where  r,-  is  the  position  of  the  i  th  particle  and  ry  =  |  r/  —  ry-  is  the  distance  between 
particles  i  and  j.  The  pair-potential  V(r)  is  usually  taken  to  be  of  a  simple  analytic 
form,  such  as  the  Lennard-Jones  (for  rare-gas  systems),  Yukawa  (for  nuclear  systems) 
or  Gupta  (for  metallic  clusters)  [25-28].  The  total  potential  energy,  V,  is  an  extremely 
complicated  function  of  the  configuration  R.  Finding  the  ground  state,  namely  the 
global  minimum,  is  a  tricky  problem,  although  for  small  enough  clusters,  this  and 
the  lowest  few  local  minima  (corresponding  to  isomers)  can  be  located  by  several 
techniques. 
The  dynamics  follows  Newton's  equations,  which  are  (m,-  is  the  mass  of  the  zth  particle) 

dV 
mfr(-  =  -—     /=1,2,...,  AT  (2) 

The  total  energy,  namely  the  Hamiltonian  7i  =  T  +  V,  where  T  =  J^m/r,-2,  is 
conserved,  as  are  all  the  components  of  the  total  momentum  and  the  angular  momentum, 
giving  a  total  of  7  constants  of  the  motion.  At  any  given  total  energy,  the  'temperature'  of 
the  cluster  is  defined  through  the  average  kinetic  energy  as 


(3N-6)kb 
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£b  being  the  Boltzmann  constant  and  {  }  denoting  an  average  over  the  entire  trajectory. 
The  spectrum  of  Lyapunov  exponents  of  a  given  dynamical  system  characterizes  the 
dynamical  instability  of  the  orbits.  An  W-freedom  hamiltonian  system  has  2N  Lyapunov 
exponents,  Ai,...,A2w  which  are  defined  by  the  eigenvalue  spectrum  of  the  squared 
Jacobi  matrix  [29] 

a(p(Q.r(0) 
0(p(0),r(0))' 
(Pi  =  mri     and    p  =  pr--p^)  (4) 

.in  the  asymptotic  limit  i.e., 

je2Air, . . . , e2Awr}  =  eigenvalue  spectrum  of  JT(T}J(T]  as  r  ->  oo  (5) 

Therefore,  for  a  //-particle  system  in  three  dimensions  there  are  6N  Lyapunov  exponents, 
which  come  in  matching  positive-negative  pairs  as  a  consequence  of  Liouville's  theorem 
[30].  They  are  independent  of  initial  condition  (r(0),  p(0))  for  ergodic  Harniltonians,  and 
further,  if  there  are  L  independent  constants  of  motion  then  2L  of  the  exponents  vanish. 
Hence  for  a  cluster  system  of  N  particles  there  will  be  37V  -  7  positive  Lyapunov 
exponents.  From  Pesin's  theorem  [31]  the  sum  of  these  equals  the  Kolmogorov  entropy, 
which  measures  the  rate  of  loss  of  information  in  a  chaotic  system.  Standard  techniques  to 
compute  the  entire  spectrum  of  Lyapunov  exponents  have  been  devised,  and  these  can  be 
further  simplified  to  get  only  the  largest  one,  the  MLE  [32,  33].  If  the  distance  between 
two  trajectories  at  time  t  =  0  is  d(0),  and  the  distance  at  time  t  is  d(f),  then 

X,  =  lim   lim  -  log  M  (6) 

'->ood(o)-+of        d(0) 

The  actual  computation  of  the  MLE  uses  the  above  definition,  but  calculates  the  rate  of 
increase  of  distance  in  tangent  space  [32]. 


3.  Phase  transitions  and  Lyapunov  exponents 

The  behaviour  of  the  MLE  at  a  phase  transition  was  first  studied  by  Butera  and  Caravati 
[34]  who  simulated  the  planar  O(2)  Heisenberg  model  of  interacting  planar  rotors  on  a 
lattice  in  2d  as  a  function  of  temperature.  They  noted  that  the  MLE  had  a  distinct  bend  or 
'knee'  at  the  temperature  of  the  Kosterlitz-Thouless  transition. 

In  subsequent  studies,  Posch  and  Hoover  [35]  studied  Lyapunov  spectra  for 
monoatomic  system  in  two  and  three  spatial  dimensions,  both  solid  and  liquid  states 
using  a  repulisive  potential.  Their  results  show  that  the  partial  spectrum  of  positive 
Lyapunov  exponents,  An,  are  well  approximated  by  a  power  law  Xn  =  an&  with  J3  «  1  for 
solid-like  states  and  0  w  1  for  the  liquidlike  states.  Hinde  et  al  [36]  examined  the  entire 
Lyapunov  exponent  spectra  in  7  atom  rare-gas  cluster  and  observed  the  linear  dependence 
of  An  with  distinct  slopes  in  solidlike  and  liquidlike  states. 

The  idea  that  the  MLE  could  be  an  indicator  of  phase  changes  was  further  investigated 
by  Nayak  et  al  [37],  who  examined  the  melting  of  rare-gas  clusters  by  using  classical 
molecular  dynamics  techniques.  Simulations  of  atomic  clusters,  using  the  Lennard-Jones 
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pair  potential  for  cluster  sizes  of  7,  13  and  55  showed  that  the  MLE  has  a  characteristic 
dependence  at  the  temperature  of  the  solid-liquid  melting  transition. 

Studies  of  finite  clusters  have  explored  this  transition  for  several  years,  and  the  usual 
manner  in  which  melting  is  detected  is  by  computing  the  Lindemann  index,  namely  the 
average  rms  fluctuation  of  the  atoms  about  their  equilibrium  positions, 


i  indexing  the  various  atoms,  /-,-;  the  interparticle  distances  and  (  }  implying  an  average 
over  a  MD  trajectory.  The  Lindemann  criterion  [38,  39]  for  bulk  melting  is  that  S  is  larger 
than  0.1,  and  has  its  origin  in  empirical  observations  on  the  characteristic  distances 
between  atoms,  as  estimated  from  the  amplitude  of  vibrations,  which  is  greater  in  the 
liquid  state  as  compared  to  the  solid.  This  criterion  is  used  even  for  the  study  of  small 
systems  and  is  known  to  correlate  well  with  other  indicators  of  melting  such  as 
diffusivity. 

A  striking  result  of  the  dynamical  studies  [37]  was  that  the  change  in  the  largest 
Lyapunov  exponent  occurs  at  the  same  total  energy  when  8^  0.1,  namely,  when  the 
Lindemann  criterion  for  melting  was  satisfied.  At  melting,  the  cluster  goes  from  fairly 
rigid  to  nonrigid  configurations,  and  is  hence  able  to  access  a  larger  volume  of  the  phase 
space.  As  a  result,  larger  nonlinearities  become  important  and  the  motion  becomes 
globally  chaotic.  Thus  the  MLE,  which  is  either  zero  (or  small)  for  energies  below  the 
transition  temperature,  increases  significantly  at  the  transition  (figure  1).  For  clusters  such 
as  Arn,  the  change  in  the  MLE  at  the  melting  transition  is  large,  whereas  in  smaller 
clusters  such  as  Ar7,  there  is  only  a  discontinuity  in  the  slope  as  the  energy  increases.  In 
even  smaller  clusters  such  as  Ar3,  the  MLE  increases  uniformly  with  temperature,  and 
there  is  no  explicit  signature  of  a  phase  transition  (Ar7  is  the  smallest  cluster  that  can  be 
considered  to  show  any  evidence  of  a  change  in  "phase"  with  temperature  [40]). 
Comparison  with  previously  published  results  [18,  41,  42]  for  various  cluster  sizes  shows 
that  the  behaviour  of  the  MLE  with  energy  or  temperature  closely  parallels  the  behaviour 
of  the  Lindemann  index. 

The  magnitude  of  the  MLE  may  be  correlated  with  the  volume  of  phase  space  available 
to  the  system,  which  also  increases  significantly  at  a  phase  transition.  Such  increase  of 
phase-space  volume  apparently  accelerates  the  divergence  of  nearby  trajectories,  and  this 
appears  to  be  a  generic  feature  of  nonlinear  systems.  Since  the  sum  of  positive  Lyapunov 
exponents  is  the  Kolmogorov  entropy,  it  is  not  surprising  that  the  MLE  should  reflect  the 
large  increase  in  entropy  at  a  phase  transition.  This  connection  has  been  explored  to  some 
extent-see  e.g.  rets.  [34,  40,  43].  One  also  therefore  expects  that  the  energy  density  of 
states  should  increase  sharply  at  approximately  the  energy  at  which  the  isoergic  MD 
simulations  show  a  sharp  rise  in  the  Lyapunov  exponent.  This  has  indeed  been  noted 
earlier  [44],  as  an  increase  in  the  configurational  density  of  states  in  the  cluster  solid- 
liquid  coexistence  regime,  which  in  turn  is  reflected  in  a  bimodal  character  of  PE 
distributions  obtained  from  canonical  Monte  Carlo  simulations  for  the  phase  coexistence 
temperatures.  (The  density  of  states  can  be  constructed  from  constant  temperature 
canonical  ensemble  simulations  using  multiple  histogram  method;  see  [37]  for  details.) 
Although  the  discussion  above  is  in  terms  of  the  internal  energy  of  the  clusters,  similar 
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Figure  1.  Variation  of  the  largest  Lyapunov  exponent,  AI  ,  with  internal  energy  (units 
are  e  per  atom)  for  Lennard-Jones  clusters  of  various  sizes.  The  dependence  of  MLE 
on  the  temperature  also  shows  similar  behavior. 
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Figure  2.  (a)  Variation  of  Lindemann  index  with  energy  per  particle  (in  reduced 
units)  for  Gupta?,  clusters  for  N  =  6,7,13.  (b)  Variation  of  maximal  Lyapunov 
exponent  with  energy  per  particle  (in  reduced  units)  for  Gupta^  clusters  for 
#  =  6,7,13. 


observations  and  conclusions  hold  if  one  considers  the  behaviour  of  the  MLE  as  a 
function  of  the  average  kinetic  energy  or  temperature. 

At  the  melting  transition,  both  the  Lindemann  index  and  the  Lyapunov  exponent  give 
identical  information  about  the  change  in  phase  of  the  cluster.  At  other  transitions,  when  a 
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transition'  is  not  as  well  defined,  it  is  often  difficult  to  find  a  reliable  indicator  of 
change.  Take,  for  example,  the  putative  glass-like  transition  that  has  been  seen  in 
lusters,  an  example  of  which  is  provided  by  Arjg  [22].  The  global  behavior  of  the 
as  been  examined  [45]  as  a  function  of  average  kinetic  energy  or  temperature  for 
tarting  from  different  configurations  corresponding  to  the  global  minimum,  local 
i  and  saddles  in  the  potential  energy  hypersurface.  The  radial  function  which  is 
>nly  used  in  bulk  studies  proves  unreliable  in  detecting  phase  change,  while  the 
:an  be  shown  to  characterise  the  instability  in  a  precise  manner,  both  in  bulk 
ils  as  well  as  in  finite  clusters  [45]. 

generality  (and  perhaps  universality)  of  these  observations  needs  to  be  investigated 
il.  Mehra  and  Ramaswamy  [46]  have  studied  melting  in  transition-metal  clusters 
tie  Gupta  potential.  In  these  systems  there  is  no  regime  of  solid-liquid  coexistence, 
re  can  be  a  fluctuating  state  where  the  cluster  continuously  moves  between  two 
res  with  zero  diffusion  coefficient.  Simulations  of  6,  7,  and  1 3  atom  clusters  have 
parallel  behavior:  no  discontinuities  were  observed  in  the  MLE,  but  a  marked 
inuity  in  the  slope  near  Tm  was  noted  (figure  2). 

ther  recent  work  on  the  dynamics  at  phase  transitions,  Bonasera  et  al  [47]  have 
d  CMD  simulations  near  the  critical  point  for  the  liquid-gas  transition.  They  study 
m  of  100  nucleons  interacting  through  a  pairwise  Yukawa  potential,  as  well  as  a 
helium  droplet  containing  about  800  He4  atoms  interacting  via  the  Slater- 
)od  potential  and  claim  that  the  MLE  has  a  peak  at  Tc  and  can  be  fitted  near  the 
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Figure  3.  (a)  Probability  distribution  of  the  maximal  Lyapunov  exponent  for  Ara. 
The  two  different  symbols  correspond  to  two  different  initial  conditions.  The 
distributions  differ  markedly  and  is  a  signature  of  non-ergodicity  at  low  temperatures. 
See  the  text  for  details,  (b)  Probability  distribution  of  MLE  for  Ar3  for  two  different 
initial  conditions  at  E  =  -0.156  x  1CT13  erg/atom,  r  =  32.3K.  The  distributions 
collapse  on  each  other  indicating  that  the  system  has  attained  ergodicity. 

critical  temperature  with  a  power  law 


with  exponent  uj  =  0.15  (in  both  the  systems).  It  is  not  clear  if  this  observation  is  on  solid 
ground  since,  as  has  been  pointed  out  by  Nayak  et  al  [47],  the  numerical  fit  to  data  is 
rather  poor  and  the  error  bars  in  the  data  itself  is  unacceptably  large.  At  continuous  phase 
transitions  though,  the  MLE  shows  characteristic  and  distinctive  behaviour.  Apart  from 
the  study  of  Butera  and  Caravati,  there  is  recent  work  by  Yamaguchi  [10]  who  has  studied 
a  globally  coupled  Hamiltonian  lattice  which  undergoes  a  continuous  phase  transition  as 
energy  per  particle  is  increased.  There  is  apparently  a  peak  in  the  MLE  at  Tc  although  the 
model  is  integrable  both  in  zero-energy  and  high-energy  limits.  It  appears  that  at 
continuous  phase  transitions  the  Lyapunov  exponent  has  a  maximum  value,  while  for 
first-order  transitions  MLE  typically  displays  a  sigmoidal  curve.  Welding  these  disparate 
observations  into  a  coherent  whole  and  giving  a  satisfactory  explanation  of  the 
phenomena  is  currently  a  major  theoretical  challenge. 

The  general  features  of  the  behaviour  of  the  MLE  at  phase  transitions  can  be  verified 
by  studying  simpler  systems  where  the  same  behaviour  occurs.  Simple  dissipative 
dynamical  systems  show  so-called  'crises'  [48],  where  the  phase  space  volume  changes 
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discontinuously  with  the  change  in  a  system  parameter.  Crises  (which  also  occur  in 
conservative  systems  [49])  can  be  of  several  types.  For  attractor  widening  crises  [50] 
where  the  volume  increases  by  an  order  of  magnitude  or  more,  the  MLE  changes  abruptly 
at  the  crisis  point,  increasing  as  a  power  law.  For  attractor  merging  crises  when  the  phase 
space  volume  doubles,  the  variation  of  the  MLE  with  parameter  is  "knee"-like,  i.e.  there 
is  a  change  in  the  slope  of  the  MLE.  Interestingly,  both  these  behaviours  have  been  seen 
in  the  case  of  phase  change  in  finite  system  (c.f.  the  cases  of  Arn  and  Ar7),  as  well  as  in 
phase  transitions  in  other  systems. 

Calculations  of  entire  spectrum  of  Lyapunov  exponents  have  also  been  made  [36, 
46,  51].  For  the  study  of  complex  systems  this  is  indeed  important,  but  calculating  them 
through  the  standard  methods  such  as  the  tangent  space  technique  can  be  computationally 
intensive.  New  methods  proposed  [52,  53]  can  circumvent  this  bottleneck.  Lyapunov 
spectra  characterize  the  short-time  behaviour  of  correlation  functions,  and  are  related  to 
the  entire  set  of  relaxation  times  in  the  system  [54]  although  this  has  been  established 
only  for  simple  low-dimensional  systems.  For  high-dimensional  systems,  in  the  limit 
N  — *  oo,  the  existence  of  a  smooth  distribution  An,  n  =  i,...,N  has  been  numerically 
verified  [55].  In  the  highly  chaotic  regime  the  function  \n  is  linear  [56],  and  this  has  been 
verified  for  Ar  clusters  (see  figure  3(a)).  In  other  systems,  such  as  metallic  clusters  (using 
the  Gupta  potential)  the  results  differ:  the  Lyapunov  spectra  for  6,  7  and  13  particle  metal 
clusters  have  been  calculated,  and  Xn  is  found  to  be  a  nonlinear  function  of  n  [50] 
(figure  3(b)).  The  behaviour  of  Lyapunov  spectra  at  phase  transitions  needs  to  be 
investigated. 


4.  Local  K-entropy  and  local  Lyapunov  exponents 

Global  dynamical  indicators  are  useful,  but  they  are  also  somewhat  limited  in  that  they  do 
not  provide  details  about  local  dynamics — whether  motion  within  a  particular  region  is 
more  or  less  chaotic.  They  also  do  not  give  any  indication  of  the  time-dependence  of 
phenomena  such  as  the  spread  of  ergodicity.  This  information  is  provided  by  local 
indicators  of  the  dynamics,  namely  the  short  time  distributions  of  the  K-entropy  and  the 
Lyapunov  exponent.  Berry  and  coworkers,  in  a  series  of  papers  [11,  36,  57],  related  the 
local  K-entropy  information  to  the  topology  of  the  cluster  potential  energy  surface  (PES). 
The  local  K-entropy  is  finite-time  analogue  of  the  global  entropy,  and  is  obtained  by 
dividing  a  long  trajectory  into  several  parts  of  duration  N  A?  and  calculating  the  entropy 
for  each  time  segment  separately.  From  this  the  probability  distribution  of  entropies  is 
obtained,  which  can  then  be  compared  with  the  distribution  of  potential  energy  averaged 
over  same  duration  A/Af. 

A  direct  way  to  characterize  the  structure  of  a  PES  is  to  catalogue  its  various  minima 
and  saddles  through  steepest  gradient  and  quenching  techniques.  This  can  be  done  for 
small  clusters  but  as  the  number  of  minima  and  saddles  increases  exponentially  with  size, 
it  is  a  formidable  task  to  list  all  of  these.  The  geometric  features  of  a  saddle  can  be 
determined  by  calculating  the  average  curvature  from  the  eigenvalues  of  the 
instantaneous  Hessian  matrix,  namely  the  instantaneous  normal  mode  spectra  [24].  The 
analysis  of  clusters  in  saddle  regions  is  important  in  understanding  phenomena  such  as 
isomerization  and  solid-liquid  transition. 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  II) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  611 


0.2 


0.1 


v  T  =  40  K 
a  T  =  34  K 
o  T  =  24  K 


a  v 

pa  v 

Un  D  v 

h°o  a  a        v 

0  o  o  „  a  n 


"DoVv 
0  o  o  a  a 

o  o 


a  g 


J I        .1 I         I.       ,1 L 


10  20 

i 


30 


1.5    - 


1.0 


0.5 


(b) 


r=o.oo9 

T'=0.032 


10 


20 


30 


n 


Figure  4.  (a)  Lyapunov  spectra  for  Ari3  at  three  temperatures  (1)  24  K  (solid  phase), 
(2)  34  K  (coexistance  phase)  and  (3)  40  K  (liquid  phase),  (b)  Lyapunov  spectra  for 
GuptaJ3  at  reduced  temperatures  (1)  T*  =  0.009  (solid)  and  (2)  T*  =  0.032  (liquid). 
Only  the  32  positive  exponents  are  shown. 

In  Ar3  and  Ar4  clusters,  which  have  rather  flat  saddles,  the  local  K-entropy  was  found 
[11,  36,  57]  to  be  significantly  smaller  in  the  saddle  region  as  compared  to  other  regions 
of  the  PES.  This  was  attributed  to  an  increase  in  the  degree  of  quasiperiodicity  near  the 
saddle  arising  from  a  partial  decoupling  of  the  cluster's  vibrational  modes  (as  measured 
by  the  total  vibrational  coupling).  In  Ar5  on  the  other  hand,  the  local  K-entropy  in  saddle 
regions  was  comparable  to  that  in  the  well  while  for  Are  and  Ar7  saddle  entropy  was 
intermediate  between  entropies  of  the  two  wells  which  are  connected  by  that  saddle.  All 
these  clusters  have  relatively  steep  saddles,  which  suggests  that  the  'regularizing'  effect 
of  a  saddle  depends  on  the  shape  of  PES  near  the  saddle,  especially  the  flatness  of  the 


612 


Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  H) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


saddle  along  the  isomerization  or  reaction  path.  This  behaviour  is  size-dependent:  in 
larger  clusters  the  influence  of  a  particular  saddle  is  reduced  as  the  system  will  not  spend 
much  time  on  a  sharp  saddle. 

In  systems  where  the  topography  of  the  potential  energy  surface  is  very  complicated, 
ergodicity  cannot  be  guaranteed  at  all  energies  or  temperatures.  This  can  be  seen  by 
examining  the  distribution  of  finite-time  or  local  Lyapunov  exponents  which  are  defined 
in  the  same  manner  as  the  local  K-entropy,  by  dividing  a  long  trajectory  into  segments  of 
length  N6t.  Amitrano  and  Berry  [57, 58]  calculated  the  probability  distribution  of  the 
largest  local  Lyapunov  exponent  for  Ara  at  different  values  of  energy  for  various  time 
intervals,  and  related  these  distributions  to  the  evolution  of  ergodicity.  At  low  energies 
(T  =  18K)  in  the  argon  trimer,  the  shape  of  the  distribution  depends  on  the  initial 
conditions  for  any  time  interval:  the  system  does  not  exhibit  ergodicity  even  on  long  time 
scales.  At  intermediate  energies  (T  =  28  K)  the  system  is  found  to  be  ergodic:  different 
initial  conditions  gave  similar  unimodal  distributions  but  with  different  second  moments. 
At  energies  high  enough  to  allow  saddle  crossing  (T=31K)  the  distributions  are 
bimodal  up  to  a  particular  time  interval,  and  unimodal  for  longer  times,  showing  a  time 
scale  separation  between  the  intra-well  and  inter-well  motions.  At  still  higher  energy 
(T  =  37  K)  one  has  true  ergodic  behavior  with  unimodal  distributions  even  for  shortest 
trajectory  segments.  Figure  3(a)  is  the  probability  distribution  of  MLE  for  Ars  at 
E  =  -0.156  x  10~13  erg/atom,  T  =  7.41K  for  two  different  initial  conditions,  the 
trajectories  being  of  108  MD  time  steps.  The  distributions  shows  no  sign  of  convergence, 
indicating  the  non-ergodic  nature  of  the  system  at  that  temperature.  In  contrast,  the 
probability  distributions  of  MLE  at  a  higher  energy  E  =  —0.089  x  10~13  erg/atom, 
T  =  32.3  K  (see  figure  3(b))  for  two  different  initial  conditions  collapse  on  each  other 
indicating  the  system  has  attained  ergodic  behavior.  These  observations  at  the 
microscopic  level  are  significant  particularly  in  the  context  of  transition  from  non- 
ergodic  to  ergodic  behavior  in  small  finite  systems.  Similarly,  at  T  =  14  K  the 
autocorrelation  function  of  the  local  Lyapunov  function  is  oscillatory,  while  it  rapidly 
decays  to  zero  for  T  >  18K.  Qualitatively  similar  results  hold  for  Ar7.  A' general  dieory 
of  these  phenomena  has  not  yet  been  given,  and  it  is  not  a  simple  matter  to  extend  these 
calculations  to  larger  cluster  sizes. 


5.  Discussion  and  summary 

Finite  cluster  systems  may  provide  one  regime  where  the  methods  of  dynamical  systems 
and  statistical  mechanics  can  simultaneously  be  applied.  Given  the  highly  nonlinear 
character  of  the  typical  cluster  dynamical  system,  it  is  not  surprising  that  chaotic  motion 
is  both  so  widespread  and  that  it  plays  such  an  important  role.  For  small  clusters,  although 
the  number  of  freedoms  is  not  too  large,  it  is  possible  to  focus  on  particular  features  of 
interest  —  such  as  the  K-entropy  or  the  Lyapunov  exponents  —  and  explore  the  dynamics 
of  how  statistical  behaviour  such  as  ergodicity  develops  in  such  systems. 

There  are,  however,  dynamical  regimes  of  interest  when  the  motion  of  clusters 
becomes  simple.  For  example,  at  very  low  energies  above  the  global  minimum,  the 
cluster,  like  many  hamiltonian  systems,  can  be  expected  to  be  nearly  integrable.  In  a 
recent  study  of  Aris  clusters,  Salian  et  al  [59]  examined  the  dynamics  of  collective 
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excitations  by  exciting  the  monopole  mode,  namely  that  mode  or  oscillation  where  the 
particles  execute  radial  motion  only,  as  a  function  of  temperature.  Below  a  threshold, 
which  they  determined  to  be  T$  =  7.2  K,  this  mode  does  not  decay  for  long  times.  Above 
Ts,  the  time  dependence  of  the  amplitude  variation  could  be  accurately  modeled  by  a 
damped  harmonic  oscillator,  while  below  Ts,  the  monopole  mode  is  periodic  or 
quasiperiodic.  This  suggests  [59]  that  as  long  as  the  cluster  is  confined  to  the  global 
minimum  in  the  PES,  the  monopole  mode  does  not  decay.  When  the  temperature  is  high 
enough  for  the  system  to  come  out  of  this  well,  the  nearby  local  minima,  which 
correspond  to  rearrangement  of  one  or  a  few  particles  act  as  decay  channels,  akin  to 
single-particle  excitations  in  nuclei  [60].  The  MLE  in  this  temperature  range  has  also 
been  studied  [61]:  below  rs,  the  MLE  is  essentially  zero,  and  although  in  the  so-called 
'solid1  state,  the  MLE  becomes  positive  only  above  Ts.  Thus  the  survival  of  the  symmetric 
monopole  mode  is  intimately  connected  to  the  onset  of  globally  chaotic  motions  in  the 
dynamics. 

A  major  focus  of  this  article  has  been  on  the  appropriateness  of  using  the  Lyapunov 
exponents  as  indicators  of  phase  change  since  this  focuses  on  a  property  of  the  phase 
space  behaviour  of  the  system.  The  dynamics  of  the  solid-liquid  melting  transition,  the 
liquid-gas  transition,  and  the  glass  transition  in  finite  systems  have  been  studied  using  this 
quantity.  For  phase  transitions  where  order  parameters  based  on  other  criteria  can  be 
ambiguous  (as  for  example  the  glass  transition),  the  MLE  may  be  able  to  provide  the 
clearest  signature  of  the  transition. 
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Abstract.  In  this  review  we  present  the  salient  features  of  dynamical  chaos  in  classical  gauge 
theories  with  spatially  homogeneous  fields.  The  chaotic  behaviour  displayed  by  both  abelian  and 
non-abelian  gauge  theories  and  the  effect  of  the  Higgs  term  in  both  cases  are  discussed.  The  role 
of  the  Chern-Sirnons  term  in  these  theories  is  examined  in  detail.  Whereas,  in  the  abelian  case, 
the  pure  Chern-Simons-Higgs  system  is  integrable,  addition  of  the  Maxwell  term  renders  the 
system  chaotic.  In  contrast,  the  non-abelian  Chern-Simons-Higgs  system  is  chaotic  both  in  the 
presence  and  the  absence  of  the  Yang-Mills  term.  We  support  our  conclusions  with  numerical 
studies  on  plots  of  phase  trajectories  and  Lyapunov  exponents.  Analytical  tests  of  integrability 
such  as  the  Painleve  criterion  are  carried  out  for  these  theories.  The  role  of  the  various  terms  in  the 
Hamiltonians  for  the  abelian  Higgs,  Yang-Mills-Higgs  and  Yang-Mills-Chern-Sirnons-Higgs 
systems  with  spatially  homogeneous  fields,  in  determining  the  nature  of  order-disorder  transitions  is 
highlighted,  and  the  effects  are  shown  to  be  counter-intuitive  in  the  last-named  system. 

Keywords.    Gauge  theories;  integrability;  chaos;  phase  space;  Lyapunov  exponents. 
PACS  Nos    11.15;  05.45 


1.  Introduction  and  overview 

The  dynamical  behaviour  of  gauge  theories,  whose  importance  in  particle  physics  is 
well  known,  has  been  studied  in  detail  in  recent  times.  This  has  been  a  primer  to  the 
understanding  of  the  chaotic  behaviour  of  field  theories.  Early  investigations  along 
these  lines  in  the  context  of  non-abelian  gauge  theories  were  carried  out  with  an  aim 
to  understanding  the  structure  of  the  field-theoretic  vacuum  and  the  asymptotic  states 
of  the  theory  [1]  with  particular  reference  to  strong  interaction  physics.  This  was 
motivated  by  the  belief  that  there  was  a  connection  between  colour  confinement  and 
chaos  in  quantum  chromodynamics,  the  popular  theory  of  strong  interactions.  Further 
interest  in  such  investigations  arose  because  some  gauge  theories  support  solitons,  and  in 
a  sense,  these  topological  structures  and  chaos  are  examples  of  opposite  extremes  of 
nonlinear  behaviour,  and  their  coexistence  in  field  theories  needs  to  be  examined  and 
understood. 

Since  the  systems  considered  are  conservative  and  are  described  by  Hamiltonians, 
non-integrability  of  the  evolution  equations  implies  chaos.  From  this  point  of  view, 
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extensive  studies  have  been  conducted  on  the  classical  Yang-Mills  (YM)  system  which 
is  the  non-abelian  gauge  theory  that  (when  suitably  adapted)  describes  certain 
fundamental  interactions  in  particle  physics.  Even  in  the  late  seventies,  attention  was 
drawn  to  the  fact  that  only  if  self-duality  conditions  are  imposed,  are  the  classical  YM 
equations  integrable  [2].  This  was  confirmed  by  the  Painleve  test  for  integrability  in  the 
special  case  of  the  self-dual  517 (2)  YM  theory  [3].  In  the  general  case,  where  self-duality 
conditions  are  not  imposed,  most  results  reported  in  the  literature  pertain  to  a  special 
reduction  of  the  classical  YM  equation.  This  reduction  corresponds  to  obtaining  a 
dynamical  system  (DS)  from  the  field  theory  by  considering  the  fields  to  be  spatially 
homogeneous. 

Although  the  DS  is  an  extremely  simplified  version  of  any  field  theory,  there  exists  a 
well-defined  limit  where  this  simplification  is  justified.  The  field  configurations  relevant 
for  the  ground  state  of  quantum  chromodynamics,  for  instance,  would  vary  only  slowly  in 
space  and  would  correspond  to  a  long  wavelength  region.  The  DS  involving  spatially 
homogeneous  fields  is  a  good  approximation  in  this  case.  In  particle  production  it  is 
conjectured  that  chaotic  field  configurations  may  be  excited  during  violent  collisions  of 
particles,  in  localized  regions  of  space,  where  the  fields  are  approximately  homogeneous. 
These  could  contribute  significantly  to  some  physical  processes,  in  which  their  signatures 
would  be  unique  [4]. 

The  DS  that  is  obtained  from  the  field  theory  by  assuming  the  fields  to  be  spatially 
homogeneous  is  clearly  governed  by  coupled  ordinary  differential  equations  describing 
the  time  evolution  of  the  fields,  in  contrast  to  the  full  field  theory  described  by  a 
complicated  set  of  coupled  partial  differential  equations.  YMDS  which  is  the  dynamical 
system  obtained  in  this  manner  from  the  YM  field  theory,  was  shown  numerically  to 
be  chaotic,  in  the  SU(2)  case,  for  a  suitable  choice  of  ansatz  for  the  spatially 
homogeneous  gauge  fields  [5].  The  ansatz  reduces  the  theory  to  a  dynamical  system  with 
two  degrees  of  freedom,  since  the  time  component  Ag  of  the  gauge  potential  (a  is  an 
SU(2)  index)  is  set  equal  to  zero  and  the  space  components  are  defined  in  terms  of  two 
variables  alone.  Whereas  the  Liouville-Arnol'd  criterion  for  integrability  requires  the 
existence  of  two  constants  of  the  motion,  only  the  Hamiltonian  corresponding  to  YMDS 
is  conserved  and  no  additional  first  integral  exists.  Numerical  studies  of  the  phase 
trajectories  add  support  to  the  inference  that  YMDS  is  chaotic.  In  [5]  it  has  been 
conjectured  that  since  there  exists  at  least  one  reduction  of  the  YM  field  theory  which  is 
chaotic,  the  full  field  theory  should  also  exhibit  chaos.  It  is  worth  noting  that  yet  another 
reduction  obtained  by  a  different  choice  of  ansatz  for  the  spatially  homogeneous  fields 
leads  to  nonlinear  plane  wave  solutions  of  the  YM  equations  in  Minkowski  space,  which 
also  exhibit  chaotic  behaviour  [6].  Analytical  tests  of  integrability  like  the  ARS  method 
[7]  would  be  conducted  in  the  same  spirit,  by  examining  in  principle  all  reductions  of  the 
coupled  partial  differential  equations  describing  the  field  theory  of  interest.  The  complete 
field  theory  is  said  to  be  integrable  if  all  reductions  pass  the  analytical  test  for 
integrability. 

The  YMDS  equation  has  been  investigated  from  the  point  of  view  of  ergodic  theory 
[8,  9]  wherein  it  is  inferred  that  it  displays  strong  statistical  properties,  and  is 
homeomorphic  to  the  Krylov-Sinai  billiards  system  with  hyperbolic  walls. 

The  chaotic  nature  of  the  YM  field  equations  has  been  explicitly  verified  in  [10,  11] 
where  it  is  shown  that  the  phase  space  near  the  Wu-Yang  static  solution  is  a  stochastic 
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region.  The  verification  of  the  existence  of  spatio-temporal  chaos  in  this  case,  relies 
essentially  on  the  Fermi-Pasta-Ulam  approach  [12]  where  the  continuous  string  is 
replaced  by  a  discrete  lattice,  for  numerical  purposes. 

These  conclusions  based  on  YMDS  (and  further  strengthened  by  explicit  field- 
theoretic  computations)  change  significantly  when  Higgs  terms  are  added  to  the  gauge 
field.  The  Yang-Mills-Higgs  (YMH)  system  has  been  extensively  investigated  in  this 
respect.  YMH  is  important  in  particle  physics  as  several  physical  models  based  on  it, 
incorporating  spontaneous  breakdown  of  the  gauge  symmetry,  have  been  suggested  and 
successfully  applied  to  the  unification  of  the  fundamental  forces  of  nature. 

From  the  point  of  view  of  DS,  the  50(3}  Georgi-Glashow  model  which  admits 
monopole  solutions  has  been  shown  (under  suitable  assumptions),  to  exhibit  chaos  when 
reduced  to  its  dynamical  counterpart  where  the  fields  are  considered  to  be  spatially 
homogeneous  [13,  14].  By  applying  the  Painleve  prescription  it  has  been  established  in 
[14]  that  the  model,  with  a  suitable  ansatz  for  the  gauge  fields  and  the  Higgs  field  r/>,  is 
algebraically  non-integrable  for  A  >  2g2.  Here  A  is  the  coefficient  of  </>4  in  the  Lagrangian 
and  g  is  the  gauge  coupling  constant.  The  Toda-Brumer  criterion  [15]  when  applied  to 
this  model  yields  an  expression  for  the  critical  energy  for  the  onset  of  chaos,  in  terms  of  A 
and  m  (the  coefficient  of  02  in  the  Lagrangian).The  full  field-theoretic  version  of  the 
5(9(3)  Georgi-Glashow  model  has  also  been  investigated  by  considering  the  phase  space 
around  the  t'Hooft-Polyakov  monopole  solution  [16].  There  exists  a  phase  transition  in 
the  system  in  the  following  sense:  For  small  perturbations  around  the  t'Hooft-Polyakov 
monopole  solution,  the  motion  is  regular  and  the  system  is  close  to  an  integrable  one.  As 
the  strength  of  the  perturbation  increases  above  a  critical  value,  the  motion  becomes 
chaotic.  Since  the  pure  YM  system  is  always  chaotic,  this  result  shows  that  the  Higgs 
fields  are  necessary  to  have  an  ordered  phase  in  the  theory. 

In  order  to  address  the  question  of  whether  the  order-chaos  transition  is  an  inherent 
phenomenon  only  observed  in  the  case  where  a  monopole  solution  exists  in  the  system,  a 
different  model  has  been  considered  in  the  literature  [17].  This  is  described  by  an  517(2) 
YMH  theory  which  admits  spherically  symmetric  sphaleron  solutions  and  where  the 
fields  are  taken  to  be  spatially  homogeneous.  Through  extensive  numerical  work  it  is 
established  in  this  case  too  that  an  order-chaos  transition  occurs  similar  to  that  observed 
in  the  monopole  system. 

The  nature  of  chaos  that  arises  in  the  abelian  counterparts  of  the  foregoing  has 
also  been  investigated  [13,  14,  18].  The  abelian  Higgs  (AH)  model  as  a  DS  is  shown  to  be 
algebraically  non-integrable  in  most  of  the  type-I  region  of  superconductivity  [14],  The 
conclusion  is  further  supported  by  numerical  work  including  investigation  of  phase  plots 
and  Lyapunov  exponents  in  both  the  Georgi-Glashow  and  AH  cases  [13]. 

It  therefore  appears  from  the  details  reviewed  above  that  a  chaotic  system  like  YMDS 
exhibits  regularity  for  some  parameter  values  if  the  Higgs  term  is  introduced  and  that  a 
transition  from  order  to  chaos  takes  place  in  the  YMH  system.  However,  in  contrast,  the 
message  from  the  abelian  counterpart  seems  to  be  that  a  regular  system  (Maxwell) 
undergoes  an  order-to-chaos  transition  on  introduction  of  the  Higgs  field.  At  this  stage, 
therefore,  the  role  of  the  Higgs  term  is  still  not  clear.  More  detailed  studies  on  this  aspect 
have  been  undertaken  in  [19,  20,  21]. 

A  related  question  pertains  to  whether  other  terms  apart  from  the  Higgs  coupling  can 
produce  order  in  YMDS  for  any  suitable  range  of  the  parameters  of  the  theory.  A  possible 
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candidate  in  this  regard  would  be  the  Chern-Simons  (CS)  term.  In  (2  +  1)  dimensions  the 
CS  term  plays  a  very  interesting  role  in  both  abelian  electrodynamics  and  non-abelian 
gauge  theories  [see,  for  example,  [22]].  Owing  to  its  presence,  the  gauge  field  acquires 
mass  and  yet  the  theory  is  gauge  invariant.  The  CS  term  can  be  generated  through 
spontaneous  symmetry  breaking  [23].  Since  it  seems  to  implement  the  effect  of  the  Higgs 
term  in  this  sense,  it  is  important  to  know  its  effect  on  YM  from  the  dynamical  systems 
point  of  view  as  well. 

Interesting  effects  have  been  observed  [4]  in  the  YMCS  system  when  the  fields  are 
made  spatially  homogeneous.  Order-chaos  transitions  occur  within  narrow  energy  ranges 
and  have  been  tracked  in  this  model,  for  instance,  in  the  energy  range  between  0.37  and 
0.40  in  suitable  units.  Below  the  lower  limit  of  energy  the  system  displays  order  and 
above  the  upper  limit,  chaos.  The  authors  have  observed  a  hyperfine  structure  of  order- 
chaos-order —  transitions  present  at  all  arbitrary  small  scales  within  the  energy  range 
0.37  to  0.40. 

Thus  while  both  the  CS  and  Higgs  terms  seem  to  bring  about  order-chaos  transitions  in 
YMDS  (which  is  otherwise  totally  chaotic),  the  transition  region  seems  to  display 
different  stochastic  properties  depending  on  whether  the  CS  term  or  the  Higgs  term  is 
added  to  the  gauge  theory.  It  is  therefore  of  interest  to  explore  in  detail  the  combined 
effect  of  both  the  CS  and  Higgs  couplings  in  the  abelian  and  non-abelian  situations.  New 
features  emerge  in  this  case,  some  of  them  rather  counter-intuitive.  The  details  of  these 
calculations  are  reported  in  [19,  20,  21],  and  will  form  the  contents  of  the  following 
sections.  Here  we  give  a  summary  of  the  results. 

We  find  that  the  abelian  Chern-Simons-Higgs  system  without  the  Maxwell  term 
is  integrable,  whereas  the  inclusion  of  the  Maxwell  term  makes  the  system  chaotic 
[19].  We  then  consider  the  SU(2}  Yang-Mills-Chern-Simons-Higgs  system  with 
homogeneous  fields,  both  in  the  absence  of  the  YM  term  (NACSH)  and  in  its  presence 
(YMCSH)  [20].  In  contrast  to  the  abelian  case,  where,  in  the  absence  of  the  Maxwell 
term,  the  system  is  integrable,  here  both  NACSH  and  YMCSH  are  non-integrable.  All  this 
is  verified  by  a  Painleve  analysis  as  well  as  by  a  study  of  phase  plots  and  Lyapunov 
exponents.  In  the  case  of  YMCSH,  the  dimensionality  of  the  phase  space  being  large 
(eighteen),  we  have  restricted  ourselves  to  those  sections  of  phase  space  where  we  expect 
interesting  features  and  our  plots  reveal  that  the  dynamical  evolution  ranges  from  regular 
to  chaotic  behaviour.  We  see  that  in  the  limit  of  large  Chern-Simons  mass,  large  Higgs 
coupling  constant,  or  small  gauge  coupling  constant,  NACSH  and  YMCSH  become 
regular. 

In  the  above  analysis,  we  retain  the  full  vector  nature  of  the  gauge  fields  and  do 
not  restrict  our  study  to  just  a  single  ansatz,  in  contrast  to  some  of  the  earlier 
works  reported  in  the  literature,  on  classical  gauge  theories  as  dynamical  systems.  We 
have  examined  in  detail  the  Lyapunov  spectra  for  NACSH  and  YMCSH  as  functions 
of  the  various  couplings  and  parameters  in  the  system  [21].  We  find  that  there  exists 
an  interplay  between  the  oscillatory  effects  of  the  CS  term,  the  regularising  effect 
of  the  Higgs  term  and  the  completely  chaotic  nature  of  the  YM  term.  However,  a 
striking  feature  that  emerges  in  our  investigations  is  that  it  is  not  true  that  an  increase 
in  the  Higgs  coupling  constant  regularizes  the  gauge  term  at  all  energies.  This  only 
happens,  in  fact,  for  small  values  of  the  energy;  in  this  sense,  the  result  is  counter- 
intuitive. 


2.  The  abelian  Chern-Simons-Higgs  theory 

2. 1   The  dynamical  system 

The  Lagrangian  density  of  the  (2+1)  dimensional  Chern-Simons-Higgs  system  is  given 
by 

m  .  2 

LQ  =  —  e^  pA^Fvp  +  KO/J,  —  ieAfjj^l    —  V(^>),  (1) 

2 

where  V(</>)  is  the  Higgs  potential  and  m  is  the  Chern-Simons  mass.  The  equations  of 
motion  are 

i  •  2 


-^.  (3) 

The  corresponding  dynamical  system  (CSH)  is  obtained  (as  described  in  the  preceding 
section)  by  taking  the  fields  to  be  spatially  homogeneous.  We  also  choose  the  gauge 
AQ  =  0,  for  convenience.  Further,  it  can  be  easily  seen  that  the  phase  of  ^  is  a  constant. 
Thus,  (f>  can  be  taken  to  be  real  without  loss  of  generality. 

The  equations  of  motion  for  Ai,A2  and  </>  then  become 

A2  =  --A102,  (4) 

m 

2 

A!--A202,  (5) 

m 

^Vi2+A22)0  =  -~.  (6) 

The  overhead  dot  refers  to  differentiation  with  respect  to  time.  From  these  equations 
it  follows  that  (Ai2+A22)  is  a  constant  of  the  motion.  Writing  AI  =  Ccos  %  and 
A2  =  Csin  x,  we  get 


m 


Hence  the  Chern-Simons-Higgs  field  theory  reduces  to  a  dynamical  system,  with  two 
degrees  of  freedom  4>  and  x  whose  equations  of  motion  are  as  given  above. 
The  Lagrangian  which  yields  these  equations  is 


where  px  is  a  Lagrange  multiplier,  and  also  the  canonical  momentum  corresponding  to  x- 
Also, 

&  =  (*-&.  (10) 

m 
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The  corresponding  Hamiltonian  is 

(11) 


so  that  the  equations  of  motion  are 

•  (I2) 


<px          m 

A  =  -f  =  o.  (15) 

Thus  we  see  that  ;?x  is  a  constant  of  the  motion.  Hence  CSH  has  two  degrees  of  freedom, 
and  two  integrals  of  the  motion,  H  and  px.  Therefore  the  Liouville-Arnol'd  criterion  for 
integrability  is  satisfied  by  this  DS.  The  Painleve  test  for  integrability  can  be  carried  out 
trivially  and  confirms  that  CSH  is  indeed  an  integrable  system. 

2.2  Maxwell-Chern-Simons-Higgs  theory 

We  now  turn  our  attention  to  the  situation  when  the  Maxwell  term  is  added  to  the  Chern- 
Simons-Higgs  field  theory.  The  corresponding  Lagrangian  is  given  by 

L  =  -\F^  +  ^A»FVP  +  |(dM  -  ieA^\2  -  v(0).  (16) 

Again,  choosing  the  gauge  AQ  =  0,  and  working  with  spatially  homogeneous  fields,  we 
see  that  0  can  be  taken  to  be  real,  without  loss  of  generality.  The  equations  of  motion 
describing  this  DS,  which  we  will  refer  to  as  MCSH,  are  as  follows: 

(17) 
(18) 

(19) 

This  system  clearly  has  three  degrees  of  freedom,  Ai,A2  and  </>.  The  Hamiltonian 
appropriate  to  this  system  is  given  by 

H  =  l[(p!  -  mA2)2  +  (p2  +  mA1}2}  +       +  e2(Al2  +  A22)02  +  V(0),      (20) 


where  plt  p2,  and  p3  are  the  canonical  momenta  corresponding  to  AI,  A2  and 
respectively. 

We  work  with  the  quartic  Higgs  potential 


(21) 


and  rename  A\  as  q\,  AI  as  qi,  and  $  as  #3.  Hamilton's  equations  then  become 

q\  =  pi  -mq2,  (22) 

q2  =  p2  +  /n?i ,  (23) 


/         .  \        ,~.   2          2  (r)c-\\ 

f  \      o  2        2  fo^N 

p2  =  tn(p\  —  mqi)  —  ^  tf2tf3  >  l^°y 

We  note  that  corresponding  to  the  MCSH  Hamiltonian,  p^  =  A2p\  —  A\p2,  is  a  constant 
of  the  motion.  This  feature  can  be  used  to  express  MCSH  in  terms  of  just  two  degrees  of 
freedom.  In  close  analogy  with  the  CSH  case,  we  now  define  A]  =A  cos  (  and 
A2  =  A  sin  £.  In  terms  of  A,  0  and  £,  Hamilton's  equations  read 

\  f^Q\ 

A=pA,  l/»J 

m2A,  (29) 

(30) 

-v2),  ^  (31) 

C  =  m-^,  (32) 

JC  =  0.  (33) 

This  system  has  three  degrees  of  freedom,  while  there  appear  to  exist  only  two  constants 
of  the  motion.  Hence  one  may  suspect  that  the  system  could  be  chaotic.  In  order  to  verify 
this,  the  Painleve  test  is  sketched  below.  We  first  recast  the  above  equations  in  the  form  of 
the  following  second-order  differential  equations  by  exploiting  the  fact  that  £  is  a  cyclic 
coordinate.  We  have 


-v2)  (34) 

and 

2 
A  =  -*A  +  -fc-2SA*?.  (35) 

In  terms  of  the  rescaled  variables  x  =  0/v,  y  =  A/v,  t'  =  evt,  C  =  p^2/e2v6,  A  =  X/2e2, 
and  B  =  m2/e2v2,  we  get  the  equations 

x=-xy2-Ax(x2-l),  (36) 

y         y    y3 

where  the  differentiation  is  with  respect  to  t1. 
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Figure  l(a).  Plot  of  #3  vs  p3  for  the  Maxwell  Chern-Sirnons-Higgs  system  with 
parameters  m  =  2,  A  =  2,  v2  =  8  and  e2  =  1  and  initial  conditions  q\  =  1 ,  qi  =  1 , 
«?3  =  0.5,;?!  =  I,p2  =  1  andp3  =  1. 
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Figure  l(b).  Plot  of  qj,  vs  ^3  for  the  Maxwell  Chern-Simons-Higgs  system  with 
parameters  m  =  2,  A  =  2,v2  =  8  and  e2  =  100  and  initial  conditions  #1  =  1,#2  =  1) 
4s  =  0.5,pi  =  l,p2  =  1  andp3  =  1. 


We  first  find  the  dominant  singularity  structure  by  setting  x  =  a(t  —  to}01  and 
y  =  b(t  —  r0)  ,  where  ct  and  ft  must  be  negative.  This  reveals  that  there  exist  two 
possibilities  for  the  leading  behaviour:  Case  (1):  a  =  —I, ft  =  -I, a2  —  — 1  and 
b2  =  A  -  2.  Case  (2):  a  =  -I,  /3  =  \  ±  ^/(1+ (16/*4)),  a2  =  -2/A  and  fc  is  arbitrary. 
For  both  cases,  the  resonance  analysis  reveals  that  the  Painleve  property  fails  for  all 
physical  regions  of  the  Higgs  potential,  thus  establishing  that  MCSH  is  not  integrable. 

2.3  Numerical  studies 

To  establish  the  chaotic  behaviour  of  the  MCSH  system  numerically,  we  report  below  the 
results  of  a  study  [19]  of  the  corresponding  phase  space  trajectories.  In  figures  l(a)  and 
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(b)  we  have  displayed  phase  plots  relating  to  MCSH.  These  are  plots  of  #3  vs  jp3  for  initial 
conditions  q\  =  1,  q2  =  1,  #3  =  0.5,  /?i  =  1,  p2  =  1  and  /?3  =  1.  The  parameter  e2  is 
chosen  differently  in  figures  l(a)  and  (b)  as  seen  in  the  corresponding  captions.  Further, 
while  the  CSH  system  is  integrable,  the  MCSH  plots  are  space-filling,  signalling  that 
MCSH  is  not.  Also,  it  is  clear  from  figures  l(a)  and  (b)  that  with  increase  in  the  value  of 
e2,  MCSH  becomes  more  space-filling.  A  study  of  the  maximal  Lyapunov  exponent  for 
MCSH  as  a  function  of  the  dimensionless  parameter  X/2e2  reveals  that  it  is  positive,  thus 
establishing  firmly  that  the  system  is  chaotic. 

3.  The  non-abelian  Chern-Simons-Higgs  theory 

3.1  The  pure  Chern-Simons-Higgs  system 

The  Lagrangian  for  the  non-abelian  5(7(2)  Chern-Simons-Higgs  field  theory  in  (2  -f-  1) 
dimensional  Minkowski  space  is  given  by 

L  =  f  ^  [F^l  -  1  eofcAjAjA*  ]  +  D^D^u  -  V(|0|)  (38) 

where 

f%  =  W*  -  &AJ  +  ge^fa  ,  (39) 

and 

a.  (40) 


Here  ra's  are  the  generators  of  the  SU(2)  algebra,  and  tr[rfl7/,]  =  28ab.  We  consider  a  real 
triplet  representation  for  the  Higgs  field,  the  gauge  AQ  =  0,  and  spatially  homogeneous 
fields  as  before.  Bold  face  letters  denote  vectors  in  SU(2)  space.  Corresponding  to  this 
mechanical  system  (NACSH)  the  Gauss'  law  constraint  is  as  given  below: 


x  A2  =  —  <&  x  <&.  (41) 

The  remaining  equations  of  motion  for  the  vector  field  are 

Ai  =  —  (  A2$2  -  $  A2  •  $)  ,  (42) 

j?2 

A2  =  -5-(Ai$2  -  $Aj  •  $).  (43) 

The  equation  of  motion  for  the  Higgs  field  is 

*  =  -/[(A2  -I-  A2)$  -  (A!  -  $A!  +  A2  -  *A2)]  -  ~  -  (44) 

From  the  above  we  can  check  that  (A2  +  Aj)  is  a  constant  of  the  motion.  The  canonical 
momenta  are 

pA,  =  mA2,  (45) 

PA2  =  -mAi,  (46) 
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and 

P^  =  24.  (47) 

Ir  (p  >.          / 

The  Hamiltonian  is  given  by 

[    9     PA  1                        2      (PA   '  ^*)2 1 
AT  H <fr  —  (A]  •  <&) — r H-  V(0).  (48) 

Hamilton's  equations  are 

2 

m2       '  ' 

i  P<$  ^£T  1  N 

$=2 '  ^     ' 

5V 

p^  =  -2,g2[(A2  +  A^)^  -  (Ai  •  $Ai  +  A2  •  $A2)]  —  —  .  (52) 

As  mA2  is  the  canonical  momentum  corresponding  to  AI  and  —  raAi  the  canonical 
momentum  corresponding  to  A2,  the  system  has  only  six  degrees  of  freedom.  It  appears 
that  H,  (A2  +  A2),  and  the  three  constants  of  the  motion  corresponding  to  the  Gauss'  law 
constraint  are  the  only  constants  of  the  motion,  for  NACSH. 

However,  for  the  system  to  be  integrable,  there  must  be  one  more  constant  of  the 
motion.  At  this  stage  it  is  not  clear  if  such  a  constant  exists.  We  therefore  carry  out  the 
Painleve  test  for  integrability,  choosing  the  scalar  Higgs  potential  to  be  a  quartic  of  the 


form  V(0)  =  (A/4)(02-v2) 

It  is  clear  that  each  of  the  three  components  of  the  gauge  fields  AI  and  A2  obeys  a 
first-order  differential  equation  and  that  each  of  the  three  components  of  the  Higgs  field 
obeys  a  second-order  differential  equation.  So  we  require  twelve  arbitrary  constants  in  the 
local  representation  of  the  general  solution,  if  NACSH  were  integrable. 

Carrying  out  the  Painleve  analysis,  we  first  determine  the  leading  behaviour  by  setting 

A,=ai(f-fo)ai,  (53) 

A2=a2(f-for,  (54) 

^a^f-fo)0*.  ,  (55) 

We  find  that  the  leading  behaviour  is  given  by 

Ai=/3ia,  (56) 

A2  =  /32a,  (57) 

*  =  a(f-f0)"1.  (58) 

with  a2  =  -4/A. 
For  the  resonance  analysis  we  consider  the  next-to-leading  order  singularity  by  setting 

(59) 
(60) 
=  Si(t-torl+r)r(t-tQy-1.  (61) 
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and 

*  =  *(t-tQrl[l+q(t-t0r1].  (64) 
From  the  general  expansion  about  the  leading  behaviour,  we  find  that 

A,=/3,a,  (65) 

A2  -  /32a,  (66) 

*  =  a(;  -  r0r]  [1  +       <?/('  -'o)'],  (67) 


where  a2,  and  all  the  #,-'s  except  94,  are  determined.  The  arbitrary  constants  are  ?o,/3\,  fa, 
#4  and  the  direction  of  a.  Thus  we  have  six  arbitrary  constants,  while  twelve  are  needed 
for  integrability.  Hence  the  Painleve  test  indicates  that  NACSH  is  chaotic. 

3.2  The  Yang-Mills-Chern-Simons—Higgs  system 
The  Lagrangian  for  YMCSH  is  given  by 

L  =  -^M™  +f  ^a^  -  m*i)- 

(68) 
Corresponding  to  this  Lagrangian  the  equations  of  motion  are 

D^"""  +  m^Flp  =  felD-y  r.0  -  ^r0D>],  (69) 

W  =  -|£.  (70) 

The  v  =  0  component  gives  the  following  Gauss'  law  constraint 

l(Ai  x  AI  +  A2  x  A2  +  2mAj  x  A2)  =  -$  x  6.  (71) 

Once  again  YMCSH,  the  DS  in  this  case,  is  obtained  by  considering  spatially 
homogeneous  fields.  Further,  setting  AQ  =  0,  the  equations  of  motion  corresponding  to 
the  gauge  fields  become 


A!  +  2mA2  +  2g2(Aj$2  -  ^Al  •$)  +  g2(Aj  A2-  A2  -  A2A!  •  A2)  =  0, 

(72) 

A2  +  2mA!  +  2g2(A.2$2  ~  $A2-$)  +  ^(A2Ai  -A!  -  AiAj  •  A2)  =  0. 

(73) 

One  constant  of  the  motion  for  YMCSH  is  seen  to  be  A2-  AI  —  AI  •  A2  +  m(Ai2+  A22). 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  H) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  627 


The  three-component  Higgs  field  obeys  the  following  equation  of  motion: 

$  -  (Ar*A,  +  A 


(74) 


We  therefore  see  that  there  are  nine  second-order  differential  equations  for  YMCSH, 
one  for  each  of  the  three  components  of  AI,  A.%  and  <3?.  This  means  that  if  the  system 
were  integrable,  there  must  exist  18  arbitrary  constants  in  the  Painleve  analysis,  which 
may  be  carried  out  along  lines  identical  to  that  in  the  case  of  NACSH.  It  turns  out  that 
YMCSH  does  not  pass  the  Painleve  test  for  integrability,  either. 


4.  Numerical  studies  on  the  non-abelian-Chern-Simons-Higgs  system 

In  contrast  to  the  abelian  case,  we  have  seen  that  the  non-abelian  counterpart  is  not 
integrable,  even  in  the  absence  of  the  YM  term.  To  support  this  fact  numerically  [20]  we 
present  in  figures  2(a)-(d),  phase  plots  corresponding  to  NACSH.  The  calculations  were 
performed  using  a  Runge-Kutta  fourth  order  algorithm  with  adaptive  step  size.  In  the 
case  of  NACSH,  since  AI  and  A2  are  conjugate  to  each  other,  we  need  to  consider  only 
AI,  A2,  $  and  its  canonical  conjugate  p^.  Renaming  these  fields  for  convenience  as  x,  y, 
$  and  g  respectively,  we  display  in  figures  2(a)  and  (b),  plots  of  0i  vs  g\  respectively 
with  initial  conditions  x  =  y  =  (0,  1,  1)  and  $  =  g  =  (1,  0,  1).  This  choice  of  initial 
conditions  automatically  satisfies  the  Gauss'  law  constraint.  The  parameter  values 
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Figure  2(a).    Plot  of  fa  vs  gi  for  the  NACSH  system  with  parameters  A  =  v2 
=  m  =  g2  =  1  and  the  initial  conditions  x  =  y  =  (0, 1, 1)  and  $  =  g  =  (1 , 0, 1). 
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Figure   2(b).     Plot   of   fa    vs   gi    for   the   NACSH   system   with   parameters 
X  =  v2=m=l    and   g1  —  10   and   the   initial   conditions   x  =  y  =  (0,1,1)    and 
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Figure  2(c).  Plot  of  fa  vs  g\  for  the  NACSH  system  with  parameters 
A  =  v2=g2  =  l  and  m  =  10  and  the  initial  conditions  x  =  y=  (0,1,1)  and 
$  =  g=  (1,0,1). 

•esponding  to  these  plots  are  A  =  v2  =  m  =  1 .  However,  in  figure  2(a)  g2  =  1  and  in 
)  g2  =  10.  Clearly,  while  the  general  pattern  is  the  same  in  both  cases,  the  plots  get 
ser  as  g2  increases.  In  figure  2(c),  we  display  <f>\  vs  g\  for  the  same  initial  conditions 
parameter  values  except  that  g2  =  1  and  m  =  10.  From  figures  2(a),  (b)  and  (c)  it  is 
IT  that  as  m  increases  some  "regularity"  in  the  behaviour  of  the  phase  plot  is  brought 
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Figure  2(d).    Plot  of  fa  vs  gi  for  the  NACSH  system  with  parameters  v   =  m 
=  g2  =1  and  A  =  500  and  the  initial  conditions  x=y=  (0,  1,  1)  and  $=  g  =  (1,  0,  1). 

about  (in  the  sense  that  it  is  not  so  space-filling).  Thus,  for  large  m  the  system  is  close  to 
being  integrable.  In  figure  2(d)  the  plot  of  <p\  vs  g\  is  displayed  with  all  initial  conditions 
and  parameter  values  the  same  as  in  figure  2(a)  except  that  A  =  500.  The  figure  suggests 
that  as  A  increases  there  is  a  tendency  towards  regularity. 

We  now  turn  our  attention  to  YMCSH.  Without  displaying  the  phase  plots  we  comment 
that  as  in  the  NACSH  case,  an  increase  in  g2  makes  the  phase  plots  more  space-filling. 
This  is  to  be  expected  as  YM  is  anyway  a  chaotic  system.  While  an  increase  in  A  brings 
about  more  'regularity'  in  the  Higgs  sector,  it  seems  to  have  no  significant  effect  on  the 
gauge  sector. 

We  now  report  on  results  obtained  in  the  non-abelian  case  by  means  of  an  extensive 
study  [20]  of  the  Lyapunov  exponents.  We  first  consider  the  NACSH  case.  In  order  to 
compare  our  results  with  those  in  [18],  we  must  scale  the  NACSH  variables  in  (56-58),  as 
follows: 


A2 


(75) 
(76) 

(77) 


This  reduces  the  number  of  parameters  from  three  to  one,  and  correspondingly  the 
equations  of  motion  to 

I],  (78) 

*)],  (79) 

=  -[(A2  +  A2)$  -  (Ar$Ai  +  A2-$A2)j  --$($2  -  v'2)  (80) 
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with  K  —  Xm/g2.  Further,  without  loss  of  generality,  we  set  the  scaled  variable  v'  to  be 
unity.  The  scaled  Lagrangian  becomes 

L=(Ai-A2  -  A2-Ai)  +<I?2  -  [(A2  +  A2,)*!?2  —  (Ap<I>)2  -  (A2-$)2] 

-^($2-l)2.  (81) 

The  corresponding  energy  function  is  given  by 

E-  $2+  [(A2  +  A2.)'!*2  -  (Ai-$)2  -  (A2  -$)2]  -l--($2  -  I)2.  (82) 

In  the  YMCSH  situation,  using  the  same  scaling  as  above  we  have  the  following 
equations  of  motion: 

1   ••  0 

-  A,  +  2A2  +  2(Aj$2  -  $Ai  •$) 

m 

+  i(A,A2.A2-A2ArA2)  =  0,  (83) 

-A2-2A,+ 
m 

+  -  ( A2At  -AJ  -  A,  Ai  •  A2)  =  0  (84) 

m 

and 

$  =  -[(A2  +  A2)$  -  (AI-<&AI  +  A2-(I:)A2)]  —  —$($2  -  1).  (85) 

In  contrast  to  NACSH  there  still  remain  two  parameters  here,  namely,  K,  and  m.  The 
corresponding  scaled  Lagrangian  is 


+  [(Aj  +  A?,)$2  -  (Ai  •$)*  -  (A2-$)z]  -  -  ($2  -  I)2  (86) 

while  the  energy  function  is  given  by 

2m      l         2  2m      [     1 

+  [(A2  +  A2)*2  -  (Ai  -$)2  -  (A2-$)2]  +  j  ($2  -  I)2.  (87) 

We  have  examined  the  variation  of  the  maximal  Lyapunov  exponent  as  the  values  of 
energy  and  K,  are  changed,  for  a  large  range  of  initial  conditions.  These  conditions  were 
chosen  to  satisfy  the  Gauss'  law  constraint.  Choosing  a  one-variable  ansatz  of  the  form 

/x\  /o\  /_*\  /,' 

A,=      0        A2=     ,        *=       ,         *-i     x|  (88) 

\oy  \o/          V  o  I         *  \o 

for  NACSH  and  supplementing  them  with 

•  (Q\ 

A!  =      0        A2  =  |  0  |  (89) 

\o/ 
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Figure  3(a).    Variation  of  the  maximal  Lyapunov  exponent  with  energy  for  NACSH 
for  K  =  0. 
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Figure  3(b).    Variation  of  the  maximal  Lyapunov  exponent  with  energy  for  NACSH 
for  K  =  5. 

for  YMCSH,  we  have  varied  x  to  obtain  several  initial  conditions  in  the  energy  region  of 
interest.  In  figures  3(a)  and  (b),  we  show  the  behaviour  of  the  maximal  Lyapunov 
exponent  as  a  function  of  energy  for  K  =  0  and  5.  For  K  =  0,  i.e.,  in  the  absence  of  the 
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Figure  4.  Comparison  of  maximal  Lyapunov  exponent  vs  energy  for  YMCSH  for 
the  one- variable  ansatz  for  K  =  1,  m  =  1  (*  represents  the  curve)  with  K,  =  10,  m  =  1 
(+  represents  the  curve). 


Higgs  potential,  the  system  is  mostly  chaotic  as  seen  from  the  positive  value  of  the 
Lyapunov  exponent,  except  for  one  window  of  regularity,  in  the  energy  region  7  <  E  <  9. 
While  more  transitions  from  order  to  chaos  appear  as  K,  increases,  the  situation  becomes 
dramatic  for  large  K  as  in  figure  3(b),  where  oscillatory  order-chaos  behaviour  appears 
even  from  E  =  3  onwards.  This  is  the  effect  of  an  increase  in  the  coefficient  m  of  the 
topological  term.  Further,  we  see  that  while  order-chaos  transitions  occur  on  all  scales  in 
YMCS  [4],  this  is  not  true  in  NACSH.  This  suggests  that  the  quartic  YM  coupling  is 
responsible  for  this  behaviour  (the  occurrence  of  order-chaos  transitions  on  all  scales).  A 
more  detailed  analysis  of  the  YMCSH  system  is  therefore  called  for,  and  we  proceed  to 
report  on  these  numerical  investigations. 

Figure  4  shows  the  variation  of  the  maximal  Lyapunov  exponent  as  a  function  of 
energy  for  «  =  1  and  10  and  with  m  =  1.  The  graphs  reveal  that  for  large  K  (be  it  due  to 
large  Higgs  coupling  A  or  small  YM  coupling  g)  the  system  exhibits  more  regularity  for 
low  energies  (in  the  sense  of  an  almost  vanishing  maximal  Lyapunov  exponent).  The 
absence  of  order-chaos  transitions  on  all  scales  in  YMCSH  shows  that  the  quartic  YM 
coupling  is  not  responsible  for  this  feature  observed  in  the  YMCS  system.  Also,  our 
results  are  to  be  contrasted  with  earlier  work  [1, 18,24]  in  which  it  is  reported  that  a 
YMH  system  becomes  more  regular  as  A  increases,  regardless  of  the  energy  regime 
investigated. 

We  have  determined  the  fraction  of  phase  space  that  is  regular  by  considering  [21] 
about  100  initial  conditions  for  energies  ranging  from  1  to  10.  We  find  that  for  small  K, 
NACSH  becomes  chaotic  more  or  less  monotonically  as  the  energy  is  increased.  For  large 
ft,  however,  there  is  no  such  simple  behaviour.  For  the  NACSH  system,  all  the 


Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  H) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


633 


calculations  in  this  regard  were  carried  out  with  a  straightforward  Runge-Kutta  fourth 
order  routine,  preserving  constants  of  the  motion  to  an  accuracy  of  one  part  in  105. 
However,  for  calculations  of  exponents  for  YMCSH  an  adaptive  step-size  Runge-Kutta 
routine  was  necessary  to  ensure  energy  conservation  to  the  same  degree  of  accuracy  as  in 
the  case  of  NACSH. 


5.  Discussion  and  conclusions 

From  the  numerical  studies  reported  here  on  the  abelian  and  non-abelian  Chern-Simons- 
Higgs  systems,  we  arrive  at  some  very  interesting  conclusions  relating  to  the  rich  variety 
of  phase  space  behaviour  in  these  systems,  some  of  these  being  counter-intuitive.  The 
boundedness  of  the  motion  in  phase  space  seems  to  be  a  common  feature  for  both  the 
YMH  and  NACSH  systems.  This  is  in  contrast  to  the  situation  in  the  abelian  Higgs 
system.  This  feature  is  thus  independent  of  the  details  of  -the  non-abelian  nature  of  the 
couplings,  i.e.,  whether  it  is  of  the  YM  type  or  the  CS  type. 

Contrary  to  what  we  may  expect  intuitively,  it  is  not  true  that  an  increase  in  K 
"regularizes"  the  gauge  term  at  all  energies.  If  the  increase  in  K  is  due  to  one  in  the  Higgs 
coupling,  the  degree  of  regularity  may  be  intuitively  expected  to  increase;  but  if  the 
increase  in  K  is  due  to  a  decrease  in  the  gauge  coupling,  it  is  not  established  that  further 
regular  islands  appear  for  all  small  (non-zero)  values  of  the  gauge  coupling.  As  borne  out 
by  our  studies,  however,  an  increase  in  the  scaled  parameter  K  enhances  the  regularity 
only  for  small  energies  [21].  This  feature  can  be  understood  better  if  we  realise  that 
appearance  of  regularity  also  depends  on  the  nature  and  availability  of  the  phase  space. 
When  more  regions  in  phase  space  become  accessible,  it  is  plausible  that  randomness 
could  increase  and  the  KAM  tori  structure  could  get  further  broken. 

Another  aspect  is  that  as  the  energy  increases,  the  maximal  Lyapunov  exponent 
increases  in  magnitude  in  the  YMCSH  system.  Thus  the  YM  term  takes  over  for  large 
energies  and  the  CS  term  produces  the  "oscillatory"  effect  between  regularity  and  chaos. 

While  large  windows  of  regularity  exist  in  NACSH,  this  is  not  true  for  YMCSH.  This 
could  perhaps  be  traced  back  to  the  interplay  between  the  "oscillatory  effect"  of  the  CS 
term,  the  "regularizing  effect"  of  the  Higgs  term,  and  the  completely  chaotic  signature  of 
the  YM  term. 

The  message  from  our  investigations  is  as  follows.  In  contrast  to  systems  with  a  few 
degrees  of  freedom,  relatively  complex  systems  of  the  sort  we  have  considered,  governed 
by  non-trivial  symmetries,  can  display  unusual  and  rich  dynamical  behaviour,  thanks  to 
the  intricate  interplay  between  the  various  coupling  constants.  As  even  CSH  (the  simplest 
DS  that  we  have  considered)  shows,  its  integrable  nature  is  lost  on  including  a  linear 
coupling  to  the  Maxwell  term  (to  yield  MCSH).  As  the  number  of  degrees  of  freedom 
increases,  the  intricacies  and  the  chaotic  nature  of  phase  plots  increases,  as  may  be 
expected.  The  next  step  would  be  the  study  of  the  case  of  a  full-fledged  field  theory, 
where  it  would  be  necessary  to  identify  good  quantifiers  of  chaos.  At  the  present  stage  our 
understanding  of  the  connections  (if  any)  between  the  analytical  tests  of  integrability  like 
the  ARS  conjecture  and  numerical  methods  as  in  the  Fermi-Pasta-Ulam  approach  is 
rather  poor.  Further,  several  problems  relating  to  the  significance  of  chaos  in  field 
theories  quantized  using  semi-classical  quantization  prescriptions  remain  open.  The  role 
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of  chaos  in  field  theories  that  support  topological  solutions  is  not  understood  completely, 
although  some  interesting  results  have  been  obtained  (see,  for  example,  [11,  25]);  and 
finally,  the  implications  of  the  lack  of  integrability  in  quantum  field  theories  are  yet  to  be 
examined. 
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Chaos  in  Josephson  junctions 
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Abstract.  A  detailed  analysis  of  the  control  space  characterization  of  phase  locked  states  and 
chaotic  attractors  in  Josephson  junctions  is  presented,  based  on  a  model  that  includes  both  quadratic 
damping  and  cosine  interference  terms.  In  addition,  some  novel  features  of  the  nonlinear 
characteristics  of  the  junction  like  evolution  of  basin  boundaries,  bifurcation  structure  analysis  and 
scaling  behaviour  of  Lyapunov  exponent  are  discussed. 

Keywords.  Josephson  junctions;  chaos;  fractal  basin  boundary;  phase  locking;  Lyapunov 
exponent  and  scaling  index. 

PACS  Nos    05.40;  05.45;  74.40;  85.25 

1.  Introduction 

During  the  past  two  decades,  Josephson  junction  (JJ)  has  served  as  a  powerful  paradigm 
in  understanding  low  dimensional  chaos  in  nonlinear  systems.  This  is  mainly  because  the 
equation  modeling  the  dynamics  of  the  junction,  in  its  simplest  form,  represents  a 
minimal  system  for  the  observation  of  chaos  and  it  is  isomorphic  to  a  variety  of 
physically  interesting  situations  like  the  forced  pendulum,  pinned  charge  density  waves 
and  phase  locked  loops.  Moreover,  it  is  a  remarkably  fertile  field  exhibiting  a  large 
variety  of  nonlinear  effects  ranging  from  phase  locking  and  sub  harmonic  generation  to 
chaotic  transitions  of  different  kinds  like  period  doubling,  crisis  etc.  Further,  the  nonlinear 
behaviour  in  Josephson  junctions  is  easily  accessible  for  experimental  measurements  in 
addition  to  circuit  simulations  and  numerical  analysis.  This  system  is  well  known  to  be 
technologically  important  due  to  its  possible  applications  in  many  electronic  devices  like 
parametric  amplifiers,  local  oscillators,  mm  wave  detectors  and  voltage  standards. 

Experimental  observation  of  chaos  in  these  systems  dates  back  to  1968  [1].  Historically 
speaking,  JJ  oscillators  revealed  a  striking  noise  rise  phenomena  that  could  not  be 
explained  in  terms  of  thermal  fluctuations.  In  superamps,  an  increase  in  the  amplitude  of 
the  driving  signal  led  to  broad-band  voltage  fluctuations  with  an  equivalent  noise 
temperature  of  w  104K  [2].  Huberman  and  his  group  suggested  that  this  can  be 
understood  in  terms  of  chaotic  solutions  of  the  nonlinear  junction  dynamics  [3,4].  Later 
the  period  doubling  phenomena  was  observed  above  a  critical  voltage  in  tunnel  junctions 
[5]  and  irregularities  in  the  I-V  characteristics  of  the  junction  were  reported  as  signatures 
of  chaos  [6-8].  Experimental  results  on  deterministic  chaos  in  JJ  [9]  and  nonlinear 
properties  of  the  junction  dynamics  [10],  have  been  recently  reviewed. 

Apart  from  experimental  studies,  theoretical  investigations  have  been  carried  out  in  JJ 
based  on  equations  that  model  the  dynamical  characteristics  of  the  junction.  The  most 
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popular  and  well  studied  of  such  models  is  the  RSJ  model  or  the  Stewart-McCumber 
model  [11]  in  which  the  resistance  R  that  accounts  for  the  current  of  quasi  particles  and 
the  self  capacitance  C  of  the  junction  are  shunted  across  the  device.  Then  the  response  of 
the  junction  to  dc  and  rf  biasing  can  be  represented  by  an  equivalent  circuit.  In  simple 
models  the  resistance  R  is  taken  to  be  independent  of  the  junction  voltage  v(t).  Survey  of 
chaos  in  JJ  centered  around  this  model  through  digital  simulations,  numerical 
investigations  and  circuit  realizations  of  the  modeling  equations,  abound  in  the  literature 
[12-14].  Such  studies  could  identify  regions  in  the  parameter  space  of  the  junction  where 
nonlinearity  generates  the  most  complex  behaviour.  In  the  chaotic  regime,  a  variety  of 
intermittent  regions  [15]  are  observed  after  a  complete  period  doubling  sequence.  In 
actual  experiments  such  interesting  and  complicated  structure  are  usually  washed  out  by 
thermal  noise  [7],  which  induces  dynamics  very  much  like  chaos  called  thermally 
induced  chaos  [16]. 

A  few  studies  reported  in  later  years  based  on  some  modified  models  are  worth 
mentioning,  the  major  modifications  being  in  the  damping  term  of  the  equation.  If  in  a 
current  biased  junction,  the  shunted  resistance  R  is  taken  to  be  nonlinear  and  vary  as 
l/v(f),  the  damping  term  in  the  equation  of  motion  becomes  quadratic  [17, 18].  Another 
type  of  nonlinear  damping  mechanism  arises  when  the  conditions  are  such  that  the 
interference  effects  between  the  pair  and  quasi-particle  current  should  be  taken  into 
account.  This  adds  a  cosine  term  to  the  usual  damping  term  [19, 5].  Studies  in  the  linear 
RSJ  model  with  the  cosine  term  reveal  that  its  presence  sensitively  governs  the  responses 
of  the  system  [20,21].  Since  chaotic  responses  of  the  system  arise  due  to  the  nonlinear 
effects  of  the  system,  we  feel  it  is  relevant  to  include  both  these  for  a  complete  analysis  of 
the  possible  modes  of  the  system. 

In  this  paper,  we  report  the  results  of  such  a  study  on  a  quadratic  RSJ  model  with 
cosine  interference  terms.  We  focus  our  studies  along  four  main  tracks,  viz. 
characterization  of  phase  locked  states  and  chaotic  regimes,  bifurcation  structure  studies 
to  bring  out  the  relevance  of  the  dc  bias  and  cosine  terms,  evolution  of  the  basin 
boundaries  when  the  system  asymptotes  to  multiple  attractors  and  the  scaling  behaviour 
of  the  order  parameter  for  the  transition  from  periodicity  to  chaos. 

In  §  2,  we  discuss  the  salient  features  of  the  junction  treated  as  a  dynamical  system  in 
the  quadratically  damped  model  with  the  cosine  term.  We  then  proceed  to  a  preliminary 
search  procedure  for  the  transition  from  periodicity  to  chaos  using  Melnikov  analysis,  to 
provide  lowest  threshold  for  the  onset  of  chaos  that  can  be  prescribed  using  the 
unperturbed  solutions  in  terms  of  the  relevant  parameters  of  the  system.  The  JJ  as  a 
dynamical  system  turns  out  to  be  a  system  with  multiple  attractors  that  occur  both  in  the 
same  well  and  also  in  different  potential  wells.  Each  attractor  will  have  its  own  separate 
basin  of  attraction  and  the  nature  of  evolution  of  the  boundary  between  them  forms  an 
interesting  study.  We  discuss  the  evolution  of  basin  boundaries  as  the  system  develops 
period  doublings,  crises  and  tangles  in  §  4. 

Section  5  presents  the  results  of  a  numerical  analysis  with  a  view  to  isolate  the 
windows  of  phase  locked  states  in  control  parameter  space  are  included.  The  chaotic 
regions  are  then  marked  out  according  to  their  'degree'  of  chaos  that  is  estimated  based 
on  the  magnitude  of  the  Lyapunov  exponents  (LE).  The  nature  of  the  scenario  leading  to 
chaos  is  studied  in  the  following  section  by  plotting  the  detailed  bifurcation  structures 
with  the  frequency  of  the  drive  as  the  tuning  parameter.  The  crossover  from  periodic  to 
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chaotic  behaviour  is  signalled  by  the  change  of  the  maximal  LE  from  negative  to  positive 
value.  Hence  it  has  the  status  of  an  order  parameter  in  the  transition.  The  scaling 
behaviour  of  the  LE  is  therefore  relevant,  in  general,  from  a  phase  transition  point  of 
view.  We  numerically  compute  the  LE  near  the  transition  point  for  different  types  of 
scenarios  that  occur  in  the  present  system  such  as  period  doubling,  intermittency  between 
phase  locked  states,  crisis  etc.  The  results  are  given  in  §7,  and  compared  with  numerical 
work  based  on  other  models,  and  with  available  experimental  data,  in  the  concluding 
section. 


2.  Josephson  junction  as  a  dynamical  system 

The  Josephson  junction  is  an  SIS  junction  consisting  of  two  superconducting  electrodes 
separated  by  a  tunnelling  barrier.  The  tunnelling  of  electron  pairs  produces  a 
dissipationless  current  ls,  which  flows  at  zero  bias  voltage  and  given  by  [22] 

j;v(r)  =  /osinjc(0,  (2.1) 

/n,  (2.2) 


where  x(t)  is  the  phase  difference  between  the  super  conducting  order  parameters  on  both 
sides  of  the  junction,  /o  is  the  maximum  supercurrent  that  is  geometry,  temperature  and 
material  dependent,  v(t)  is  the  voltage  across  the  junction.  In  addition  to  this  current  of 
Cooper  pairs,  for  nonzero  voltages,  there  is  a  current  of  quasi  particles  modelled  as  a 
current  through  the  ohmic  resistance  R\  and  the  interference  effects  are  taken  care  of  by  a 
term  proportional  to  cos  x(t)  through  a  resistance  RI.  Under  a  dc  and  rf  drive,  the 
equation  of  motion  can  thus  be  written  as  [23] 

cos  *  v  +  /o  sin  *  =  /dc  +  /i  sin  fir.  (2.3) 


df      L/?i(v) 

We  introduce  scaled  variables  and  dimensionless  parameters  in  the  following  way.  t  is 
replaced  by  CJQ*,  where  LUQ  is  given  by 

uJ  =  (2*/0)/(;»C).  (2.4) 

The  resistance  RI  is  taken  as  a  constant  in  the  simple  RSJ  model.  But  the  general 
expression  for  l/R\  as  given  in  [24]  shows  that  its  dependence  on  v  is  through  an 
exponential  function  such  that  as  an  approximation,  we  can  take 


and 


v  = 


(2.5) 


If  we  define 


,   /  =  /dc//o,   A  =  /i//0     and    u  =  H/WQ,     (2.6) 


equation  (2.3)  then  becomes 


d/2 


d* 

dt 


—  ecos  x)  +sin  x  = 


(2.7) 
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dt 

^3  ,~  B> 

_  =  u,  (2.8) 

The  control  parameter  space  of  the  system  (2.8)  is  5  dimensional,  spanned  by 
(Jt,£,/,A,a>).  At  the  origin  of  this  parameter  space,  the  system  is  isomorphic  to  the 
classical  nonlinear  pendulum  with  the  same  phase  space  structure  as  the  pendulum  in  the 
(x\  —  x-i]  phase  plane.  By  analogy  with  the  pendulum,  we  characterize  the  periodic 
modes  of  the  system  by  (p,  q)  so  that  p  =  0  corresponds  to  the  oscillatory  modes  and 
p  ^  0  are  rotational  modes,  q  specifies  the  periodicity  of  the  mode  relative  to  the  rf  drive, 
while  p  gives  the  number  of  rotations  made  in  one  period.  The  qualitatively  different 
trajectories  of  these  two  modes  are  separated  by  the  separatrix  orbit.  The  fixed  points  of 
the  system  alternate  along  the  xi-axis,  the  stable  ones  or  centres  at  (2n7r,0)  and  the 
unstable  ones  or  saddles  at  ((In  +  I)TT,  0),  for  integer  n  values.  The  stable  manifold  Ws 
and  the  unstable  manifold  W1  of  the  separatrix  connect  the  saddles  at  (mr,  0)  and 
(—  n-TT,  0).  It  is  easy  to  see  that  the  above  dynamics  has  an  infinite  series  of  potential  wells 
with  minima  at  (2n?r,  0)  and  maxima  at  ((In  +  I)TT,  0)  as  its  background  [25]. 

When  the  control  parameters  are  small  enough,  (2.8)  can  be  considered  as  a 
perturbation  of  the  pendulum  structure  discussed  above.  Then  W1  and  Ws  of  the  saddles 
reorganize  themselves  and  may  touch  each  other  resulting  in  a  heteroclinic  tangency.  The 
threshold  value  for  this  to  occur  can  be  computed  analytically  by  the  Melnikov  analysis 
[26].  The  separation  between  Ws  and  W11  is  measured  by  the  Melnikov  function  Af  (fo)  in 
terms  of  the  solutions  of  the  unperturbed  separatrix,  ;tio  and  XIQ.  (?o  here  refers  to  the 
initial  time  that  fixes  the  particular  Poincare  section  where  the  trajectories  are  visualized.) 
The  vanishing  of  M(f6)  then  provides  the  required  threshold.  For  the  system  in  (2.8),  we 
have  [27] 


X\Q  =  ±2tan~1(sinhr)     and    x2o  =  ±2sechr,  (2.9) 

where  r  =  t  -  f0  and 

/oo 
X2o[-kx^Q(l  -ecosxio)  +Asinu(t-  /0)  +7]  dr.  (2.10) 

•oo 


Evaluating  the  integral  using  (2.9)  we  get 

M(r0)  =  k(±2n£  T  4?r)  ±  1-nI  ±  2?r  A  sech(7rw/2)  sin  ut0.  (2.1  1) 

The  threshold  value  for  the  parameter  A  for  tangency  is 

AM  =  |fc(2  -  e)  -  7|  cosh(7ruy  2)  .  (2.  12) 

As  A  increases  beyond  AM,  the  Ws  and  W1  of  the  separatrix  accumulate  on  each  other  due 
to  their  transverse  intersections,  forming  a  heteroclinic  tangle.  Thus  there  are  Smale 
horseshoes  in  the  underlying  dynamics,  although  the  existence  of  horseshoe  does  not  in 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  II) 
640  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


most  cases  imply  asymptotic  chaos.  The  initial  conditions  which  are  near  the  invariant  set 
of  the  horseshoe  may  experience  a  chaotic  transient  but  will  ultimately  settle  down  to 
motion  on  an  attractive  periodic  orbit.  On  the  other  hand,  if  the  points  are  eventually  fed 
back  into  the  same  region,  a  chaotic  attractor  will  be  formed.  Near  the  tangle  therefore  the 
system  develops  a  sensitivity  to  initial  conditions. 

The  dependence  of  the  threshold  on  other  relevant  parameters  can  be  deduced  from 
(2.12).  In  the  absence  of  the  dc  bias,  the  threshold  for  a  given  frequency  w,  is  generally 
higher  and  therefore  a  nonzero  value  of  /  should  bring  down  the  transition  curve  for  onset 
of  chaos  in  the  A  —  uj  plane.  This  has  been  numerically  verified  in  a  previous  study  [27]. 
So  also,  by  fixing  7  at  0.5  and  £  =  0.8,  the  threshold  is  minimum  for  k  =  0.4 
corresponding  to  moderate  damping  in  the  system.  When  e  is  not  included,  however,  the 
minimum  occurs  for  low  damping.  Similarly  for  e  —  0.8  and  k  —  0.4,  the  threshold  has  its 
minimum  at  /  =  0.5.  We,  therefore  restrict  our  studies  in  subsequent  sections  to 
parameter  values  k  =  0.4,  e  =  0.8  and  /  =  0.5,  since  this  is  presumably  the  most 
favourable  choice  for  onset  of  chaotic  behaviour  in  the  system. 


3.  Fractal  basin  boundaries 

It  is  clear  from  the  discussion  in  the  previous  section  that  the  system  in  (2.8)  has  multiple 
attractors.  The  attractors  are  all  simple  fixed  points  when  there  is  no  perturbation  and  the 
separatrix  defines  the  smooth  boundary  of  their  basins.  If  we  concentrate  on  the  central 
potential  well,  the  oscillatory  modes  developed  under  forcing  will  have  bounded  basins 
around  (0,0).  The  escape  from  the  well  normally  leads  to  p  ^  0  modes  which,  with  the 
damping  may  settle  down  to  p  —  0  modes  in  distant  wells  or  cross  well  oscillations  as 
well  as  chaotic  attractors.  The  nature  and  pattern  of  basin  changes  in  each  well  should  be 
the  same  but  when  the  central  well  is  under  consideration,  the  basin  of  attractors  in  other 
wells  form  part  of  the  escaping  basin.  Once  the  threshold  for  tangle  is  reached,  the 
smooth  boundary  starts  developing  structures  and  becomes  a  fractal,  even  when  the 
asymptotic  state  is  a  periodic  mode  in  a  distant  well.  A  priori  knowledge  about  how 
attractor  basins  evolve  is  of  value  when  we  address  the  question  of  control  mechanisms. 
So  also,  the  extent  of  basin  erosion  due  to  forcing  can  tell  us  how  far  the  system  has 
chances  of  surviving  in  a  noisy  environment  and  this  is  relevant  to  stability  related 
engineering  problems  [28,29]. 

Our  previous  studies  of  (2.8)  indicates  that  for  small  uj  values,  the  stable  mode  excited 
is  a  (0, 1)  one  which  we  call  Sn-  Choosing  a  window  of  phase  space  defined  by  0  <  x\  <  8 
and  — 3  <  ^2  <  3,  we  analyse  the  bounded  basin  of  this  mode  numerically.  With  k  =  0.1, 
cu  =  0.3,  /  =  0.0  and  e  =  0.0,  we  try  to  see  how  the  basin  evolves  as  A  is  slowly  tuned. 
For  this,  the  system  in  (2.8)  is  integrated  numerically  using  fourth  order  Runge-Kutta- 
Gills  (RKG)  algorithm  and  checked  for  the  required  periodicity.  For  the  central  well  (0, 1) 
modes,  the  x^  values  averaged  over  a  cycle  is  zero  and  —TT  <  xi  <  TT.  The  integrations  are 
repeated  for  a  grid  of  80  x  60  initial  conditions  and  the  asymptotic  state  identified  after 
100  drive  cycles  using  the  above  information.  The  initial  points  are  then  colour-coded  to 
distinguish  the  escaping  and  bounded  basins  [30]. 

For  the  set  of  basin  portraits  displayed  in  figure  1  the  bounded  basin  of  Sn  is  in  red 
while  the  escaping  basin  belonging  to  attractors  in  other  wells  is  yellow.  As  we  increased 
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Figure  1.  The  bounded  basin  for  the  nonresonant  (0, 1)  mode  (red)  in  the  central 
well  and  the  escaping  basin  (yellow)  belonging  to  attractors  in  other  wells  for 
increasing  values  of  A  viz.  0.2,  0.5  and  0.8. 


Figure  2.  The  sharing  of  the  bounded  basin  by  the  coexisting  5,,  and  Sr  modes.  With 
(jj  =  0.6  as  A  is  tuned  from  0.3  to  0.34,  the  basin  of  the  latter,  indicated  by  the  blue 
colour,  gets  reorganized,  while  that  of  the  former  suffers  a  progressive  shrinking. 

beyond  the  threshold  AM,  the  reorganization  of  the  basin  due  to  the  tangle  affects  only  the 
escaping  basin  and  the  nonresonant  bounded  basin  suffers  only  a  progressive  shrinking.  It 
is  to  be  noted  in  this  connection  that  Sn  does  not  undergo  period  doubling  and  the  onset  of 
chaos  is  finally  due  to  a  reverse  boundary  crisis  [31].  Here  the  chaotic  attractor  that  exists 
for  A  >  1 .017  collides  with  the  saddle  at  the  basin  boundary  resulting  in  the  sudden 
disappearance  of  the  attractor  together  with  its  basin  [32]. 

For  higher  values  of  u,  a  periodic  resonant  (0, 1)  mode  Sr  is  born  along  with  its 
unstable  counterpart  via  a  saddle  node  bifurcation  before  the  Melnikov  tangency  occurs. 
The  bounded  basin  in  the  central  well  is  now  shared  by  Sn  and  Sr  and  we  have  the 
phenomenon  of  resonant  hysteresis.  In  figure  2,  blue  regions  indicate  the  newly  created 
basin  of  Sr.  Above  the  threshold,  increase  of  A  results  in  a  drastic  reorganization  of  this 
basin  due  to  the  incursion  of  the  escaping  basin.  However  before  mass  erosion  of  the 
basin  starts,  Sr  undergoes  period  doubling  and  the  resulting  chaotic  attractor  disappears 
via  a  boundary  crisis  [31]  that  occurs  at  A  =  0.41 1.  Beyond  this  we  find  only  the  bounded 
basin  of  Sn  and  the  escaping  basin  shared  by  the  chaotic  attractor  and  attractors  in  other 
wells. 

However  for  values  of  u  «  1,  Sn  has  already  disappeared  and  the  bounded  basin 
belongs  to  Sr  alone  [27].  Then  above  AM  long  finger  like  structures  start  penetrating  its 
basin  and  massive  erosion  of  the  whole  bounded  basin  takes  place.  These  events  are  clear 
from  the  series  in  figure  3.  The  bounded  basin  in  the  central  well  is  coloured  yellow  while 
the  escaping  basin  belonging  to  attractors  in  the  successive  wells  on  the  right  are  given 
green,  violet,  red  and  blue  respectively.  The  brighter  shades  of  the  same  colours  indicate 
basins  of  modes  in  wells  on  the  left.  The  white  regions  lead  to  rotational  modes  or  chaotic 
attractors.  Similar  results  for  erosion  of  bounded  basins  have  been  reported  by  Soliman 
and  Thompson  [32, 33]  for  periodically  driven  and  linearly  damped  nonlinear  oscillators. 


Figure  3.  Evolution  of  the  fractal  basin  of  the  resonant  mode  for  u  =  0.8.  The 
development  of  the  heteroclinic  tangle  is  coincident  with  the  erosion  of  the  basin  due 
to  the  penetration  of  long  finger  like  structures.  Here  the  A  values  taken  are  0.2,  0.35, 
0.45,  0.5,  0.55  and  0.6. 


Figure  4.  Phase  locked  regions  in  the  (A,  u]  plane.  The  yellow  regions  correspond  to 
(0, 1)  mode  while  the  red  regions  lead  to  chaotic  or  a  periodic  states.  For  the  colour 
code  used  for  other  (p,  q)  modes,  see  the  text. 


1.  Phase  locked  states  and  chaotic  regimes  in  the  control  space 

n  this  section  we  discuss  the  occurrence  of  phase  locked  states  in  (2.8),  with  k  =  0.4, 
'  —  0.5  and  E  =  0.8.  The  A  —  u  plane  is  then  scanned  in  the  range  0.2-1.2,  with  a  step 
;ize  of  0.01  [34].  For  each  point  thus  chosen,  (2.8)  is  integrated  numerically  using  fourth 
)rder  RKG  scheme  with  a  time  step  equal  to  one  fiftieth  of  the  drive  period.  The  first 
)000  values  are  discarded  to  shake  off  transients  and  the  remaining  checked  for  phase 
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Figure  5.  Chaotic  domains  in  the  (A,u)  plane.  Here  the  red  regions  related  to 
periodic  states  with  AI  <  0,  while  the  other  shades  correspond  to  Aj  >  0;  for  details  of 
colours  used  see  the  text. 


4,0 


(a) 


0.2 


1.2 


Figure  6a.    Devil's  staircase  of  the  phase  locked  modes,  with  the  winding  number 
W  =  pfq  plotted  against  w.  Here  A  =  0.76,  k  =  0.4,  e  =  0.8  and  7  =  0.5. 

locked  states  (p,  q]  based  on  the  equations 

x\  (t  +  2nq/uj}  =  xi  (t)  +  2np , 

x2(t  +  2irq/tJ)=X2(t}.  (4.1) 

The  regions  of  different  (p,q)  values  in  the  (A,uj)  plane  are  shown  in  figure  4.  Yellow 
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Figure  6b.     Variation  of  the  maximal  Lyapunov  exponent  as  a  function  of  the  rf  drive 
frequency,  with  A  =  1 .0  and  other  parameter  values  as  in  6a. 
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Figure  7.    Bifurcation  diagram  with 

k  =  0.4,  1  =  0.5  and  e  =  0.8. 


as  the  tuning  parameter  for  A  =  0.765, 


regions  correspond  to  parameter  values  where  the  (0,  1)  mode  is  stable,  while  the  red 
regions  lead  to  periodic  or  chaotic  states.  The  regions  where  the  period-doubled  state  of 
the  (0,  1)  mode  viz  (0,2)  stabilizes  are  painted  dark  blue.  The  (1,  1)  and  (1,2)  rotational 
modes  are  stable  for  (A,  a)  values  chosen  from  the  light  blue  and  green  regions 
respectively.  Similarly  the  (2,  1)  and  its  corresponding  2-cycle  (2,  2)  stabilize  for  values 
from  the  orange  and  blue  magenta  regions  respectively.  The  magenta  color  corresponds  to 
the  stable  (3,  1)  mode.  Although  computations  were  done  for  q  <  32  and  p  <  5,  modes 
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clearly  seen  in  the  figure. 

The  regions  in  the  (A,  w)  plane  that  are  marked  red  in  figure  4  where  phase  locking  has 
not  been  observed  can  be  further  characterized  by  computing  the  Lyapunov  exponent 
[35,  36].  We  restrict  the  (A,  u]  plane  further  to  regions  of  mostly  chaos  i.e.  A  and  u)  values 
in  the  range  0.5-1.0  (figure  5).  The  'strength'  of  chaos  in  each  region  is  marked  out  by 
giving  different  shades  based  on  the  value  of  X\.  The  red  regions  in  the  figure  correspond 
to  negative  values  of  AI  and  therefore  belong  to  periodic  regimes.  The  major  chaotic 
windows  are  the  dark  blue  regions  where  0.1  <  AI  <  1.0  and  the  yellow  ones  with 
0.01  <  AI  <  0.1.  There  are  also  some  narrow  and  isolated  spots  of  cyan  for  which 
0.001  <  AI  <  0.01  and  blue  cyan  for  which  AI  lies  between  10~4  and  10~3. 
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Figure  8a,b.     (Continued) 
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Figure  8.  Phase  portraits  in  the  sin  x\  -  x-i  plane,  showing  the  period  doubling  route 
to  chaos  followed  by  interior  crisis  resulting  in  the  expansion  of  the  attractor.  In  (a) 
A  =  0.765  and  u  =  1.07,  the  periodic  (2, 1)  mode  is  seen  as  (b)  uj  is  tuned  to  1.06, 
(2, 1)  period  doubles  to  (2, 2),  (c)  the  chaotic  attractor  of  (2, 1)  for  u>  =  1 .04  and  (d)  the 
attractor  after  the  interior  crisis  for  u>  =  1.0. 


rhe  phase-locked  states  that  correspond  to  the  nonzero  voltage  steps  of  the  junction, 
re  relevance  in  voltage  standards  and  can  be  further  characterized  by  calculating  the 
iding  number  of  the  modes,  which  is  defined  as 

(4.2) 
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A  plot  or  Vv  against  cj  is  given  m  ngure  oa.  it  is  clear  mat  tne  system  gets  locKea  into  a 
single  step  for  a  range  of  values  of  u,  forming  a  devil's  staircase  structure.  Here  the  major 
steps  shown  in  the  figure  follow  the  Farey  tree  sequence  but  there  are  gaps  in  between 
steps  corresponding  to  random  behaviour  due  to  shuttling  between  two  steps.  The 
variation  of  AI  with  o>  for  A  =  1.0,  is  given  in  figure  6b.  The  phase  locked  windows  and 
the  chaotic  regimes  can  be  made  out  clearly. 

5.  Bifurcation  studies 

The  different  types  of  scenarios  that  the  system  follows  in  its  passage  to  chaos  can  be 
pursued  further  by  plotting  the  bifurcation  diagrams.  It  is  clear  from  §3  that  this  is 
decided  basically  by  the  frequency  u.  Hence  in  this  section  we  analyse  the  bifurcations 
that  the  system  undergoes  as  uj  is  slowly  tuned  from  above.  The  last  thousand  values  of 
the  integrator  sampled  at  the  frequency  of  the  drive  are  plotted  against  u>  to  get  the 
bifurcation  diagram  given  in  figure  7. 

Near  u  =  1,  the  resonant  (0, 1)  mode  that  is  excited  would  undergo  a  period  doubling 
as  u  is  decreased.  However  it  is  difficult  to  capture  the  successive  period  doublings  till 
chaos  is  reached.  This  is  mainly  because  the  basin  of  attraction  of  the  (0,  q)  mode  very 
soon  becomes  narrow  and  eaten  away  by  the  basins  of  other  (p,q]  modes  which  coexist 
with  (0,  q)  modes.  These  p  ^  0  modes  also  undergo  the  period  doubling  bifurcations  and 
reach  chaos.  As  u  is  tuned  further  the  chaotic  attractor  thus  produced  expand  and  merge 
together  due  to  an  interior  crisis  [32].  This  sequence  of  events  can  be  made  out  clearly 
from  the  set  of  phase  portraits  of  the  attractors  in  the  sin  x\  —  KI  plots  included  in  figure  8. 
Here  with  A  =  0.765  and  u  =  1.07,  the  system  settles  down  to  a  (2, 1)  mode.  The  first 
period  doubling  occurs  at  w  =  1.06  followed  by  another  at  1.056  etc.  The  accumulation 
point  is  at  1.04  followed  by  interior  crisis  and  the  extended  attractor  after  this  for 
(jj  =  1 .00  is  also  shown  in  the  figure. 
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Figure  9a.     (Continued) 
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Figure  9.  The  chaos  between  two  stable  windows  of  phase  locked  states.  In  (a)  with 
A  =  1.1,  u  =  0.85,  (2, 1)  mode  is  stable  while  in  (b)  for  u>  =  0.796,  (3, 1)  mode 
stabilizes.  For  u  =  0.80  (c),  the  chaotic  attractor  due  to  random  shuttling  between  the 
above  two  modes  is  seen. 

In  addition  to  the  above  two  types  of  chaotic  attractor,  the  system  supports  a  third  type 
which  arises  from  the  random  shuttling  between  two  phase  locked  states.  This  happens 
every  time  one  phase  locked  state  becomes  unstable  and  before  another  one  is  created. 
For  A  =  1.1,  periodic  states  are  observed  for  u  values  0.85  and  0.796.  In  between  for 
u;  =  0.80,  the  shuttling  type  of  behaviour  is  seen.  This  is  illustrated  by  the  portraits  in 
figure  9.  The  same  pattern  of  events  is  seen  repeated  in  several  regions  of  parameter 
space.  However  for  very  small  u;  values,  only  the  nonresonant  (0, 1)  mode  is  supported 
and  this  disappears  via  a  reverse  boundary  crisis  mentioned  earlier. 
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Figure  10.    Bifurcation  diagrams  as  in  figure  7,  but  with  (a)  7  =  0.0  (b)  e  —  0.0  (c) 
/  =  0.0,  e  =  0.0  and  (d)  /  =  0.0,  e  =  0.0  but  k-=  0.1. 

When  the  above  analysis  is  repeated  for  I  =  0.0,  it  is  found  that  most  of  the  chaotic 
regions  are  wiped  out  and  the  (0, 1)  mode  remains  stable  up  to  u>  =  0.6.  The  effect  is  less 
spectacular  when  e  is  put  equal  to  zero,  resulting  in  the  broadening  of  a  few  phase  locked 
windows.  When  both  /  and  c  are  excluded  the  periodic  modes  predominate  and  chaotic 
regions  reappear  only  when  k  is  reduced  to  values  «  0.1  (figure  10).  These  events  are 
supported  by  the  analysis  based  on  Melnikov  threshold  discussed  earlier. 


6.  Scaling  index  for  the  Lyapunov  exponent 

Since  AI  passes  through  zero  at  the  transition  point  uc  of  any  scenario,  its  scaling 
behaviour  near  this  point  is  studied  in  detail  here.  If  we  write 


\w-we\" 


(6.1) 


then  v  is  called  the  scaling  index.  Computationally  the  first  step  to  obtain  v  is  to  identify 
the  transition  point  accurately.  This  is  easy  for  the  period  doubling  scenario,  since  by 
tuning  u  in  very  fine  steps  the  value  for  which  AI  is  sufficiently  small  can  be  located.  A 
log-log  plot  of  AI  and  \u  —  ajc\  furnishes  the  index  v  as  its  average  slope  (figure  11).  Our 
studies  in  this  direction  indicates  that  the  value  of  v  for  most  cases  is  1.0  [37], 
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Figure    11.    Scaling   behaviour  of  Lyapunov   exponent.    Here  A  =  0.765   and 
wc  =  1.054510642311;  the  average  slope  is  v  -  1.00011. 

Table  1.  Scaling  indices  for  the  LE  values  during  the  transitions  to  chaos  via  the 
different  mechanisms  mentioned  in  the  text.  All  the  entries  except  the  fifth  are  for  the 
period  doubling  routes  while  the  fifth  one  is  for  the  transitional  type  of  chaos.  For  the  last 
entry  alone,  /  and  e  are  taken  to  be  zero  with  k  =  0.1. 


A 

u 

uc 

V 

0.5 

0.913519734-0.913519743 

0.913519743424 

0.9999 

0.9 

1.071781040-1.071781090 

1.07178113667069 

0.9998 

0.9 

0.889031880-0.889031960 

0.889031960230 

0.9966 

0.765 

1  .0545  1  0642280-1  .0545  10642306 

1.054510643111 

1  .0001 

1.1 

0.79679753021092-0.7967975302110 

0.796797530210909 

1.0001 

0.765 

0.54566203  1  341-0.54566203  1346 

0.5456620313487151 

1.0001 

:stablishing  linear  scaling  for  the  maximal  Lyapunov  exponent  in  the  system.  The  onset  of 
he  transitional  type  of  chaos  due  to  shuttling  between  two  phase  locked  states  is  also 
bund  to  follow  a  linear  scaling.  The  results  for  a  few  typical  cases  are  condensed  in  table  1 . 
We  also  scan  regions  of  crisis  along  the  cu-line,  where  the  AI  value  is  not  zero  at  uc. 
Phe  critical  value  of  interior  crisis  is  located  by  changing  u?  values  in  very  small  steps 
tear  the  crisis  value  or  by  looking  at  the  actual  trajectories  or  Poincare  maps  until  an 
expansion  of  the  attractor  is  observed.  After  the  crisis  the  AI  values  fluctuates  about  the 
:risis  value.  The  ujc  value  for  reverse  boundary  crisis  is  similarly  masked  by  the  presence 
>f  long-lived  chaotic  transients. 

1.  Conclusion 


n  this  paper,  the  results  of  a  detailed  analysis  of  the  phase  locked  states  and  chaotic 
•egimes  in  the  quadratically  damped  model  of  Josephson  junctions  is  presented.  The 
najor  results  reported  are  the  following. 
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periodicity  to  chaos  indicates  that  dc  bias  can  lower  the  threshold  as  a  whole.  So  also,  low 
values  of  the  damping  factor  k  favours  the  transition.  This  is  supported  by  the  bifurcation 
studies  presented.  Near  the  threshold,  the  boundary  of  the  bounded  basin  develops  fractal 
nature  and  for  high  values  of  u  long  finger  like  structures  start  penetrating  the  basin 
leading  to  its  rapid  erosion.  The  chaotic  regions  are  characterized  further  based  on  the 
magnitude  of  the  maximum  Lyapunov  exponent.  The  transition  to  chaos  for  the  cases 
studied  in  this  work,  is  associated  with  a  linear  scaling  relation  for  this  exponent. 

Direct  experimental  measurements  of  chaos  in  actual  junctions  do  not  unfortunately 
match  with  the  parameters  chosen  for  numerical  work.  However,  there  are  a  few 
exceptions.  Using  Pb-Te-Pb  junctions  with  a  Te  thickness  of  40-75  nm  exposed  to 
35  GHz  radiation,  the  occurrence  of  chaos  as  signalled  by  a  rise  in  the  equivalent  noise 
temperature  has  been  reported  [8],  In  this  case,  the  parameters  are  k  —  0.37,  u)  =  0.18  and 
/  is  tuned  from  5-7  mA.  They  also  observed  intermittency  between  phase  locked  states 
which  leads  to  noise  rise  at  the  end  of  the  voltage  steps  as  u  was  scanned  between  0.16 
and  0.14.  It  is  also  interesting  to  note  that  with  no  dc  bias,  no  rise  in  noise  was  observed  at 
u  =  0.2.  Our  results  are  more  or  less  in  confirmity  with  this,  since  with  /  =  0.0,  we  find 
most  of  the  chaotic  regions  are  wiped  out  (figure  10). 

The  devil's  staircase  picture  of  the  phase  locked  states,  the  scaling  behaviour  of  the  LE 
across  the  transition  point  and  the  characterization  of  the  chaotic  solutions  in  the 
parameter  space  are  some  of  the  novel  features  in  the  present  work.  However  parameter 
space  characterization  of  phase  locked  states  have  been  reported  in  earlier  models  also 
[9,22].  We  would  like  to  mention  that  in  [9],  the  I  -  i/k  plane  and  in  [22],  the  7  -  e 
plane  with  A  =  0.61,  w  =  0.8  and  k  —  0.2  are  chosen.  They  report  regions  of  periodic 
modes,  intermittent  chaos  connecting  two  metastable  states  called  transfer  crisis,  regions 
of  narrow  band  chaos  from  period  doubling  and  wide  band  chaos  where  the  X2  values  are 
not  confined.  The  last  case,  we  feel,  must  correspond  to  chaos  after  interior  crisis. 

Our  studies  relate  to  the  (A,  u>)  plane  where  A  and  u  are  external  parameters  that  can  be 
tuned  easily  unlike  k  and  e  which  are  internal  parameters  characterizing  a  particular 
junction.  The  (A.w)  plane  is  chosen  in  [38],  with  the  linear  RSJ  model,  for  k  =  0.2  and 
7  =  0.0.  The  threshold  is  lowest  for  u  =  1.0  and  chaos  occurs  only  above  A  =  1.0  for 
small  values  of  aj.  Moreover,  oscillatory  modes  of  the  (0, 1),  (0,2)  and  (0,3)  types  are 
seen  above  the  threshold.  This  is  different  from  our  results  since  we  find  only  windows 
of  rotational  modes  in  the  chaotic  region  above  the  threshold.  So  also,  the  lowest  for 
the  threshold  corresponds  to  u  =  0.65-0.7  and  near  u  =  1.0,  the  resonant  (0, 1)  mode 
is  stable.  This  is  because  eventhough  escape  from  the  central  well  occurs,  the  system 
settles  down  to  the  (0, 1)  mode  in  a  different  well  after  long  chaotic  transients.  Similar 
result  is  seen  in  Huberman's  work  [15].  Here  the  terminology  is  slightly  different  since 
the  phase  locked  states  are  specified  by  n(=  pjq)  rather  than  by  p  and  q  separately. 
Giving  allowance  for  this,  we  find  the  transition  curve  in  [15]  is  higher  than  in  our  work. 
This,  as  is  clear  from  the  latter  part  of  our  work,  must  be  due  to  the  inclusion  of  dc  bias. 
We  get  more  phase  locked  windows  in  regions  which  are  indicated  as  intermittency  in 
[15].  It  seems  as  if  the  quadratic  damping  together  with  the  cosine  term  increases  the 
tendency  of  the  system  to  get  locked  into  different  frequencies.  This  is  especially  so  for 
low  values  of  u>,  where  broad  windows  corresponding  to  (1, 1),  (2, 1),  (3, 1)  etc.  are  seen 
in  our  plot. 
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Chaos  in  Josephson  junctions 

It  is  interesting  to  note  that  the  type  of  quadratic  damping  present  in  the  system,  has 
relevance  in  a  broader  sense.  Firstly  such  systems  can  be  reduced  to  a  hamiltonian  format 
[39]  and  hence  its  quantum  effects  can  be  studied  using  approximation  techniques  [40]. 
Secondly,  rapid  motion  of  bodies  in  fluids  of  low  viscosity  gives  rise  to  vortices,  whose 
production  involves  resistance  forces  proportional  to  the  square  of  the  velocity.  This 
effect  is  then  called  turbulent  damping  [41],  In  general,  quadratic  damping  terms  prop  up 
whenever  forces  of  the  type  kv2  and  —  lev2  act  on  a  system,  alternatively  damping  and 
exciting  it  for  successive  half  cycles  of  oscillations. 

The  study  of  the  nonlinear  properties  of  JJ  carries  lot  of  promise  both  from  the  point  of 
view  of  applications  and  continued  research  on  fundamental  problems.  Recently  intense 
research  is  being  concentrated  on  long  JJ  which  is  an  extension  of  simple  junctions, 
where  a  spatial  variation  of  the  pair  phase  is  included.  This  system  is  known  to  support 
soliton  propagation  and  indicates  an  even  richer  dynamics  [42]  due  to  the  inter  play 
between  time  and  space  and  possible  analogies  to  turbulent  systems  [43]. 

In  conclusion,  the  problems  we  have  discussed  so  far  in  JJ  are  defined  on  the  basis  of 
BCS  superconductivity.  Future  work  involving  new  type  of  high  Tc  superconductors  may 
bring  out  completely  different  nonlinear  phenomena  due  to  anisotropy  and  complicated 
flux  dynamics.  Although  a  few  isolated  attempts  in  this  direction  have  already  been 
recorded  [44]  it  must  be  exciting  to  plunge  into  such  promising  areas  in  JJ  systems. 
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collision  processes  between  the  partons  (momentum  transfer  PT  >  1  GeV)  will  dominate 
the  parton  thermalization.  More  precisely,  Shuryak  [2]  has  made  use  of  lowest  order 
perturbative  QCD  matrix  elements  to  arrive  at  the  conclusion  that  at  RHIC  energies, 
gluons  would  thermalize  in  a  time  rg«  0.5  fm/c  whereas  the  quarks  would  take 
rq  «  2fm/c  with  the  initial  temperatures  estimated  as  500  MeV  for  gluons  and  200  MeV 
for  quarks. 

An  approach  complementary  to  that  involving  parton  'collisions'  is  the  one  giving 
thermalization  due  to  'collective'  non-perturbative  effects  in  a  quark  gluon  plasma. 
Unfortunately,  there  is  no  quantum  field-theoretic  approach  to  study  time-dependent  non- 
perturbative  phenomena  in  a  many-body  QCD  system;  hence  most  of  the  work  has  been 
carried  out  in  a  classical  framework.  Thus,  Mu'ller  and  Trayanov  [4]  have  carried  out  a 
classical  lattice  simulation  of  pure  SU(2]  Yang-Mills  theory  and  demonstrated  that  the 
system  is  chaotic.  They  further  argue  from  the  largest  Lyapunov  exponent  that  an 
estimate  of  the  thermalization  time  is  rg  w  0.5  fm/c,  which  is  comparable  to  Shuryak's 
value.  Bhatt  [5],  Bhatt  et  al  [6,7a]  and  more  recently  Sengupta  et  al  [8]  have  respectively 
used  parton  hydrodynamic  equations  and  parton  particle  simulations  to  demonstrate 
chaos  in  non-abelian  longitudinal  oscillations  in  a  quark  gluon  plasma;  as  shown  below, 
the  estimate  of  collective  thermalization  time  from  these  studies  is  r  ~  1  fm/c. 

It  is  important  to  emphasize  however,  that  the  thermalization  due  to  semi-hard 
collisional  processes  and  that  due  to  collective  (chaotic)  behaviour  correspond  to 
physically  different  parameter  domains.  The  crucial  parameter  is  the  plasma  parameter 
AD  =  nAj-j  i.e.,  the  number  of  particles  in  a  Debye  sphere.  It  is  well  known  that  collective 
behaviour  dominates  when  AD  »  1  and  collisional  effects  will  be  more  important  when 
AD  SI. 

In  this  review  we  give  a  very  brief  qualitative  account  of  the  parton  thermalization 
work  done  by  earlier  workers  (§  2)  and  then  go  on  to  describe  in  some  detail  the  work 
done  by  our  group  on  parton  thermalization  by  collective  effects  (§  §  3-5).  Our  work  is 
based  on  the  classical  parton  equations  of  motion  due  to  Wong  [9].  These  are  introduced 
in  §  3,  with  a  detailed  discussion  of  the  model.  In  §  4,  we  present  the  results  of  our  studies 
based  on  colour  hydrodynamic  equations  [6, 7, 9]  derivable  from  Wong's  model.  Section  5 
describes  the  results  of  our  recent  particle  simulation  of  Wong's  equations  and 
summarises  our  conclusions  regarding  thermalization  by  collective  non-abelian  effects. 
Finally,  §  6  provides  a  discussion  of  the  results. 


2.  Review  of  earlier  work  on  thermalization 

Detailed  microscopic  models  of  relativistic  heavy-ion  collisions,  based  on  QCD,  are 
being  extensively  used  to  investigate  issues  of  space-time  evolution  and  subsequent 
thermalization  of  collision  products.  Such  models  use  either  quantum  mechanical  or 
classical  approach  to  describe  the  collision  process.  Models  using  quantum  mechanics  are 
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further  sub-divided  into  two  groups,  viz.,  (a)  string  models  and  (b)  Parton  CW,rf 
models.  In  the  string  models,  [10, 1 1],  the  non-perturbative  aspect  of  QCD  interact  o 
described  phenomenologically  based  on  the  data  of  soft  hadron-hadron  collision 
(typically  a  reasonable  description  when  the  four  momentum  transfer  Pr  is  small 
compared  to  the  QCD  scale  parameter  AQCD).  In  the  parton  cascade  models  [31  on  the 
other  hand,  one  uses  perturbative  QCD  which  is  known  to  be  a  satisfactory  model  of 
semi-hard  and  hard  (PT  >  AqCD)  hadron-hadron  interactions.  The  classical  models  [41 
provide  us  a  complementary  picture  to  the  quantum  mechanical  models  since  they  permit 
us  to  incorporate  time-dependent  non-perturbative  aspects  of  the  QCD  interactions  In 
these  models,  the  well  known  chaotic  behaviour  of  the  classical  non-abelian  gauge 
theories  is  linked  with  the  thermalization  of  the  plasma. 

Each  of  the  models  mentioned  above  has  its  own  merits  and  deficiencies.  We  now 
briefly  review  them. 

2.1  String  models 

As  indicated  above,  this  approach  is  based  on  a  phenomenological  model  of  soft 
(PT  <  AQCD)  hadron-hadron  collisions.  In  these  models  [10,11],  the  two  colliding 
hadrons  are  assumed  to  interpenetrate  each  other  and  form  a  colour  flux  tube  between 
them.  The  colour  tube  then  disintegrates  into  quark-antiquark  pairs  and  gluons  in  a 
time  scale  of  order  1  fm/c.  The  entropy  production  and  thermalization  time  for  the  heavy- 
ion  collision  problem  is  then  determined  by  such  soft  hadron-hadron  collision 
processes.  At  higher  beam  energies  involving  four-momentum  transfer  PT  >  AQCD,  the 
application  of  such  models  becomes  questionable  primarily  because  the  string  density 
increases  significantly  and  the  strings  begin  to  overlap.  Since  the  models  do  not  consider 
the  interaction  between  the  strings  [10, 11],  they  begin  to  lose  their  validity.  At  such 
energies,  hard  collisions  between  the  hadrons  play  a  very  important  role  and  a  prediction 
of  the  collisional  dynamics  solely  based  on  soft  collision  processes  may  become 
inaccurate. 

There  also  exists  a  model  [12, 13]  offering  a  similar  picture  of  the  nuclear  collision 
at  a  'macroscopic'  level.  This  is  the  condenser  plate  model  of  ultra  relativistic  heavy 
ion  collisions.  The  picture  of  the  collision  offered  by  such  a  model  may  be  valid 
even  at  /Y>AQCD-  In  this  model,  one  imagines  that  the  Lorentz-contracted  nuclei 
at  ultra  relativistic  energies,  assume  the  shapes  of  pancakes  with  a  thickness  ~  1  fm 
[14].  The  nuclei  then  become  essentially  transparent  and  pass  through  each  other 
because  the  time  to  fly  past  their  Lorentz  contracted  diameters  is  shorter  or  comparable 
to  the  strong  interaction  time-scale.  The  central  region  of  the  collision  could  thus  be 
baryon  free.  However,  the  soft  gluon  exchange  process  can  create  colour  charge  on 
the  receding  nuclei  producing  a  colour  electric  field  in  the  central  region.  This 
colour  electric  field  is  unstable  to  the  production  of  quark-antiquark  pairs  and  gluons 
[15, 16].  Thermalization  time  of  the  quark-gluon  plasma  produced  in  this  way  can  be 
estimated  using  kinetic  theory  with  a  parton  production  rate  term  evaluated  using  the 
Schwinger  mechanism.  The  particle  energy  density,  entropy  production  rate  and 
equilibration  time  can  be  computed  using  the  kinetic  theory  approach  [12, 17, 18].  It  is 
found  that  the  thermalization  time  is  of  the  order  of  a  few  tens  of  parton  production  rate 
time  units. 
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Abstract.  A  very  important  question  in  ultrarelativistic  heavy  ion  collisions  is  that  of 
thermalization  of  the  high  energy  density  quark  gluon  plasma  formed  in  the  central  rapidity 
region.  Different  approaches  have  been  adopted  by  various  authors  to  study  this  thermalization 
problem.  These  include  phenomenological  string  and  capacitor  plate  models,  perturbative  QCD 
based  parton  cascade  models  and  the  classical  non-perturbative  approach.  In  this  paper  we  briefly 
review  the  earlier  studies  and  discuss  our  work  which  emphasizes  the  role  of  non-perturbative 
collective  effects  (classical  chaos)  in  the  thermalization  of  the  plasma.  In  particular,  using  classical 
equations  of  motion  of  a  coloured  parton  in  self-consistent  colour  fields,  we  have  carried  out  a  1  +  1 
dimensional  simulation  of  coloured  partonic  matter.  We  find  that  in  certain  parameter  domains,  the 
system  exhibits  chaotic  behaviour  in  non-abelian  plasma  oscillations,  which  then  leads  to 
thermalization  of  the  plasma. 

Keywords.    Quark  gluon  plasma;  collective  effects;  thermalization;  chaos. 
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1.  Introduction 

There  is  a  strong  theoretical  expectation  that  collisions  of  nuclei  at  ultra  relativistic 
energies  would  produce  a  deconfined  state  of  coloured  partons  called  quark  gluon  plasma 
(QGP).  The  fireball  (plasma)  that  is  produced  during  the  collision,  will  evolve  in  space 
and  time  from  a  pre-equilibrium  phase  to  a  thermally  equilibrated  phase  (atleast  locally) 
and  will  eventually  undergo  a  phase  transition  to  hadronic  matter.  An  important  element 
in  this  scenario  is  the  process(es)  by  which  the  highly  excited,  dense  coloured  particle 
system  comes  to  thermal  equilibrium  and  the  time  it  takes  for  reaching  equilibrium.  The 
study  of  thermalization  is  crucial  for  several  reasons.  In  a  fundamental  sense,  it  throws 
light  on  the  dynamics  of  the  pre-equilibrium  phase.  Further  from  a  practical  viewpoint, 
once  the  plasma  is  in  local  thermodynamic  equilibrium,  its  space  time  evolution  can  be 
examined  using  the  well-known  hydrodynamic  approach  of  either  Landau  or  Bjorken. 
From  the  experimental  point  of  view,  thermalization  has  important  consequences  for 
signals  of  quark  gluon  plasma  [1]. 

A  number  of  studies  using  quite  different  dynamic  processes  have  been  carried  out  to 
understand  the  thermalization  problem.  This  already  indicates  that  there  is  a  great  deal  of 
uncertainty  concerning  the  dynamics  in  the  very  early  stages  of  the  evolution  of  the 
plasma.  In  one  of  the  approaches  extrapolating  from  pp  and  pA  studies,  soft  QCD 
processes  (involving,  for  example,  the  dual  string  model)  have  been  assumed  to  be 
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2.2  Perturbative  QCD  based  models 

When  the  four  momentum  transfer  PT  >  AQCD,  methods  of  the  perturbative  QCD 
(pQCD)  can  be  used  to  describe  the  hadron-hadron  interaction.  At  collider  energies 
Vs  >  lOOGeVA,  major  component  of  the;?/?  andpp  collision  spectra  can  be  reproduced 
by  taking  semi-hard  or  hard  partonic  interaction  into  account.  Therefore,  it  is  reasonable 
to  expect  that  in  the  heavy-ion  collisions  at  RHIC  or  LHC  energies  hard  or  semi-hard 
partonic  scattering  would  play  a  significant  role  in  describing  the  evolution  of  the 
collisions. 

In  this  approach,  each  nucleon  is  initially  in  a  colour  coherent  state  and  regarded  as  a 
cloud  of  partons.  Each  parton  carries  some  fraction  of  the  nucleon's  energy  and  is  capable 
of  producing  a  virtual  cascade  of  partons.  Due  to  hard  or  semi-hard  collisions  with 
another  particle,  the  parton  in  the  virtual  cascade  can  become  real  and  the  colour 
coherence  in  the  nucleon  may  get  broken.  Thus,  we  have  the  following  picture  of  the 
heavy  ion-collisions:  Two  fluxes  of  partons  are  counter  streaming  against  each  other 
leaving  a  highly  excited  dense  partonic  system  in  the  central  region,  which  is  not  in 
thermal  equilbrium.  Thermalization  is  established  due  to  interactions  among  the  partons. 
The  phase-space  evolution  of  the  partonic  system  is  described  by  Boltzmann  type 
transport  equation  for  various  species  of  quarks,  antiquarks  and  gluons.  The  form  of 
collision  terms  for  the  different  processes  are  obtained  using  the  perturbative  QCD  cross- 
sections. 

The  initial  parton  distribution  contains  mostly  gluons  and  they  thermalize  faster  than 
the  quarks  as  the  gluon-gluon  interaction  dominates  over  quark-gluon  or  quark-quark 
interaction.  It  is  found  from  such  studies  that  momentum  equilibration  of  the  initially 
produced  parton  distribution  occurs  very  early  i.e.,  in  a  time  of  the  order  of  the  inverse  of 
collision  frequency  [19].  However,  as  the  parton  production  continues  even  after  the 
momentum  isotropy  is  established  the  chemical  equilibration  can  be  established  only  at  a 
later  stage  of  the  collision.  The  production  rates  of  the  partons  are  calculable  by 
perturbative  QCD  in  the  early  stages  of  the  collision.  The  entropy  produced  per  secondary 
parton  as  a  function  of  time  can  be  studied  within  such  models  and  it  reaches  a  plateau  for 
a  wide  range  of  energies  in  a  time  less  than  1  fm/c.  Typical  estimate  for  the  gluon 
equilibration  time  is  rg  ~  0.5  fm/c  and  production  and  equilibration  time  for  quarks  is 
rq  ~  2  fm/c.  Thus,  one  may  expect  that  the  thermalization  in  heavy-ion  collision 
proceeds  through  two  stages:  first  gluons  get  thermalized  at  a  higher  temperature 
(~500MeV)  and  then  quarks  thermalize  at  a  lower  temperature  («200MeV).  Some  of 
the  predictions  of  this  model  can  be  tested  by  probing  the  signals  coming  from  the  early 
phase  dynamics  of  nuclear  collisions  [20]. 

2.3  Classical  collective  models 

As  mentioned  above,  there  are  no  quantum  field  theoretic  methods  available  to  study 
time-dependent  non-perturbative  aspects  of  collective  (many-body)  phenomena  in  QCD. 
Hence,  most  efforts  in  this  direction  are  based  on  a  classical  framework.  It  is  well  known 
that  a  classical  theory  of  elementary  particles  and  fields  has  ultraviolet  problems  because 
of  which  it  is  not  adequate  to  describe  QCD  interactions.  However,  it  is  found  that  at  high 
temperature  and  in  the  long-wavelength  limit,  classical  and  perturbative  QCD  treatments 
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give  the  same  results  for  quark  gluon  plasma  dispersion  relations  [11].  Therefore,  one 
tends  to  believe  that  even  at  the  non-perturbative  level,  qualitative  features  of  the  classical 
approach  will  remain  valid  in  the  long  wave-length  limit.  Unlike  the  quantum  mechanical 
models,  where  non-perturbative  effects  are  included  phenomenologically  (strings,  flux 
tubes,  etc.),  in  the  classical  approach  the  non-perturbative  aspects  come  from  the  QCD 
Lagrangian.  As  mentioned  earlier,  the  thermal ization  of  the  plasma  is  then  related  to  the 
well  known  tendency  of  classical  non-abelian  gauge  fields  to  become  chaotic.  Following 
the  earlier  work  of  Matinyan  et  al  [21]  on  development  of  chaos  in  classical  Yang-Mills 
theories  (in  the  infrared  limit  k  — >  0),  recently  Miiller  and  Trayanov  [4]  and  Gong  [22] 
have  carried  out  numerical  studies  of  the  classical  Yang-Mills  theories  on  a  lattice  in  the 
presence  of  spatial  inhomogeneities  (k  ^  0) .  They  show  that  randomly  chosen  field 
configurations  of  SU(2)  or  517(3)  gauge  theories  have  universal  largest  Lyapunov 
exponent  which  scales  with  the  average  energy  of  the  system.  It  was  shown  that  for  an 
extremly  large  class  of  field  configurations  the  universal  exponent  is  given  by 
AQ  ~  (l/10)g2(Ep)  (for  5/7(3)  theories).  The  entropy  for  such  a  system  grows  linearly 
with  time  and  is  given  by  the  relation 

S(0=S0  +  £/V.  (0 

From  this  relation  one  can  obtain  the  entropy  growth  rate  dS/df  >  AQ.  The  typical 
thermalization  time  found  from  the  entropy  production  rate  is  in  conformity  with  the 
perturbative  QCD  based  expectation. 

As  indicated  in  the  introduction,  there  is  another  classical  approach  to  the  problem  of 
equilibration  which  has  been  pursued  by  our  group  [7, 8].  This  approach  is  motivated  by 
the  desire  to  put  quarks  and  gluons  on  the  same  footing  in  a  description  of  the  collective 
phenomena,  in  contrast  to  the  above  calculations  where  the  quarks  are  totally  ignored.  In 
the  rest  of  this  paper  we  shall  present  a  detailed  review  of  this  work. 


3.  Classical  colour  parton  equations 

We  now  describe  the  approach  followed  by  our  group  to  investigate  collective  non- 
abelian  aspects  of  thermalization  in  a  quark  gluon  plasma.  Our  basic  objective  was  to 
extend  the  work  of  Miiller  and  Trayanov  [4]  and  Gong  [22]  on  pure  gluon  fields  by 
carrying  out  studies  involving  both  the  quark  and  the  gluon  sector  simultaneously.  For 
this  purpose,  we  followed  the  approach  suggested  by  the  work  of  Wong  [9]  and  Elze  and 
Heinz  [23].  In  this  approach  one  follows  the  equation  of  motion  of  a  classical  coloured 
parton  in  the  mean  fields  generated  by  the  colour  current  density  in  the  Yang-Mills 
equations,  where  the  moving  coloured  partons  themselves  self-consistently  determine  the 
colour  current  density.  Furthermore,  in  this  description,  incoherent  small  wavelength 
gluons  and  quarks/antiquarks  are  treated  on  the  same  footing,  both  being  treated  as 
coloured  partons.  The  separation  of  the  gluons  into  two  scales  viz.  one  corresponding  to 
coloured  partons  and  the  other  corresponding  to  mean  fields  is  in  the  nature  of  an 
approximation  [23].  Compared  to  full  Yang-Mills  solutions,  it  amounts  to  losing  some 
information  and  phenomena  related  to  intermediate  scales.  However,  the  specific 
advantage  one  gets  is  that  one  can  simultaneously  incorporate  the  contribution  of 
quarks/antiquarks  and  gluons  to  the  collective  thermalization  process. 
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The  equations  of  motion  of  a  classical  coloured  particle  moving  in  an  external  colour 
gauge  field  were  first  obtained  by  Wong  [9]  from  the  QCD  Lagrangian.  It  was  later 
pointed  out  that  the  equation  of  motion  of  the  short  wavelength  gluons  can  also  be 
described  by  these  equations  [23].  The  equations  of  motion  of  a  classical  coloured 
particle  moving  on  a  world  line  x^(r)  and  having  momentum  p^1  =  m(dx/t/dr)  are 
given  by 


where  r  is  the  proper  time  of  the  particle.  The  gauge  field  F%"  and  the  potential  A% 
correspond  to  the  mean  fields  and  are  evaluated  at  point  JCM(T).  g  is  the  coupling  constant 
and  fabc  are  the  structure  constants  of  the  relevant  gauge  group.  Ia  is  an  N2  —  1 
component  vector  of  the  colour  charge  through  which  a  coloured  particle  couples  with  the 
gauge  fields.  In  fact,  (2)  is  very  similar  to  the  Lorentz  force  equation  of  the 
electromagnetic  theory.  Equation  (3)  describes  the  dynamics  of  the  colour  charge  of 
the  particle,  which  has  no  analogue  in  the  electromagnetic  theory.  The  meaning  of 
classical  'colour  charge  of  a  particle  becomes  clear  if  it  is  interpreted  in  terms  of  the 
expectation  value  of  the  quantum  current  operator  j%?(x)  =  g'ijj(x}'yIJ'Tai/j(x}.  One  can 
carry  out  the  identification 


It  should  be  mentioned  that  la(r)  is  a  continuum  variable  and  hence  is  strictly  speaking 
appropriate  in  the  large  TV  limit.  The  gauge  field  tensor  F%v  is  given  by 


(4) 
Dynamics  of  mean  field  F£"  is  described  by  the  Yang-Mills  equations 

dltF?+gfai*Al*r?',=JVa,  (5) 

where  jva  are  the  components  of  the  current  density.  jva  can  be  evaluated  from  the 
knowledge  of  the  coloured  particle  trajectories  obtained  from  (2)  and  (3).  In  general,  it  is 
very  difficult  to  solve  (2)-(5)  for  a  many  particle  system.  In  order  to  gain  insight  into  the 
collective  modes  described  by  such  a  system,  we  first  study  them  in  the  hydrodynamic 
limit.  Equations  describing  the  colour  hydrodynamics  were  first  written  by  Kajantie  and 
Montonen  [24],  based  on  (2)-(5).  Such  equations  can  be  obtained  by  taking  the  moments 
over  the  many  particle  distribution  (see  for  example  Bhatt  [5]). 

4.  Colour  hydrodynamics  approach  to  collective  behaviour 

The  basic  idea  in  this  approach  is  to  study  collective  properties  of  QGP  in  a  classical  but 
non-perturbative  manner.  For  this  purpose  we  use  the  colour  hydrodynamic  equations 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  II) 
660  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


deduced  by  Kajantie  and  Montonen  [24]  from  Wong's  equations  discussed  in  the  last 
section. 

We  would  like  to  mention  that  the  classical  approach  has  been  applied  to  the  study 
of  collective  properties  in  quark  gluori  plasmas  by  other  workers  also  [25].  In  these  papers, 
however,  the  plasma  was  considered  to  be  essentially  abelian  in  nature  with  perturbations 
taken  in  a  linear  form.  In  the  work  carried  out  by  our  group  the  non-abelian  character 
of  the  plasma  is  retained  [6,7].  The  hydrodynamical  frame  work  was  preferred  over 
the  kinetic  or  other  detailed  microscopic  models  because  hydrodynamic  models  are 
simpler  to  deal  with  and  they  can  describe  almost  all  the  collective  modes  described 
by  the  kinetic  theory.  Further,  for  simplicity,  colour  SU(2)  group  is  chosen  instead  of 
SU(3). 

The  colour  hydrodynamic  (CHD)  equations  describe  the  space-time  evolution  of 
classical  coloured  particles  in  the  fluid  picture.  These  equations  contain  three  continuous 
quantities:  number  density  of  the  coloured  particles  n.  velocity  field  V  and  the  colour 
charge  vector  lu(a  =  1,2,3)  which  are  functions  of  space  and  time.  Their  dynamics  is 
described  by  [24]  the  equations 

(6) 
VA  x  B£)  -  VA(VA-Ea)]t 

(7) 

-  VA-  Ah]lAc.  (8) 

In  (6)-(8)  the  suffix  A  and  mA  denote  species  label  of  coloured  particles  and  mass  of 
species  A  respectively.  For  light  relativistic  particles  mass  mA  is  replaced  by  (enthalpy 
density /number  density)  for  the  relevant  specie  fluid.  cttbc  is  the  completely 
antisymmetric  Levi-Civita  tensor  and  A^(fj,  =  0, 1,2, 3)  are  SU(2)  gauge  potentials 
which  satisfy  Yang-Mills  equations  dflF%v  +  geahcA^F^11  =  fa.  Here,  jva  represents 
matter  current  which  can  be  expressed  in  terms  of  CHD  variables  asj°a  =  g  X^  nAIAa  and 
Ja  =  8  ZM  nA  V/iI/ia.  Equations  (6)-(8)  together  with  Yang-Mills  equations  form  a  closed 
set  of  Lorentz  and  gauge  covariant  equations  [6]:  which  describe  the  evolution  of  the 
QGP  self  consistently.  It  should  be  noted  that  among  the  CHD  variables  nA  and  VA  are 
gauge  invariant  quantities  while  the  colour  vector  IAa  transforms  gauge  covariantly  [6]. 

Our  model  does  not  explicitly  take  account  of  the  effects  due  to  pair  creation  and 
annihilation  processes  qq  <-»  g  and  gg «-» g.  This  would  lead  to  appropriate  source  and 
sink  terms  in  the  equations  of  continuity,  motion,  etc.  We  now  show  that  for  collective 
phenomena,  this  is  justifiable  provided  the  collision-frequency  for  momentum  transfer 
due  to  collective  effects  is  larger  than  the  appropriate  pair  creation/annihilation  collision 
frequencies.  The  typical  ratio  of  the  plasma  frequency  wp  to  the  collision  frequency  v 
of  pair  creation/annihilation  processes  may  be  estimated  as  [25]  uv/v  =  4?r/ 
as<\/(Nf  +  2N)/l%  (\n(a~l})~\  Here,  N{  and  N  are  number  of  quark  flavours  and  the 
number  of  the  gluon  colours  respectively,  g  is  the  strong  interaction  coupling  constant  and 
Q>S  =  g2/4n.  For  the  plasma  with  three  flavours  of  quarks  and  g  ~  1,  say  at  temperature 
~  200  Me V,  one  finds  wp/f  ~  10.  Thus  one  may  have  significant  momentum  transfer  by 
collective  processes  compared  to  the  pair  creation/annihilation  collisional  processes  and 
the  latter  may  be  neglected.  In  our  model,  then,  the  equilibrium  number  densities  of 
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partons  are  fixed  and  are  prescribed  externally,  based  on  our  knowledge  of  the 
background  temperature,  energy  density  etc. 

We  now  use  (6)-(8)  to  describe  thermalization  due  to  non-abelian  collective  effects  in  a 
quark  gluon  plasma.  Let  us  imagine  that  because  of  the  violent  nature  of  a  heavy  ion 
collision  a  significant  energy  is  stored  in  coherent  electric  fields.  For  simplcity,  we 
assume  that  most  of  this  energy  is  in  the  form  of  longitudinal  oscillations.  We  would  like 
to  address  the  question:  what  happens  to  the  energy  as  the  system  evolves?  Is  there  any 
evidence  that  this  energy  may  be  thermalizing?  If  so,  what  are  the  time-scales  involved? 

To  investigate  these  questions,  we  have  studied  non-abelian  longitudinal  oscillations  of 
a  quark  gluon  plasma.  If  the  colour  electric  field  is  produced  in  the  z-direction,  then,  we 
can  simplify  (6)-(8)  and  the  YM  equations  by  choosing  a  purely  longitudinal  mode  (i.e., 
A-a  —  •Aa  =  0)  and  choosing  the  gauge  A"  =  0.  The  potentials  Aza  are  functions  of  z  and  t 
variables,  but  we  further  assume  that  they  depend  only  on  the  variable  £  =  z  +  fit. 
Physically  this  means  that  one  is  studying  a  class  of  solutions  which  are  stationary  in  a 
frame  moving  with  phase  velocity  (3.  Such  stationary  wave  solutions  are  widely  studied  in 
the  Coulomb  plasma  literature  and  have  also  been  considered  for  the  non-abelian  theories 
[26,27].  With  these  assumptions  the  basic  CHD  equations  reduce  to  three  coupled  non- 
linear second  order  ordinary  differential  equations  having  a  dimensionless  parameter 
e  =  g2lQa^/mujp  where,  IQ  and  ao  represent  the  scaling  parameters  for  the  colour  charge 
and  the  gauge  field  amplitude  Aza  respectively  and  UJP  is  the  plasma  frequency.  We  next 


x   o  - 


1000 


662 


Figure  1.    Hydrodynaraic  variable  XI  vs.  time  for  non-abelian  parameter  t  —  0.05. 
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Figure  2.     Hydrodynamic  variable  X]  vs.  time  for  non-abelian  parameter  e.  =  0.5. 

define  quantities  X/(i  =  1,2,3),  which  are  linear  combinations  of  the  scaled  gauge 
potential  Aza  =  Aaa0.  They  are  X\  =A\ +A2  +  A^  X2  =  <J$/2(A\  -  A2)  and  X2  = 
^/]~/2(A]  —  2A2  +  AS).  In  terms  of  the  X,-'s  the  equations  reduce  to 

(9) 
(10) 

(11) 

We  would  like  to  note  that  for  the  case  of  homogeneous  plasma  i.e.  k  — >  0,  we  get 
the  same  set  of  equations  as  that  with  the  stationary  frame  ansatz  (for  details  of 
both  calculations,  see  Bhatt  et  al  [6]).  Our  main  findings  are  as  follows:  (i)  for  the 
case  e  =  0  we  get  the  longitudinal  plasma  oscillations  with  constant  amplitude  and 
frequency,  which  are  very  similar  to  those  in  the  Coulomb  plasma.  In  this  case,  in  fact, 
the  equation  describing  the  oscillation  becomes  linear,  (ii)  For  small  values  of  e(=  0.05) 
we  find  a  new  "earphone"  oscillation  (figure  1)  mode  in  which  the  high  frequency 
oscillations  are  modulated  at  a  lower  frequency  and  the  abelian  oscillation  of  the 
plasma  at  the  plasma  frequency  is  periodically  followed  by  a  non-abelian  oscillation 
with  higher  frequency  and  reduced  amplitude.  For  higher  values  of  e  ~  0.5  the 
oscillations  are  found  to  exhibit  chaotic  behaviour  (figure  2).  We  have  also  studied  auto- 
correlations for  such  a  time  series  [5].  The  analysis  shows  that  the  overall  decay  in  time  tc 
is  such  that  uptc  <up/v  ~  10  as  discussed  earlier  in  the  section.  Thus  the  decay 
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time  scale  for  oscillations  can  be  faster  than  the  collisional  time  scale  for  larger  values  of 
the  parameter  e.  We  have  found  from  this  analysis  thai  non-abelian  effects  lead  to  chaos  in 
plasma  oscillations.  Such  chaotic  phenomena  are  already  observed  in  the  vacuum 
solutions  of  the  Yang-Mills  theory. 

The  hydrodynamic  appraoch,  however,  does  not  give  any  information  about  the 
evolution  of  parton  distribution  function.  Such  information  can  be  obtained  from  the 
solutions  of  (2)-(5).  The  chaotic  behaviour  of  the  plasma  oscillations  have  implication 
for  the  thermalization  of  the  energy.  To  analyze  this  question  in  more  detail  we  have 
carried  out  a  1  +  1  dimension  particle  simulation  of  the  longitudinal  oscillations  in  the 
QGP. 


5.  Particle  simulations  of  a  coloured  parton  plasma 

Particle  simulations  are  extensively  used  to  study  the  nonlinear  physics  of  classical 
electrodynarnic  plasmas.  These  simulations  are  in  (he  nature  of  computer  experiments  in 
which  one  explicitly  follows  the  motion  of  N  particles  in  their  own  self-consistent  fields 
which  are  evolved  using  the  Maxwell  field  equations.  Since  the  simulations  are  typically 
done  to  explore  collective  nonlinear  phenomena,  one  usually  discriminates  against  short 
range  Coulomb  interactions  (collisional  effects)  by  using  large  number  of  particles  in  a 
Debye  sphere  (Ac  3>  1)  and/or  by  using  smoothed  out  'fat'  particles  [28J.  Due  to 
limitations  imposed  by  computers,  particle  simulations  are  typically  restricted  in 
maximum  N  usable,  dimensionality  and  noise  effects/errors  due  to  finite  grid  size  and 
time  step  of  integration.  Within  these  limitations,  particle  simulations  can  go  well  beyond 
the  hydrodynamic  description  of  last  section  and  can  give  us  a  detailed  *,  t  evolution  of 
fields,  distribution  function  of  particles  in  phase  space  and  so  on.  It  is  therefore 
particularly  well  suited  for  investigating  questions  relating  to  collective  thermalization  in 
coloured  parton  plasmas. 

We  have  written  a  relativistic  particle  simulation  code  for  colour  partons  [8].  In  the 
code  we  follow  the  motion  of  a  collection  of  classical  coloured  partons  interacting  via 
non-abelian  gauge  fields.  The  classical  partons  (which  represent  quarks  and  short  scale 
giuons  as  discussed  in  §  3)  constituting  the  plasma  are  described  using  three  variables:  the 
particle  position  x,  velocity  v  and  the  colour  charge  IAa.  The  dynamics  of  the  particles  is 
governed  by  the  relativistic  Lorentz  force  law  due  to  Wong  [91  viz.  (2).  The  dynamics  of 
the  fields  F£"  is  given  by  the  Yang-Mills  equation,  (5).  The  coloured  partons  interact  by 
generating  a  charge/current  density  fa  which  acts  as  a  source  for  the  Yang-Mills  fields. 
These  fields  exert  force  on  the  partons  and  also  rotate  their  colour  vectors.  The  variation 
of  colour  charge  of  a  particle  is  governed  by  the  colour  dynamic  equation,  (3).  It  is  this 
attribute  of  a  coloured  parton  plasma  which  distinguishes  it  from  conventional 
electrodynamic  plasmas  and  makes  our  simulation,  first  one  of  its  kind.  The  equilibrium 
densities  of  partons  are  treated  as  given  constants  (since  we  are  not  explicitly 
incorporating  effects  due  to  particle  creation  and  annihilation).  Finally,  the  equilibrium 
parton  densities  are  distributed  in  such  a  manner  that  the  plasma  has  overall  colour 
neutrality,  as  is  expected  in  a  parton  plasma  created  by  heavy  ion  collisions. 

Our  objective  in  these  simulations  is  to  investigate  collective  aspects  of  thermalization 
in  non-abelian  plasmas.  We  thus  start  with  a  cold  (T  -»•  0)  coloured  parton  plasma  with 
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large  AD  and  at  t  —  0  put  a  lot  of  energy  in  coherent  longitudinal  fields  (such  as  the  ones 
which  might  be  created  because  of  heavy  ion  collisions,  as  discussed  in  §4).  Restricting 
ourselves  (for  simplicity)  to  longitudinal  field  interactions  only  we  ask,  how  does  this 
energy  equilibrate  and  what  is  the  time  scale  of  equilibration?  For  this  purpose  we  study 
the  time  evolution  of  fields  (where  we  find  generation  of  chaos  and  energy  spreading  in 
Fourier  space)  and  parton  distribution  functions  (where  we  find  heating  and  evolution  to 
Max  well  /Max  well-Jiittner  distributions).  Finally,  we  ask  whether  in  the  thermalized 
state,  there  is  an  equipartition  of  energy  between  the  mean  field  modes  and  the  kinetic 
energy  per  parton  (and  we  find  it  is  indeed  so!). 

As  discussed  above,  we  restrict  the  simulations  to  dynamics  of  longitudinal  colour 
fields  only  and  work  in  1  +  1  dimensions.  We  use  the  gauge  condition  Ala  =  0  so  that  our 
basic  equations  take  the  form 

n-n 
*,  (12) 


fl, 

at 

(14) 


05) 
/1/:\ 

,  (  1  O  ) 

at 

where  7/1  =  (1  -v^/c2)"1'2,  A  refers  to  parton  species,  repeated  indices  are  to  be 
summed  over  and  we  have  made  the  equations  dimensionless  with  the  following 
scalings:  XA  -»  xA/kl\vA  ->•  vA/vfk~l,  t  -»•  WptJAa  -»  fAa/IQ,  nA  -»  nA/n0,Ea  -*•  Eaf 
(glonok~ln~1},  A®a  —  >  A^/(gIonQk~2H~]};  here  u%  =  ^I^n^/mH2  is  the  plasma  frequency, 
a—  mujp/hk2  is  the  non-abelian  parameter  and  ki  =  2iT/L  is  the  wave-vector  of  the 
lowest  mode. 

We  use  the  particle-in-cell  (PIC)  method  of  coloured  parton  simulations.  An  infinite  1- 
D  system  is  modelled  using  periodic  boundary  conditions.  The  whole  system  is  divided 
into  NG  cells  using  7VG+  1  grid  points  and  all  the  field  equations  like  electric  field, 
potential,  charge  density  etc.,  are  calculated  at  the  grid  points.  The  particles  are  initially 
positioned  at  the  centre  of  the  cells  and  each  particle  is  assigned  a  zero  initial  velocity 
(cold  parton  plasma  with  7  =  0)  and  3-components  of  colour  charge.  Particles  are 
initially  given  a  perturbation  in  colour  space  to  set  up  the  colour  charge  density  and  the 
associated  longitudinal  colour  fields.  Once  the  initial  conditions  are  given,  the  charge 
density  on  the  grid  points  are  calculated  using  a  weighting  scheme,  which  implies  some 
form  of  interpolation  among  the  grid  points  nearest  to  a.  particle.  We  use  a  quadratic 
spline  weighting  scheme  in  which  one  distributes  the  charge  of  a  particle  to  three  nearest 
grid  points  using  the  assignment  functions 
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where  X/s  are  the  grid  coordinates  and  x  is  the  coordinate  of  a  particle.  This  process  of 
distributing  the  charge  of  a  particle  to  nearest  three  grid  points  is  repeated  for  all  the 
particles  and  so  the  total  charge  density  at  a  particular  grid  point  is  the  sum  of  the 
contribution  from  all  particles  contained  within  a  width  3  A*  with  that  grid  point  at  the 
centre.  This  description  should  also  give  one  an  idea  of  how  the  'fat  particle'  concept, 
suppressing  the  short  range  'collisions!'  Coulomb  interactions  is  introduced  in  the 
simulations. 
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The  next  step  in  simulation  is  to  solve  the  Gauss  law  on  the  grid.  We  use  the  simplest 
central  difference  scheme  accurate  up  to  order  (Ax)2  for  this  purpose.  To  make  the 
average  field  zero,  we  calculate  (E)  and  subtract  it  from  each  Ej.  The  potentials  required 
for  solving  the  colour  dynamic  equation  are  also  calculated  in  this  step  by  using  a  similar 
central  difference  scheme.  After  the  electric  field  calculation,  the  forces  acting  on  the 
particles  are  calculated.  To  calculate  the  force,  one  needs  to  calculate  the  electric  fields  at 
the  particle  positions.  For  this  purpose,  we  use  a  similar  interpolation  scheme  as  before. 
This  is  important  because  it  can  be  shown  [29]  that  for  identical  charge  assignment  and 
force  interpolation  schemes  and  correctly  space-centred  difference  approximations  for 
derivatives,  the  self  force  on  a  particle  arising  due  to  various  particle  to  grid  and  grid  to 
particle  interpolations  is  zero  and  the  total  momentum  of  the  system  is  identically 
conserved.  In  the  next  step,  forces  calculated  on  the  particles  are  used  to  move  the 
particles  to  a  new  position  by  solving  the  equations  of  motion.  We  use  the  standard  leap- 
frog scheme  [30]  for  this  purpose.  The  last  step  in  the  computational  cycle  is  the 
calculation  of  colour  vector  for  each  particle  by  solving  the  colour  dynamic  equation.  We 
have  used  a  predictor-corrector  scheme  [see  e.g.  31]  for  this  purpose.  All  the  steps  used  in 
a  computation  cycle  are  summarised  in  the  box  diagram  shown  in  figure  3. 

We  now  report  the  main  results  of  our  simulations.  As  discussed  above,  we  initiate  the 
simulation  with  the  imposition  (at  t  =  0)  of  a  coherent  colour  charge  perturbation 
la  =  /no(l  +<5cos(fo:-|-<9a))  and  study  the  time  evolution  of  the  system  for  different 
values  of  the  non-abelian  parameter  a.  We  choose  1024  grid  points  in  a  system  of  length 
2?r  and  put  104  particles  on  it.  The  parameter  nAo  for  the  1-D  system  ~  13  (which 
roughly  corresponds  to  a  3-d  parameter  AD  ~  2  x  103),  We  use  relatively  small  values 
of  6  so  that  the  electric  field  fluctuations  induced  by  colour  charge  density  are  small;  this 
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Figure  4.    Field  energy  density  vs  time  for  non-abelian  parameter  a  =  0.5. 
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Figure  5.    Field  energy  density  vs  time  for  non-abelian  parameter  a  =  10.0. 


Figure  6.     Field  energy  density  vs  frequency  (power  spectrum)  for  non-abelian 
parameter  a  ='  10.0. 

is  done  so  that  the  effects  due  to  conventional  plasma  nonlinearity  (coming,  for  example 
from  the  convective  terms  in  hydrodynamic  equations)  are  negligible.  We  then  control  the 
strength  of  non-abelian  interactions  by  changing  the  parameter  a,  which  essentially 
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determines  the  rate  of  precession  of  the  colour  charge  vector.  It  should  be  emphasized  that 
in  an  actual  parton  plasma,  the  product  a<5  determines  the  rate  of  precession  of  the  colour 
vector  so  that  this  may  be  enhanced  by  changing  the  amplitude  of  initial  perturbation  also 
and  furthermore  the  non-abelian  and  plasma  norilinearities  both  operate  together, 
contributing  to  the  overall  collective  thermalization  process. 

For  OL—  0,  we  find  that  the  system  exibits  usual  plasma  oscillations  at  the  plasma 
frequency.  This  is  expected  since  the  non-abelian  physics  is  absent  and  conventional 
plasma  nonlinearities  are  negligible(as  explained  above).  Figure  4  shows  the  behaviour 
of  collective  oscillations  for  a.—  0.5;  we  have  chosen  field  energy  density  at  a  particular 
point  as  a  representative  variable  and  show  its  time  evolution.  The  system  continues  to 
be  coherent  and  exhibits  characteristic  'ear  phone'  oscillations  in  which  a  slower  fre- 
quency modulates  the  usual  plasma  oscillations  [9].  It  is  interesting  that  the  full  particle 
simulation  reproduces  a  behaviour  observed  in  simple  hydrodynamic  models  (figure  1). 

Figure  5  shows  the  characteristic  time  evolution  for  a=  10.0.  Now  the  field  energy 
density  shows  irregular  chaotic  oscillations  (similar  to  figure  2  in  §4).  To  examine  the 
nature  of  this  chaos  we  plot  the  power  spectrum  in  figure  6.  We  see  a  broadband  structure 
which  is  fairly  representative  of  chaotic  systems  but  this  by  itself  does  not  guarantee  that 
the  system  exhibits  sensitivity  to  initial  conditions  [32].  To  examine  this  question  we 
define  a  gauge  invariant  metric  as 


NG-l 

\|    ,'=0 

This  is  nothing  but  the  distance  between  two  points  in  a  NG-dimensional  space,  where 
each  point  represents  the  field  energy  configuration  of  the  whole  system.  A  similar  type 
of  definition  has  been  used  by  Miiller  et  al  [4]  and  Gong  et  al  [22]  to  exhibit  chaotic 
dynamics  in  a  pure  gluon  system.  In  figure  7  we  show  the  variation  of  ln(D(t))  versus 
time.  The  linear  rise  of  ln(D(t}}  as  a  function  of  t  shows  that  the  'distance'  between  two 
neighbouring  configurations  increases  exponentially  in  time.  The  saturation,  as  also 
observed  in  the  earlier  work  of  Mtiller  et  al  (1992),  is  because  of  the  compact  space  used 
for  description  of  fields  (viz.  finite  number  of  discrete  modes).  The  slope  of  the  linear  part 
gives  the  largest  Lyapunov  exponent  (h)  which  is  related  to  the  rate  of  entropy  rise 
[4,31,34].  The  inverse  of  the  Lyapunov  exponent  gives  an  estimate  of  the  entropy 
production  time  scale  or  the  thermalization  time.  In  our  case  the  estimated  thermalization 
time  turns  out  to  be  of  order  1.4  plasma  periods.  We  shall  discuss  the  notion  of 
thermalization  in  somewhat  more  detail  below. 

We  first  look  at  the  distribution  of  energy  in  the  various  Fourier  modes  after  the  elapse 
of  several  Lyapunov  exponent  times.  Figure  8  shows  the  Fourier  mode  energy  density  vs. 
k/ki.  We  see  that  energy  is  distributed  in  a  large  number  of  modes  although  initially  (at 
t  =  0)  we  had  pumped  energy  only  into  the  lowest  mode.  At  the  same  time  we  look  at  the 
phase  space  of  partons.  Compared  to  the  initial  cold  (T  =  0)  parton  distribution,  we 
observe  a  considerably  broadened  phase  space  (figure  9).  The  space  average  distribution 
of  particle  momenta  is  shown  in  figure  10.  It  shows  that  the  distribution  approaches  a 
Maxwellian  form  with  a  temperature  of  order  T/mc2  ~  10~4.  Interestingly,  we  also  find 
that  mean  energy  per  mode  is  of  order  T/2  as  would  be  expected  from  the  law  of 
equipartition  of  energy  in  a  thermalized  distribution. 


Figure  7.    In  (D(t)}  vs  time  for  non-abelian  parameter  a  =  10.0. 


10" 


10' 


§10-" 


10' 


10" 


50 


100 


150 
k/k 


200 


250 


300 


Figure  8.    Field  energy  density  vs  mode  number  for  non-abelian  parameter  a  =  10.0 
at  normalized  time  t/Tp  =  80. 


We  now  comment  on  various  aspects  of  the  observed  thermalization  process.  The  flow 
of  energy  from  low  k  to  high  k  values  is  due  to  non-abelian  interactions  and  is  expected  in 
a  classical  system  tending  towards  energy  equipartition.  The  Rayleigh-Jeans  distribution 


670 


Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  D) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


Collective  thermalization  in  QGP 


0.06 


0.04 


0.02 


o 


-0.02 


-0.04 


-0.06 


alpha=10.0 


s.    *:  •  ».*•• :.-  .."*?*/..'•  c.  •  ''/••'"'  ••'*< 

?.^ 


0          0.1          0.2         0.3         0.4         0.5         0.6         0.7         0.8         0.9  1 

x/L 

Figure  9.    Phase  space  of  partons  for  non-abelian  parameter  a  =  10.0  at  normalized 
time  t/Tp  =  80. 


Figure  10.    Normalized  distribution  function  for  non-abelian  parameter  a  =  10.0  at 
normalized  time  t/Tf  —  80. 


in  a  1-D  system  would  correspond  to  a  flat  energy  distribution  in  k  space.  We  have  not 
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realization  chosen  by  us.  The  average  'mean  energy'  per  mode  does  however  appear  to  be 
close  to  (1/2)7"  as  expected  in  a  thermalized  distribution. 

The  parton  distribution  function  however  appears  to  be  fully  thermalized,  having 
acquired  the  Maxwell-Boltzmann  form  shown  in  figure  10.  The  mechanism  by  which 
cold  plasma  particles  get  heated  up  is  related  to  'phase  mixing'  type  damping  of  the 
longitudinal  fields.  One  may  argue  that  when  the  initial  energy  is  distributed  into  a  large 
number  of  randomly  phased  Fourier  modes,  resonant  particle  effects  associated  with  the 
high  k  modes  (u/k  ~  Vth)  can  lead  to  randomization  of  particle  velocities.  The 
distribution  has  acquired  the  Maxwell-Boltzmann  rather  than  the  relativistic  Maxwell- 
Jiittner  form  because  we  have  chosen  rather  small  amplitudes  of  the  longitudinal  colour 
fields.  This  has  restricted  the  final  distribution  to  non-relativistic  temperatures.  This  was 
done  because  we  wanted  to  avoid  effects  due  to  plasma  nonlinearity  and  wanted  to  isolate 
the  physics  of  non-abelian  effects.  As  mentioned  earlier,  in  a  true  situation,  with  larger 
initial  fluctuations,  both  nonlinearities  would  contribute  to  phase  mixing  and 
thermalization  and  lead  to  Maxwell-Jiittner  distribution.  It  should  be  emphasised  that 
the  thermalized  parton  distribution  does  not  distinguish  between  quarks  and  short  scale 
gluons  and  both  will  acquire  Maxwell-Jiittner  type  distributions  relativistically.  This  is 
because  we  are  treating  the  short  scale  gluons  as  classical  distinguishable  particles  and 
hence  their  statistics  will  only  generate  the  most  probable  Maxwell-Jiittner  form  (rather 
than  the  Rayleigh-Jeans  form  which  is  the  H  — >  0  limit  of  the  Planck  distribution).  The 
essential  point  is  that  all  distributions  have  nearly  thermalized  and  taken  the  most 
probable  form,  within  the  constraints  of  the  model,  and  everything  happens  because  of 
collective  effects  within  a  few  Lyapunov  exponent  times. 

We  now  comment  on  the  thermalization  time  observed  by  us.  From  the  above 
discussion  it  appears  that  although  the  Lyapunov  exponent  time  is  only  about  1.4  plasma 
periods,  it  takes  several  plasma  periods  (12  to  13  in  our  specific  example)  to  see  complete 
thermalization.  We  suspect  that  this  may  be  because  of  the  special  initial  condition 
chosen  (a  coherent  longitudinal  field  plus  T •=  0  parton  distributions)  and  because  of  the 
rather  restricted  nature  of  interaction  in  a  1-D  plasma  with  a  finite  number  of  discretized 
modes.  If  the  system  exhibits  a  chaotic  attractor  (as  it  does  in  our  case)  the  time  taken  for 
complete  thermalization  can  depend  on  the  'distance'  between  the  initial  state  and  the 
final  thermalized  state.  We  expect  that  for  general  initial  realizations  and  for  a  system 
which  has  a  greater  number  of  degrees  of  freedom  either  because  of  three  dimensionality 
or  because  of  greater  number  of  particles/number  of  modes,  the  system  should  approach 
a  thermalized  state  very  soon  after  arriving  on  the  attractor.  We  thus  take  a  Lyapunov 
exponent  time  as  a  more  representative  'minimum  time  for  thermalization'  needed  by 
collective  effects  studied  by  us. 

In  the  final  section,  we  present  an  application  of  the  above  results  to  the  thermalization 
problem  in  relativistic  heavy  ion  collision. 


6.  Discussion 

The  main  emphasis  in  this  paper  has  been  on  collective  aspects  of  thermalization  in  a 
quark  gluon  plasma.  After  presenting  a  summary  of  the  earlier  work  of  Miiller  et  al  [4] 
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and  Gong  [22]  on  thermalization  in  pure  gluon  fields,  we  presented  a  detailed  survey  of 
our  work  on  collective  thermalization  in  classical  parton  models  [6,8].  The  basic 
conclusion  is  that  when  collective  effects  dominate  (typically,  when  AD  >  1),  the 
thermalization  time  is  of  order  of  a  plasma  period.  This  time  for  a  typical  relativistic 
heavy  ion  collision  is  of  order  1  fm/c. 

A  few  words  about  the  limits  of  validity  of  our  model  and  its  applicability  to  the  RHIC 
problem  are  in  order  here.  The  best  estimates  of  AD  for  typical  RHIC  parameters  give 
AD  ^  1  [2],  which  argues  that  short  range  collisional  effects  are  important  (giving  a 
thermalization  time  ~  1  fm/c)  and  collective  effects  may  be  ignorable.  However,  there  is 
uncertainity  in  these  estimates  and  our  calculations  show  that  even  when  AD  »  1  and 
collective  effects  dominate,  thermalization  occurs  on  a  time  scale  ~  1  fm/c.  This  is 
reassuring.  Furthermore,  when  AD  «  1,  the  plasma  can  go  into  a  strongly  coupled  state 
[33]  .  None  of  the  theoretical  calculations  (either  collective  or  collisional)  presently  take 
mis  into  account.  It  would  be  of  interest  to  investigate  thermalization  in  such  strongly 
correlated  plasmas.  Our  simulations  are  presently  restricted  to  SU(2)  group  symmetries, 
longitudinal  field  interactions  and  non-relativistic  amplitudes  only.  However,  we  suspect 
that  the  basic  result  that  thermalization  time  is  of  the  order  of  a  plasma  period  is  unlikely 
to  change  since  that  is  the  characteristic  time  associated  with  collective  effects.  Thus  we 
can  use  the  results  of  our  simulation  for  the  more  general  cases  also,  taking  care  to  define 
the  plasma  period  in  an  appropriate  manner.  The  above  estimate  of  1  fm/c  for  the  plasma 
period  has  been  obtained  in  this  manner  only.  We  may  wonder  whether  our  'periodic' 
simulations  are  truly  representative  of  the  finite  heavy  ion  collision  experiment.  In  a  way, 
we  have  restricted  the  maximum  size  of  the  system  by  defining  a  minimum  k  (or  the 
largest  wavelength)  in  the  system.  However,  our  results  are  relatively  independent  of  this 
parameter  as  long  as,  this  largest  wavelength  is  many  Debye  lengths  long.  Thus  we  expect 
that  in  the  range  of  dominance  of  collective  effects,  our  simulation  results  should  be 
representative. 

Finally,  a  word  of  caution  should  be  made  regarding  the  application  of  thermalization 
results  based  on  classical  chaos  theory  to  a  truly  quantal  problem.  We  hope  to  address 
these  and  related  questions  in  our  future  work. 
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Nonlinear  propagation  of  reiativistically  intense 
electromagnetic  waves  in  a  coilisionless  plasma 
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Abstract.  We  discuss  the  nonlinear  propagation  of  reiativistically  intense  electromagnetic  waves 
into  coilisionless  plasmas  with  special  emphasis  on  one  dimensional  plane  wave  solutions  of  the 
propagating,  standing  and  modulated  types.  These  solutions  exhibit  a  rich  variety  of  phenomena 
associated  with  relativistic  electron  mass  variation  and  coupling  between  transverse  electro- 
magnetic and  longitudinal  fields.  They  have  important  applications  to  problems  of  laser 
propagation,  self-focusing  in  overdense  plasmas,  particle  and  photon  acceleration  and  to 
electromagnetic  radiation  around  pulsars. 

Keywords.     Nonlinear  waves;  reiativistically  intense;  plasma  waves. 
PACS  Nos    52.35;  52.60 

1.  Introduction 

The  propagation  of  intense  electromagnetic  waves  in  a  coilisionless  plasma  is  a  problem 
of  considerable  interest  because  of  its  applications  to  laser  fusion,  modern  plasma  based 
particle  acceleration  methods,  laboratory  experiments  using  T"3  (table  top  terawatt)  lasers, 
low  frequency  electromagnetic  radiation  around  pulsars  etc.  In  this  paper  we  shall  present 
a  general  review  of  work  on  this  topic  with  special  emphasis  on  the  work  done  by  our 
group. 

In  a  weak  electromagnetic  wave  propagating  in  a  plasma,  electrons  jitter  under  the 
influence  of  the  oscillating  electric  field  vector  giving  a  shielding  current  which  is 
responsible  for  the  well  known  dielectric  constant  expression  e  =  1  —  Wp/cj2  where 
UJP  =  (4imQe2/rn)1'2  is  the  electron  plasma  frequency  (no  is  the  electron  number  density, 
m  is  the  electron  mass  and  e  is  the  electron  charge).  Since  the  electromagnetic  waves  are 
transverse  in  nature,  the  electron  motion  is  normal  to  the  propagation  vector;  this  prevents 
the  convective  fluid  nonlinearities,  entering  through  the  (v-V)  terms  from  becoming 
effective  for  the  electromagnetic  wave.  However,  as  the  directed  jitter  electron  velocity 
v  ~  eEi/rnw  becomes  comparable  to  the  velocity  of  light  c,  nonlinear  effects  enter 
through  relativistic  mass  variation  of  the  electrons  and  the  v  x  B  forces  where  B  is  the 
magnetic  field  associated  with  the  electromagnetic  wave.  The  v  x  B  forces  drive  the 
electrons  along  the  propagation  direction  and  give  the  possibility  of  coupling  to  longi- 
tudinal plasma  oscillations.  A  reiativistically  intense  electromagnetic  wave  (v/c  ~  1)  is 
therefore  always  nonlinear  and  in  general  a  mixed  wave,  with  significant  space  charge 
fluctuations  associated  with  it.  It  is  only  for  highly  specialized  polarizations,  such  as 
circular,  where  the  v  x  B  forces  exactly  cancel  that  we  have  a  pure  nonlinear 
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electromagnetic  wave.  The  possibility  of  exciting  space  charge  fields  propagating 
nearly  at  the  velocity  of  light  (viz.  the  group  velocity  of  an  electromagnetic  pulse)  is 
the  key  to  modern  plasma  based  particle  acceleration  schemes,  which  are  currently 
undergoing  intense  experimental  investigation.  We  may  also  physically  anticipate 
other  interesting  consequences  of  working  with  relativistically  intense  electromagnetic 
waves.  It  is  obvious  that  as  the  electron  jitter  velocity  approaches  c,  the  effective  mass 
of  the  electrons  will  increase.  This  would  encourage  penetration  of  intense  electro- 
magnetic waves  into  normally  overdense  plasmas,  i.e.,  wp  >  uj  >  wpeff.  Similarly,  if 
we  send  a  finite  plane  wave  front  with  an  intensity  variation  on  it,  the  intense  regions 
will  propagate  slower  than  the  weakly  intense  regions,  because  c/(l  —  uj^/uj2)1'2  <  c/ 
(1  —  u;2/^2)1/2);  this  causes  the  plane  wave  front  to  acquire  curvature  and  leads  to  self- 
focusing  and  filameniation  of  the  intense  electromagnetic  wave.  We  will  consider 
examples  of  all  the  above  physical  effects  in  our  subsequent  discussions. 

Our  paper  is  organized  as  follows.  We  first  discuss  the  basic  equations  of  the  plasma 
model  and  its  range  of  validity.  We  next  investigate  some  special  solutions  of  the  full 
nonlinear  set  of  equations.  These  solutions  assume  the  existence  of  nonlinear  stationary 
plane  waves  and  reduce  the  problem  to  a  set  of  coupled  ordinary  differential  equations. 
We  then  go  on  to  discuss  some  exact  solutions  for  circularly  polarized  waves  and 
numerical/approximate  analytical  solutions  for  linearly  polarized  waves.  Following 
Akhiezer  and  Polovin  [1]  we  next  show  how  the  linearly  polarized  waves  may  be  reduced 
to  the  problem  of  a  Hamiltonian  with  two  degrees  of  freedom  and  describe  a  few 
analytical  results  obtained  by  them  for  high  phase  velocity  waves  and  waves  close  to  c. 
We  then  present  some  of  our  own  numerical  results  of  the  problem,  which  for  the  first 
time  showed  the  rich  variety  of  possible  solutions  [2].  The  appropriate  Hamiltonian 
involves  square  roots  because  of  relativistic  effects  (unlike  most  commonly  studied 
nonlinear  Hamiltonians  which  tend  to  involve  polynomials).  The  results  of  these 
investigations  are  therefore  of  general  interest  to  nonlinear  dynamics.  Since  the  numerical 
results  indicated  an  almost  integrable  system  there  was  widespread  interest  for  a  while  to 
obtain  an  analytical  confirmation  of  integrability.  It  came  as  somewhat  of  a  surprise  when 
it  was  actually  proved  to  be  nonintegrable  [3].  In  §4,  we  discuss  the  extension  of  these 
results  to  standing  nonlinear  waves.  The  next  section  is  devoted  to  modulated  nonlinear 
waves  in  which  solutions  are  not  stationary  in  a  moving  frame.  This  allows  us  to  consider 
the  possible  excitation  of  one  dimensional  solitons  in  the  coupled  electromagneticwave- 
plasma  wave  problem.  Many  features  of  the  numerical  solutions  of  this  soliton  problem 
can  be  reproduced  analytically  using  a  WKB  method.  In  the  last  section  we  summarise 
the  applications  of  the  above  results  to  problems  of  laser  propagation  and  self-focusing  in 
overdense  plasmas,  the  problems  of  particle  and  photon  acceleration  and  to 
electromagneticfields  around  rotating  pulsars.  We  also  discuss  questions  related  to 
stability  of  these  solutions,  the  importance  of  multidimensional  effects  and  the  conditions 
under  which  movement  of  ions  can  significantly  alter  the  conclusions. 


2.  Plasma  model  and  basic  equations 

Consider  a  finite  amplitude  electromagnetic  wave  propagating  along  the  z  axis.  If  the 
wave  amplitude  is  sufficiently  large  so  that  eE/muc  >  1,  the  electrons  are  accelerated  to 

Pramana  -  /.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  IT) 


the  speed  of  light  in  a  fraction  of  the  wave  period.  Two  nonlinear  effects  immediately 
become  important,  namely,  the  relativistic  variation  of  the  electron  mass  and  the  excitation 
of  longitudinal  space  charge  fields  by  strong  v  x  B  forces  driving  electrons  along  the 
direction  of  propagation.  On  the  time  scale  of  the  electron  motion  the  ions  are  virtually 
immobile  due  to  their  larger  mass  and  hence  their  dynamics  can  be  ignored.  Of  course,  at 
larger  electric  field  strengths,  this  assumption  can  break  down  particularly  when  the  ions 
also  acquire  large  induced  directed  velocities.  Thus  one  is  restricting  oneself  to  wave 
amplitudes  where  the  electron  directed  velocity  is  large  but  the  ion  directed  velocity  is 
still  small  (eE/Mu>  ~  O.lc  where  M  is  the  ion  mass).  Likewise  the  electrons  can  be 
treated  as  a  cold  fluid  since  their  induced  velocity  is  close  to  the  speed  of  light  and 
therefore  the  electron  thermal  velocity  can  be  neglected  in  comparison.  The  basic 
nonlinear  propagation  equations  for  the  coupled  electromagnetic  and  longitudinal  waves 
can  then  be  written  down  in  terms  of  the  following  set  of  relativistic  cold  fluid  equations. 

c\,     3  \      / 

«o),  (2) 

1  #E  ,„, 

vx±i  = «v  +  -^7>  3 

c  c  at 

V-B  =  0,  (4) 

f^f  ' 

df>  ,  ,    ^x  /„     v  x  B 


Q,     <    \w     *  /r     -       ~\ —    i  _         p  l°/ 

where  E,  B  are  the  electric  and  magnetic  field  components  of  the  wave,  v  is  the  fluid 
velocity,  n  is  the  density  and  p  =  wivy  is  the  relativistic  momentum  (with 
7  =  l/\/l  —  v2/c2).  Equations  (1-4)  are  the  full  set  of  Maxwell's  equations,  (5)  is  the 
continuity  equation  and  (6)  is  the  relativistic  fluid  momentum  equation.  These  model 
equations  were  first  written  down  by  Akhiezer  and  Polovin  [1]  and  have  formed  the  basis 
of  most  fundamental  studies  related  to  the  propagation  of  intense  electromagnetic  waves 
in  a  plasma.  While  a  fully  time  dependent  solution  of  these  coupled  nonlinear  partial 
differential  set  of  equations  is  difficult  to  obtain  analytically  (or  even  handle 
numerically),  various  special  solutions  have  been  obtained  from  time  to  time,  A  large 
class  of  them  pertain  to  so  called  one  dimensional  traveling  wave  solutions,  which  we 
discuss  next. 


3.  Traveling  wave  solutions 

We  restrict  ourselves  to  one  dimension  (z  the  direction  of  propagation)  and  look  for 
solutions  which  are  functions  of  the  independent  variable  (,=  z  —  ut.  Such  waves,  if  they 
exist,  would  be  nonlinear  waves  that  are  stationary  in  a  frame  moving  with  the  velocity  u. 
We  adopt  the  following  normalisations, 

P=a,  P=JL;  (=!*(,_*). 

c  me  c 
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Then  combining  (1-6)  and  eliminating  the  electric  and  magnetic  fields,  we  obtain  the 
following  set  of  coupled  differential  equations: 

"k^=0'  (7) 

'n-o -0,  (8) 


The  coupled  equations  above  are  still  difficult  to  solve  analytically  but  some  special  exact 
solutions  exist. 


3.1  Pure  transverse  waves 

If  we  set  pz  =  0,  then  for  bounded  solutions  (9)  gives  p2  —  constant.  Equations  (7, 8)  then 
admit  the  solution 

(10) 

(11) 

where  uj  =  ujpj3(f32  —  \}"l^(\  +  p2)~1//4.  The  wave  phase  velocity  u  can  be  expressed  in 
terms  of  /?  as 

u  =  j3c  =  ce~l/2, 
where  e,  the  dielectric  constant  of  the  plasma,  is  given  by 

£=i-^(i+p2r'/2.        .       •  (12) 

Substituting  for  p  in  the  Maxwell's  equations,  it  can  be  shown  that 

(.3) 


so  that 

p2  =  (e2E2/mW], 
where  E%  =  E2  +  E2.  Combining  these  results  one  can  now  see  that 

-1/2 
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Equations  (13)  and  (14)  indicate  that  only  circularly  polarized  modes  may  propagate  as 
pure  transverse  waves  in  the  plasma.  Equation  (15)  demonstrates  that  such  waves  can 
propagate  into  overdense  plasmas  provided  their  frequencies  lie  in  the  range 

_2r-2 

(1    tLn 


We  will  return  to  the  physical  implications  of  this  effect  in  the  final  section. 

3.2  Pure  longitudinal  waves 

For  px  =  py  =  0,  (7,  8)  are  identically  satisfied  and  pz  =  p  in  (9).  This  can  be  integrated 
exactly  to  yield  a  periodic  solution  which  is  expressible  in  terms  of  elliptic  functions.  The 
frequency  of  the  wave  is  given  by 


•^P 


TT   /-    da    Ai-a2)i/2   y1/2' 

M   _^5/4  I  /,  _      o^/2 


(16) 


where  am  is  related  to  the  electric  field  amplitude  through  Maxwell  equations  and  is 
given  by 

/  \  -1/2 

/I  1  \ 

eE=V2m(Jjpc\  -  ^  --  r^)  .  (17) 

V(i-^)1/2    (i-«2)1/2V 

In  the  limit  am  <C  1,  these  expressions  can  be  simplified  to 

W  =  Up(l--feo?m) 

with  am  =  (eEo/muipc}  where  EQ  is  the  maximum  value  of  E.  For  am  —+  1,  one  gets 

W«2-3/zTOp(l-aJ,)1/< 
where  eEo  =  \/2mWpc(l  —  o;^)"1'4  so  that 


/    j 


eEo 


In  both  these  limits  one  observes  that  pure  longitudinal  waves  with  frequencies  less  than 
o;p  can  propagate  in  the  plasma  provided  the  electric  fields  associated  with  them  are 
sufficiently  high. 

3.3  Coupled  longitudinal-transverse  waves 

For  this  general  case  it  is  not  possible  to  obtain  exact  solutions  but  some  interesting 
analytical  and  numerical  solutions  have  been  obtained  in  limiting  cases.  It  is  also 
convenient  in  some  cases  to  restrict  oneself  to  plane  polarized  waves  such  that  py  =  0. 
One  interesting  limit,  which  was  first  pointed  out  by  Akhiezer  and  Polovin  [1]  is  that  of 
(3  —  »  co.  This  corresponds  to  an  almost  transverse  wave  with  pz  —  >  0.  In  this  case,-  (7,  9) 
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can  be  simplified  to 


Equations  (18,  19)  are  exactly  integrable  and  yield  periodic  solutions  with  amplitudes 
oscillating  between  two  limits  which  are  determined  by  the  initial  values  of  the  two 
constants  of  motion  that  the  system  permits.  The  other  interesting  limit  is  /3  —  »•  1,  i.e., 
/3  -  1  <C  1.  In  this  case  one  can  redefine  the  independent  variable  as  r\  =  (fl2  —  I)"1'2 
and  rewrite  (7  —  9)  as 

£*   -  =  0,  (20) 


dr?2      (i+p2)i 

tfpy   ,  Py 


fc-<1+^+ciTr*  (22) 

where  ]3  —  1  <C  1  has  been  used.  In  (22)  the  last  term  can  be  neglected  to  yield 
(1  +  p2)1/2  —  pz  =  C2  where  C  is  a  constant.  One  then  obtains  the  following  periodic 
solutions 


px  =  Ax  cos(ur)};     py  =  Ay  & 

A2  -A2 
pz  =  x        y 


where 


and  the  amplitudes  Ax  and  Ay  are  related  to  C  by  A2.  +  A2,  =  2(C4  -  1).  Kaw  and  Dawson 
[4]  carried  out  a  detailed  numerical  investigation  of  the  original  equations  (7,  9)  and 
demonstrated  the  existence  of  periodic  propagating  solutions.  They  also  confirmed  that 
when  /3  3>  1  (phase  velocity  »  velocity  of  light),  the  waves  are  almost  transverse. 
Furthennore,  for  strongly  relativistic  regime  7  vs  £  (where  7  is  the  current  density)  is  in 
the  nature  of  a  square  wave  and  correspondingly  the  electric  field  E  as  a  function  of  C  has 
a  saw-tooth  shape.  Physically  this  is  expected  because  in  a  strongly  relativistic  regime, 
vx  —  >  c  in  most  of  the  period  and  so  Jx  —  >•  -n0ec,  giving  the  square  wave  form.  One  may 
now  give  a  simple  interpretation  of  propagation  of  strongly  relativistic  waves  into 
overdense  plasmas.  Normally,  the  conduction  current  generated  by  the  electron  response 
to  the  wave  fields  acts  as  a  shielding  current  giving  the  plasma  a  dielectric  behaviour  and 
preventing  propagation  of  waves  into  overdense  regions.  Now  since  the  shielding  current 
saturates  in  the  strongly  relativistic  regime,  the  waves  can  penetrate  into  overdense 
plasmas.  Relativistic  effects  thus  diminish  the  ability  of  the  plasma  to  act  as  a  dielectric. 


7^l      AO     XT« 


An  analytic  description  of  the  numerical  results  of  Kaw  and  Dawson  [4]  was  obtained  by 
Max  and  Perkins  [5]  who  also  derived  the  propagation  condition  in  the  strongly 
relativistic  regime  as, 


/?       o;2  2  u,  eE        ' 

Max  and  Perkins  [5]  were  also  the  first  ones  to  extend  the  concept  of  these  solutions  to 
propagation  into  weakly  inhomogeneous  plasmas.  They  used  a  WKB  like  condition  for 
constancy  of  energy  flux  for  waves  propagating  into  reflectionless  inhomogeneous 
plasma  and  pointed  out  that  indefinite  penetration  into  overdense  plasmas  is  possible  if 
the  following  condition  is  satisfied, 


2 

I  >  ui 

c  J        me         p 

where  £,  denotes  the  electric  field  strength  in  a  vacuum  and  L  is  the  density  scale  length. 
Physically  as  the  wave  penetrates  to  a  higher  density,  there  is  a  WKB  amplification  of  the 
electric  fields  which  decreases  the  effective  plasma  frequency  (due  to  increased  effective 
mass)  faster  than  its  increase  due  to  density  changes.  The  above  condition  on  L  arises  in 
ensuring  that  the  WKB  condition,  viz.  scale  length  of  density  variation  »  the  wavelength 
of  the  wave,  continues  to  be  satisfied  indefinitely.  The  notion  of  indefinite  wave 
propagation  into  overdense  plasmas  has  some  interesting  physical  applications  in 
astrophysics  particularly  to  radiation  occuring  in  pulsars.  Finally  we  would  like  to 
mention  the  article  of  DeCoster  [6]  in  which  a  lot  of  the  early  work  related  to  traveling 
wave  solutions  has  been  summarised.  He  has  also  carried  out  a  weak  coupling  analysis 
and  pointed  out  (without  any  amplification  or  discussion)  the  existence  of  nonlinear 
resonances  whenever  2m  =  n(l  —  1//32)1'2. 

3.4  The  nonlinear  Hamiltonian 

For  a  more  general  overview  of  the  linearly  polarized  traveling  wave  solutions  we 
introduce  the  variables, 


and  rewrite  (7,  9)  in  the  following  form 

X  +  P     -  x  =  0 

V//32-l+X2+Z2 

Z+     ,  ^  Z+l=0.  (25) 

^    ' 


where  the  overdots  refer  to  differentiation  with  respect  to  £.  These  equations  can  be 
derived  from  the  Hamiltonian, 


Z  (26) 

which  is  a  constant  of  motion.  The  coupled  stationary  problem  is  thus  equivalent  to  that 
of  the  motion  of  a  fictitious  particle  in  a  two  dimensional  potential.  Note  that  f  or  /3  >  1, 
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the  Hamiltonian  is  positive  definite  in  the  entire  X  —  Z  plane  and  will  yield  bound 
solutions  of  X  and  Z.  For  f3  <  1,  this  property  is  destroyed  and  the  possibility  of 
unbounded  runaway  solutions  exist  [7]. 

The  Hamiltonian  (26)  has  one  equilibrium  point  (X  =  0,  Z  —  —  1)  where  H  —  Hmin  = 
01  —  1.  Physically  this  corresponds  to  pz  =  px  =  pz  =  px  =  0,  i.e.,  to  the  equilibrium 
plasma  with  no  oscillatory  motions.  Linearisation  of  (24)  and  (25)  around  the  equilibrium 
points,  gives 


6X  +  X  =  0, 


(27) 
(28) 


where  z  =  Z  +  1.  The  'frequency'  (1  —  I//?2)1'2  corresponds  to  longitudinal  oscillations 
at  the  plasma  frequency,  as  can  be  verified  by  going  back  to  the  unnormalised  variables. 
Similarly  the  'unit'  frequency  of  'X'  oscillation  corresponds  to  linear  electromagnetic 
waves.  The  limit  X  =  X  —  0  corresponds  to  nonlinear  longitudinal  oscillations  whose 
exact  form  has  been  discussed  in  an  earlier  section. 

To  explore  the  full  range  of  nonlinear  solutions,  we  carried  out  an  extensive  numerical 
investigation  of  (24,  25)  using  the  Poincare  surface  of  section  technique.  Basically,  we 
solved  for  the  'particle  orbits'  of  the  Hamiltonian  (26),  and  looked  at  the  successive 
crossings  of  the  orbits  of  the  plane  X  =  0  (with  X  >  0)  for  a  given  set  of  values  of  H  and 
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Figure  1.     Poincare  surface  of  section  plot,  Z  versus  Z  (X  =  Q,X>Q)>  for  the 
Hamiltonian  with  ft  =  1.1,  fa  =  0.7  and  H  =  3.0 
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Nonlinear  propagation  in  a  collisionless  plasma 
(a)   5 


800 


800 


Figure  2.  (a)  A  typical  soliton  solution  of  the  Z  oscillator  corresponding  to 
H  =  2,(3=l.3  and  /?b  =  0.8,  where  the  separatrix  is  between  (2.94,0)  and 
(—3.305, 0).  (b)  A  typical  'time'  orbit  curve  of  the  X  oscillator  for  the  corresponding 
Z  soliton. 


(3.  Each  orbit  corresponds  to  a  solution  of  the  coupled  equations.  In  general  we  saw  the 
following  kind  of  orbits:  (i)  Fixed  points  which  map  back  on  themselves  after  a  finite 
number  of  iterations.  These  correspond  to  periodic  orbits  in  which  the  ratio  of  the 
frequencies  of  the  two  oscillators  is  a  rational  number,  (ii)  Quasi-periodic  orbits  which 
ergodically  fill  a  curve  in  the  Z  —  Z  plane  and  for  which  the  ratio  of  the  frequencies  is  an 
irrational  number,  (iii)  Islands  surrounding  a  fixed  point  which  correspond  to  amplitude 
modulated  waves,  (iv)  Separatrix  orbits  between  chains  of  islands  which  correspond  to 
amplitude  modulated  waves  with  the  modulation  period  being  infinite.  Physically  these 
correspond  to  soliton  solutions.  Another  possibility  which  exists  in  principle  (but  which 
we  did  not  find  in  our  early  numerical  work)  is  stochastic  orbits  for  which  the  orbit 
ergodically  fills  an  area  in  space.  These  would  correspond  to  aperiodic  orbits  which  are 
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Figure  3.    Chaotic  orbits  in  Poincare   surface   of  section  plot,  Z  versus  Z 
(X  =  0,X>  0),  for  the  Hamiltonian  with  (3  =  1.21,  fa  =  0.7  and  #  =  10.0 

highly  sensitive  to  initial  conditions.  The  absence  of  numerical  evidence  for  stochastic 
orbits  and  the  results  of  local  linear  analysis  of  the  trajectories  led  us  to  conjecture  that 
perhaps  the  Hamiltonian  was  integrable.  The  analytic  structure  of  the  Hamiltonian, 
particularly  the  presence  of  the  squareroot  term  in  the  potential,  posed  a  challenge  for 
application  of  standard  integrability  tests  such  as  the  Painleve  criterion.  There  was  thus 
widespread  interest  for  a  while  to  provide  an  analytic  proof  of  its  integrability.  It  came  as 
somewhat  of  a  surprise  when  Grammaticos,  Ramani  and  Yoshida  [3]  proved  that  it  was 
nonintegrable.  Their  proof  was  based  on  a  clever  application  of  Ziglin's  theorem. 
Subsequently  numerical  evidence  of  stochastic  orbits  was  also  found  by  Romeiras  [8].  In 
a  recent  work,'  Bisai,  Sen  and  Jain  [9]  have  carried  out  further  detailed  studies  on  a  more 
generalised  form  of  the  Hamiltonian  (26),  namely 

_  *2    ^2  rr ~ 

22 

In  this  the  plasma  is  assumed  to  have  an  equilibrium  velocity  Vb  (beam  velocity)  which  is 
represented  by  the  normalised  quantity  fa.  Figure  1  shows  a  typical  Poincare  plot  of 
orbits  obtained  from  (29)  and  displays  the  different  kind  of  solutions  including  the 
'island'  orbits.  Figures  (2a,b)  display  the  'soliton'  orbits  corresponding  to  the 
separatrices  on  the  Poincare  plots.  In  figure  3  we  see  the  presence  of  chaotic  orbits  in 
the  Poincare  plot.  Note  that  for  /?  =  I/ fa,  the  Hamiltonian  is  integrable,  since  it  admits 
L  =  ZJ£  -  XZ  as  another  invariant  (which  is  in  involution  to  H,  i.e.,  [H,  L]  =  0).  This  is 
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solutions  on  the  Poincare  surface)  to  provide  a  detailed  theoretical  basis  for  the  earlier 
conjectures  of  DeCoster  [6]  regarding  the  nonlinear  resonances. 

4.  Relativistically  intense  standing  waves 

Marburger  and  Tooper  [10]  and  Lai  [11]  found  a  new  class  of  solutions  to  the  Akhiezer- 
Polovin  equations  (1-6),  which  correspond  to  relativistically  intense  standing  waves  in  an 
overdense  plasma.  They  introduced  a  new  ansatz,  namely,  E(z,  t)  =  F(z)f(t)  and 
restricted  themselves  to  circular  polarizations  only.  The  conservation  of  canonical 
momentum  in  the  transverse  direction  then  led  to  the  condition  p^  +  eA/c  —  0  and  the 
longitudinal  momentum  pz  was  found  to  vanish.  Here  the  vector  potential  A  follows  the 
ansatz 

eA. 


where  £  =  uz/c  -and  F  satisfies  the  equation 


/. 
(1+F2)1/2 


The  electron  density  n  and  hence  w2  is  bunched  in  the  longitudinal  direction  because  the 
standing  wave  produces  ponderomotive  forces  which  push  the  electrons  away  from  the 
field  maxima.  Since  ions  are  assumed  to  be  frozen  on  the  time  scales  of  interest,  the 
electrons  are  held  in  balance  by  generation  of  static  longitudinal  fields  4>(z)  = 
(me2  /£)</>(£)  where  <j)  satisfies  the  equation 


2^          ( i.?-.  —  i,?\          /^2 

'  1/2.  (31) 


d2</>  _  ( ^po  ~  ^P  .  _ 


dC2       \      w2       /      dC2 

One  may  now  eliminate  w2  from  (30)  and  reduce  it  to  that  of  a  nonlinear  one  dimensional 
oscillator  governed  by  the  Hamiltonian 

r>2 

1/2  ^ 


where  P  =  M(F)(dF/dC),M(F)  =  (1  +  F2)"1  and  a  =  o^0/o;2.  For  overdense  plasmas 
(o.  >  1),  this  leads  to  three  types  of  nonlinear  standing  waves,  viz, 

(1)  oscillations  about  zero  when  the  maximum  field  Fm  exceeds  a  critical  value 
Fr  =  2(a2-a)1/2; 

(2)  oscillations  about  a  bias  field  FB  =  (a2  -  1)1/2  when  Fm  <  FT  and 

(3)  motion  with  an  infinite  period  (along  the  separatrix)  when  Fm  =  FT. 

For  Fm  <  Fa  =  2  (a2  —  a)1/2,  the  equations  for  F  may  be  integrated  explicitly  in  terms  of 
Jacobian  elliptic  functions.  Thus,  when  FT  <  Fm  <  Fa,  the  quartic  period  is  <"P  =  K(m)h 
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where  K  is  the  complete  elliptic  integral  of  the  first  kind,  m  =  (em  —  l)(2o;  +  1  —  em)f 
4(em  —  a),  72  =  €,„  -  a,  e2  =  1  +  F2  and  the  field  amplitude  is  given  by 


[2+(<EWI-l>n2(7C)r 
For  the  important  nonperiodic  limit,  Fm  =  FT,  m=  I  and 

no  = 


[1-fasinhVO]  ' 

1/2. 

An  important  physical  effect  which  has  to  be  carefully  considered  is  the  possibility  of 
complete  depletion  of  electrons  from  a  part  of  the  region  occupied  by  the  standing  waves. 
The  condition  for  complete  vanishing  of  the  electron  density  may  be  written  down  as 


If  this  condition  is  satisfied  somewhere,  part  of  the  standing  wave  is  in  a  vaccuum  cavity; 
appropriate  solutions  are  then  obtained  inside  the  cavity  and  their  logarithmic  derivatives 
are  matched  across  the  depletion  boundary.  One  can  show  that  for  Fm  <  Fj  — 
{(a/2)  [a  +  (4  +  a2)1'2]}1'2  complete  electron  depletion  does  not  take  place  anywhere 
and  the  above  simple  analytic  solutions  are  valid.  Elsewhere,  the  solutions  have  to  be 
obtained  with  greater  care. 

The  most  interesting  application  of  these  solutions  is  in  estimating  the  properties  of 
standing  waves  in  inhornogeneous  plasmas.  Using  the  WKB  condition  that  the  phase 
integral  /  =  §  PdF  is  an  invariant  for  propagation  into  an  inhornogeneous  plasma  with 
a  =  «(£),  one  finds  the  condition 

/0  =  TrF2  =  IT  =  Basin'1  [(a  -  l)/a]I/2  -  8(a  -  1)1/2. 

Here  FQ  is  the  vacuum  field  strength  and  IT,  the  phase  integral  corresponding  to  the 
turning  point,  is  obtained  by  using  the  solution  for  the  case  Fm  =  FT  (since  in  this  case 
the  period  Cp  —  »  oo).  Note  that  in  contrast  to  the  linear  WKB  solution,  the  wave  amplitude 
remains  finite  at  the  turning  point:  FT  =  2(a2  -  a)1/2.  The  above  condition  may  be  used 
to  find  out  the  maximum  density  upto  which  an  incident  wave  with  field  amplitude  FQ 
may  penetrate. 

5.  Modulated  nonlinear  waves 

In  this  section,  we  will  discuss  another  class  of  exact  one  dimensional  nonlinear  solutions 
of  the  relativistic  cold  plasma  equations  which  represent  envelope  solitons  of  light 
waves  [12,  13],  in  which  the  modulation  envelope  propagates  as  a  large  amplitude  plasma 
wave  in  the  medium.  These  solutions  are  a  step  beyond  the  well  known  traveling  wave 
solutions  in  a  cold  plasma  because  the  envelope~and  the  phase  propagate  with  different 
speeds  so  that  the  nonlinear  relationships  between  the  phase  and  group  velocities  can  be 
investigated.  For  simplicity  we  again  restrict  our  analysis  to  circularly  polarized  waves 
which  couple  to  longitudinal  disturbances  only  because  the  amplitude  is  modulated. 


We  start  as  usual  with  the  relativistic  set  of  fluid  equations  (1-6  for  a  cold  plasma  (in 
one  dimension)  and  introduce  the  change  of  variables  x  ~  fit  =  £,  t  —  r.  Instead  of  the 
fluid  momenta  (p)  we  now  work  with  A  (the  vector  potential)  and  <p  (the  electrostatic 
potential)  which  are  normalised  as  eA/mc2,  efi/mc1.  Time  is  normalised  as  upt  and  space 
by  ujpX/c,  where  the  direction  of  propagation  is  taken  as  x.  We  now  make  the  key  ansatz 
that  in  the  moving  frame,  the  vector  potential  is  circularly  polarized  and  has  a  sinusoidal 
phase  variation,  i.e.,  A  =  [{ya(£) +zia(£)}zxp(—iXT)  +  c.c].  The  introduction  of  a 
frequency  parameter  A  in  the  phase  factor  is  the  basic  deviation  from  earlier  stationary 
wave  solutions;  this  allows  us  to  identify  0  with  the  group  velocity  of  the  light  pulse  and 
also  to  distinguish  between  the  group  and  phase  speeds.  The  choice  of  circular 
polarization  allows  us  to  avoid  the  generation  of  harmonics  in  all  wave  fields.  For  plasma 
oscillations,  which  form  the  modulation  envelope,  this  leads  to  d/dr  =  0  in  the  moving 
frame.  Integrating  the  fluid  equations  we  then  get  n((3  —  u)  —  j3  and  7(1  —  flu)  —  4>  —  1. 
Using  these  relations  and  further  writing  a(£)  =  R&\p(iO),  we  can  reduce  the  Poisson's 
equation  and  the  wave  equation  to  obtain  the  following  set  of  coupled  nonlinear  equations 

,n  _         u  /oo\ 

{,/         ,  ~~  l  _J  _J   I 

with  M  given  by 


u  = - — 

( 

and 

R   +  i  _  Qi    I  ^   ~  ~R*  )  1  -  32~  3  -u   1+0|°'  ^35^ 

where  M  —  R2{(1  —  @2)0'  —  A/3},  is  a  constant  of  integration.  In" our  representation  the 
amplitude  R  =  (A2  +  A2)1'2  of  the  circularly  polarized  electromagnetic  wave  exhibits  a 
modulation  propagating  at  the  group  speed  0.  The  question  of  the  definition  of  phase 
speed  is  a  little  more  complex.  For  small  amplitudes  R  is  always  positive  and  the 
exponential  factor  contains  the  entire  phase  information.  The  phase  speed  may  then  be 
shown  to  be  simply  1/0  satisfying  the  conventional  relation  VPVS  =  1.  However,  for 
arbitrary  amplitudes  R  oscillates  because  of  the  strong  R  —  <j>  coupling  and  the  phase 
speed  has  to  be  defined  by  directly  determining  the  z,t  variation  of  R  cos (6  — XT). 
Equations  (33-35)  admit  of  one  exact  integral  of  motion  which  may  be  written  as 


'),  (36) 

where 

A2       R2  M2  <t> 


(1-/32)2  2      2^(1  -/32)2      (I-/?2)      • 

}1/2].      (37) 


Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  H) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences"  687 


P  K  Kaw  and  A  Sen 


Since  '/3  <  I  (group  velocity  less  than  c),  the  problem  is  similar  to  a  Hamiltonian  of 
coupled  anharmonic  oscillators  with  two  degrees  of  freedom  (R,  <p}  where  the  effective 
mass  for  one  of  the  anharmonic  oscillators  is  negative.  In  the  limit  of  weak  density 
response,  the  problem  can  be  made  one  dimensional  by  the  substitutions 
<f)  »  (1  +  R2}1/2  -  1  and  n  w  (>/l  -f/?2)".  Expanding  in  R2  and  taking  M  =  0,  K  =  0 
we  get  the  well  known  envelope  soliton  solution 


Ay  —  Rm  seen 


cos 


_ 


where 


4A2-(l-/?2)(3-h/32)' 


(38) 


(39) 


K  7^  0  gives  modulated  periodic  wave  train  solutions  with  the  envelope  factor  described 
by  an  elliptic  function.  Equation  (38)  represents  an  isolated  light  pulse  with  a  frequency 
LJ  =  A/(l  -  /32},  wavenumber  k  =  X/3/(l  -  /?2),  phase  velocity  I//?,  group  velocity  (3 
(note  VpVg  =  c2),  an  envelope  scale  length  (in  units  of  c/wp)  equal  to  (1  —  P2)/ 


—  ff2  —  A2)  and  an  amplitude-group  velocity  relationship  given  by  (40).  Note  that 
the  envelope  scale  length  is  real  only  when  A2  <  (1  -  /?2).  This  may  be  physically 
interpreted  as  follows.  The  wave-frequency  in  the  frame  moving  with  the  group  velocity 
has  the  Doppler  shifted  value  Co  =  (u  -  k/3)/^/l  -  j32  =  A/-y/l  -/32.  Thus  the  inequal- 
ity A  <  ^/\  —  (31  corresponds  to  the  situation  when  the  Doppler  shifted  wave  frequency 
cD/Wp  <  1,  i.e.  the  electromagnetic  wave  finds  itself  in  an  overdense  plasma  and  is  totally 
trapped;  this  is  why  we  get  a  soliton  solution  and  the  wave  does  not  leak  out.  The 
effective  scale-length  for  trapping  may  now  be  seen  to  be  \A  ~  frVl  \/u2  ~  ^2]  which 
is  quite  reasonable.  Similarly  the  amplitude-group  velocity  relationship  may  be  written  in 
terms  of  physical  parameters  as 


-        ~  4 


(40) 


For  arbitrary  amplitudes  we  have  numerically  solved  equations  (33-35)  for  soliton 
solutions.  For  A  very  close  to  *J\  -ft2  we  get  small  amplitude  solitons  which  are  well 
described  by  the  analytic  solutions  (38).  As  A  is  decreased  the  soliton  amplitude  increases 
and  acceptable  solutions  occur  only  at  discrete  values  of  A.  In  other  words,  for  a  fixed 
value  of  /?  finding  soliton  solutions  turns  out  to  be  an  eigenvalue  problem  in  A.  The  sizes 
and  shapes  of  these  solitons  also  vary  as  a  function  of  A.  Typically  0  has  a  characteristic 
bell  shape  whereas  R  has  a  number  of  nodes.  Figure  4  shows  a  typical  soliton  solution  for 
/3  =  0.97  and  A  =  0.224445.  For  applications  such  as  particle  acceleration  or  photon 
acceleration,  the  regime  of  interest  is  j3  ->•  1,  where  the  group  velocity  is  close  to  c.  We 
have  carried  out  a  detailed  investigation  of  soliton  solutions  in  this  regime.  Figure  5  is  a 
plot  of  normalized  group  velocity  Vg  versus  the  normalized  carrier  frequency  £}(=  w/wp). 
The  solid  curve  corresponds  to  soliton  pulse  results  obtained  from  our  numerical 
work.  For  comparison  we  have  also  plotted  the  linear  group  velocity  (dashed  line) 


VgL  =  v(l  —  l/^2)  and  the  nonlinear  group  velocity  for  the  infinite  plane  wave  (dotted 
line),  Vgoo  =  /3[1  -  (700  -  l)/(2fi2700(700  +  1)].  This  last  expression  has  been  recently 
obtained  by  Mori  et  al  [14]  and  depends  on  the  infinite  wave  7  viz.  7^  =  Vl  -\-A2.  The 
nonlinear  group  velocities  are  closer  to  c  than  the  linear  group  velocity.  This  is  physically 
understandable  because  the  nonlinearity  makes  the  electrons  heavier  and  thereby 
weakens  the  plasma  dielectric  effects.  Finite  width  soliton  pulses  propagate  slower  than 
the  infinite  plane  waves.  Physically  this  is  because  coupling  to  plasma  waves  acts  as  a 
drag  on  the  electromagnetic  waves  and  slows  them  down.  The  deviations  between  Vgoo 
and  Vgj5  can  be  substantial;  in  our  numerical  work,  we  have  observed  (not  shown) 
deviations  upto  25  per  cent  of  the  difference  between  Vg  and  1. 

To  understand  these  solutions  better  we  consider  another  approximate  analytic  limit  to 
equations  (33-35)  particularly  applicable  when  the  <f>  soliton  structure  is  large.  When 
0max  is  large,  the  longitudinal  velocity  u  — >  -1,  n  — »•  /3(1  +  /3)"1  and  71/7  -+  0.  In  this 
case,  the  light  wave  propagates  essentially  in  a  plasma  free  region  (because  the  residual 
electrons  have  become  infinitely  massive)  and  we  can  approximate  the  effective 
Hamiltonian  as 


(41) 

where  the  •  •  •  refers  to  terms  of  order  <  0"1  which  are  neglected.  From  (41)  we  get 
R  =  7?osin(A£/(l  -  /32)).  Furthermore,  the  structure  of  fi  near  its  maximum  can  be 
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Figure  4.  Electrostatic  potential  0  (dashed  line),  and  electromagnetic  wave 
amplitude  (R)  (solid  line)  profiles  for  a  soliton  pulse  with  p  =  6,  0  =  0.97  and 
A  =  0.224445. 
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Figure  5.  Normalized  group  velocity  vs  normalized  frequency  curves  for  soliton 
pulses  (solid  line),  infinite  pulses  (dotted  line)  and  linear  pulses  (dashed  line).  The 
maximum  electromagnetic  wave  amplitude  RQ  for  the  soliton  pulse  (for  p  =  18)  are 
indicated  with  solid  triangles. 


approximated  by  the  parabolic  equation  (u  ~ >  -1  in  (33))  <fi  =  0max  -  £2/2(l+  j3),  which 
gives  an  estimate  of  the  spatial  scale  size  of  the  soliton  as  £max  ~  ^2(1  +  /3)0max.  From 
(36, 37)  we  know  that  K  =  0  for  a  soliton  solution.  Noting  that  K  ~  K&K  for  0  =  0max, 
0'  =  0,  we  get  0max  =  /3  -f-  A2/?g/2(l  -  /3).  Furthermore,  if  the  soliton  region  involves  p 
wavelengths  of  the  light  wave,  we  have  pir(\  -  p>2}/\  cz  £max.  The  above  expressions 
lead  to  the  following  approximate  analytic  relationship  between  A,  RQ  and  /3  as 

9(1  _  m  r-7T2n2 
D2  _  ^V1        P)  I™  p     ,,         a2->,.         m 


(42) 

Equation  (42)  is  largely  corroborated  by  the  numerical  work  shown  in  figures  4  and  5. 
Recently,  Kuehl  et  al  [15]  have  carried  out  a  WKB  analysis  in  the  weak  amplitude  limit 
of  eqs  (33-35)  and  have  obtained  a  more  refined  estimate  of  the  discrete  spectrum.  They 
have  also  shown  that  the  discrete  nature  of  the  eigenvalue  spectrum  persists  as  long  as  the 
dispersion  term  in  the  Poisson  equation  is  retained.  If  one  neglects  the  dispersion  then 
0  oc  R2  in  the  small  amplitude  limit  and  substituting  into  the  R  equation  leads  to  the 
nonlinear  Schroedinger  equation  which  has  a  continous  spectrum  of  solitons.  Thus 
the  coupled  electromagnetic-plasma  wave  solitons  have  a  unique  characteristic.  The 
importance  of  the  dispersive  term  in  the  formation  of  these  solitons  has  also  been  pointed 
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recent  wonc  or  uecicer  ana  ivion  ei  ai  u  /j  wno  nave  carried  out  panicie  simuiauons  lor 
this  purpose. 


6.  Discussion 

We  have  discussed  the  nonlinear  propagation  of  relativistically  intense  electromagnetic 
waves  into  collisionless  overdense  plasmas  putting  a  special  emphasis  on  1-d  plane  wave 
solutions  of  the  propagating,  standing  and  modulated  (i.e.  localized)  types.  These 
solutions  already  exhibit  a  rich  variety  of  phenomena  associated  with  relativistic  electron 
mass  variations  and  coupling  between  transverse  electromagnetic  and  longitudinal  fields. 
However,  it  must  be  admitted  that  whereas  these  special  solutions  are  excellent 
candidates  for  elucidating  the  physics  of  the  important  nonlinear  phenomena,  their 
relevance  to  truly  experimental  situations  may  be  determined  by  questions  related  to  their 
accessibility.  Thus  one  may  need  very  special  initial  conditions  for  setting  them  up  and 
one  may  have  to  worry  about  their  stability  to  noise  perturbations  in  one  and  more 
dimensions.  Many  of  these  questions  have  not  yet  been  addressed  in  much  detail. 

One  question  which  has  received  some  attention  is  that  of  self-focusing  and  self- 
modulational  instability  of  relativistically  intense  electromagnetic  wave  [18, 19].  Self- 
focusing  instability  occurs  when  the  field  power  exceeds  a  critical  value  and  leads  to 
filamentation  of  intense  electromagnetic  waves  [20, 18]  Similarly  a  self-modulational 
instability  could  be  responsible  for  converting  a  propagating  plane  wave  solution  (§  3) 
into  one  with  localized  modulational  envelopes  (§5).  If  both  instabilities  are 
simultaneously  occuring,  the  final  object  would  be  a  multi-dimensional  localized  region 
('bubble')  of  intense  electromagnetic  waves  interacting  with  longitudinal  fields  as  they 
propagate.  The  analytic  description  of  such  'bubbles'  still  remains  an  outstanding 
problem  in  nonlinear  plasma  physics.  The  strong  low  frequency  electromagnetic  radiation 
believed  to  be  present  in  the  pulsar  environments  [21,22]  is  the  most  important 
astrophysical  problem  where  these  results  have  been  applied.  Retaining  relativistic  effects 
in  the  ion  motions  also,  it  has  been  concluded  that  the  pulsar  radiation  can  penetrate  to 
large  distances  in  the  Crab  nebula,  thus  allowing  the  possibility  of  an  energy  source  in 
these  volumes  [5]. 

In  laboratory  plasmas,  the  most  important  application  of  this  work  is  in  laser  fusion  and 
plasma  based  accelerators.  In  the  inertial  fusion  schemes,  a  point  of  view  has  recently 
developed  that  it  would  be  of  interest  to  create  a  hot  spot  in  the  central  core  of  a 
compressed  inertial  fusion  pellet  by  some  independent  means  [23].  The  use  of  a 
relativistically  intense  light  beam  might  be  one  possible  method  of  reaching  such 
overdense  regions.  Similarly,  for  plasma  based  accelerator  schemes  the  creation  of  space 
charge  fields  propagating  with  velocities  close  to  that  of  light  is  of  considerable 
interest  [24-26].  A  proper  nonlinear  description  of  the  group  velocity  of  such  coupled 
stationary  waves  is  therefore  under  intense  investigation  [13, 15, 14],  The  solitary  light 
pulses  which  are  mostly  empty  of  plasma  in  the  central  region,  but  have  large  plasma 
accumulation  at  the  edges,  have  consequently  large  changes  in  n/7  inside  the  pulse.  One 
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In  this  paper,  we  are  concerned  with  the  demonstration  that  the  universal  properties 
of  the  quasiperiodic  route  to  chaos  in  the  sine  circle  map  ([2],  [3],  [4])  is  an 
appropriate  framework  for  describing  the  low-Reynolds-number  behavior  of  the  wake 
behind  an  oscillating  cylinder.  Previous  experimental  studies  of  this  nature  in  this 
particular  flow  can  be  found  in  Refs.  [5]  and  [6],  and  similar  studies  in  other  fluid  flows 
in  Refs.  [7],  [8],  [9],  and  [10].  This  paper  is  in  some  respects  a  synopsis  of  results 
obtained  a  few  years  ago  [11],  but  it  also  presents  some  that  have  not  been  published 
before. 

2.  The  flow 

We  study  the  wake  of  an  oscillating  circular  cylinder  of  diameter  0.5  mm,  housed  in  a 
low-noise  suction-type  wind  tunnel,  and  oscillated  electromagnetically  in  the  direction 
transverse  to  that  of  upstream  fluid  velocity.  In  practice,  the  oscillating  cylinder  is  a 
stainless  steel  string  of  uniform  diameter  stretched  taut,  through  the  wind  tunnel  walls, 
over  a  distance  that  is  four  times  the  tunnel  span.  The  active  length/diameter  ratio  of  the 
cylinder  is  about  300.  The  Reynolds  number  based  on  the  cylinder  diameter  and  the 
velocity  of  the  oncoming  air  flow  is  55.  We  place  a  velocity  sensor  at  a  distance  of  15 
diameters  behind  the  cylinder,  half  a  diameter  off  the  geometric  axis  of  the  stationary 
wake.  The  velocity  sensor  is  a  standard  hotwire  of  5  u.m  diameter  and  0.6  mm  length, 
operated  on  a  constant  temperature  anemometer.  The  sensor  measures  the  streamwise 
component  of  fluid  velocity.  For  more  details  of  the  experimental  arrangement, 
instrumentation  and  data  processing,  please  see  Ref.  [11]. 

The  regular  'shedding'  of  vortices  at  Reynolds  numbers  above  about  40  imparts  to  the 
wake  a  periodic  motion  of  frequency  f0,  say.  The  frequency  f0  is  a  known  function  of  the 
Reynolds  number  [12].  The  transverse  oscillation  at  the  controlled  frequency  fe  imposes  a 
second  frequency  on  the  wake.  The  amplitude  of  oscillation  of  the  cylinder  can  be 
thought  of  as  a  coupling  parameter  between  vortex  shedding  and  cylinder  oscillation.  This 
nonlinear  coupling  "shifts"  the  shedding  frequency  to  some  neighboring  value,  say  f'0, 
which  depends  on  the  ratio  0  =fe/fo  and  the  magnitude  of  the  nonlinear  coupling 
between  the  two  oscillators.  In  the  jargon  of  quasiperiodic  dynamical  systems,  the 
parameter  0  =fe/f0  is  called  the  bare  winding  number,  and  the  ratio  u)  =fe/f'o  is  called 
the  dressed  winding  number. 

3.  The  circle  map 

Systems  with  two  independent  frequencies  f0  and/e  define  a  torus  in  the  phase  space.  As 
the  system  evolves,  the  phase  space  trajectory  hits  a  Poincare  section  at  some  discrete 
iterates  along  a  circle.  The  angular  position  of  the  iterates  is  the  variable  of  dynamical 
interest.  A  paradigm  situation  is  given  by  the  sine  circle  map 


On+l  =  0n+n--sin(27r0,I),  (1) 

where  On  is  the  angular  coordinate  of  the  nth  iterate,  O  is  the  bare  winding  number  and 
K  >  0  is  the  nonlinear  coupling  parameter.  The  map  is  utilized  by  setting  0  <  ft  <  1  and 
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obtaining  the  iterates  of  9.  The  dressed  winding  number  is  defined  as 
u  =  \m\n^[(0n-60}/n}. 

The  experimental  situation  is  qualitatively  similar  to  that  encountered  in  circle  maps; 
following  the  spirit  of  universality  mentioned  above,  it  appears  reasonable  to  explore  the 
analogies  quantitatively.  For  this  purpose,  we  should  recall  a  few  key  results  about  the 
circle  map,  known  from  the  theoretical  analysis  of  Refs.  [2j-[4]: 

9  For  low  forcing  amplitudes,  equation  (1)  exhibits  Arnol'd  tongues  (regions  of  mode 
locking)  in  which  the  frequency  f0  shifts  so  that  the  dressed  winding  number  is  fixed  as 
a  rational  over  some  finite  range  of  fe. 

•  In   other  case  where  the   dressed  winding   number  is   irrational,   the   motion   is 
quasiperiodic. 

o  As  the  nonlinear  coupling  parameter  K  increases,  the  Arnol'd  tongues  widen;  they 
overlap  at  some  critical  line  corresponding  to  the  critical  value  of  K.  This  critical  line 
corresponds  to  the  breakdown  of  the  two  torus  and  the  onset  of  chaos. 

•  Along  the  critical  line,  the  system  exhibits  the  devil's  staircase  structure. 

«  A  special  case  of  interest  is  the  locus  of  the  position  in  the  K  —  il  space  where  the 
dressed  winding  number  is  the  inverse  of  the  golden  mean.  (Loosely  speaking,  golden 
mean  =  (v/5  —  l)/2  is  the  "most"  irrational  number,  or  least  well  represented  by  a 
rational  approximation,  because  its  continued  fraction  representation  contains  only  the 
numeral  one.)  At  the  critical  point  on  this  line,  where  uj  —  ag,  the  latter  being  the 
inverse  of  the  gloden  mean,  the  system  is  least  influenced  by  mode-locking  and  is 
quasiperiodic.  The  properties  of  the  system  at  this  critcal  point  are  universal. 

•  At  the  critical  point,  the  power  spectrum  of  the  angular  variable  has  a  certain  well- 
quantified  self-similar  structure  [2]. 

•  A  description  of  this  behavior  requires  modern  dynamical  systems  theory  including 
multifractals  [13]  and  scaling  functions  [2]. 

4.  Devil's  staircase  and  power  spectral  density 

In  Refs.  [5]  and  [11],  it  was  shown  that  most  of  these  properties  were  exhibited  by  the 
wake  of  the  oscillating  cylinder.  As  a  quick  recapitulation,  we  show  in  figure  1  the  devil's 
staircase  structure  along  the  critical  line.  The  small  circles  mark  the  extremes  of  a  few 
principal  Arnol'd  tongues  corresponding  to  the  rational  values  of  the  dressed  winding 
number  marked  on  the  figure.  These  are  to  be  compared  with  the  horizontal  bars 
representing  the  theoretical  results  for  the  circle  map.  The  comparison  is  good  on  the 
whole.  Many  other  tongues  were  mapped  out  experimentally  but  are  not  given  on  the 
figure.  A  few  narrower  tongues  near  the  rational  numbers  4/15  to  5/16  are  displayed  in 
the  inset.  The  general  pattern  is  the  same  as  that  of  the  circle  map,  although  some  visible 
differences  occur. 

As  a  second  example  of  the  correspondence  between  the  wake  and  the  circle  map,  we 
show  in  figure  2  the  scaled  power  spectral  density  of  the  velocity  signal  obtained  close  to 
the  critical  point.  As  known  for  the  circle  map  dynamics,  we  have  (nearly)  equal - 
amplitude  spectral  peaks,  marked  1,  at  powers  of  ag  up  to  the  fifth.  For  higher  powers,  the 
flow  system  does  not  respond  like  the  circle  map,  yet  the  appearance  of  these  equally- 
spaced  peaks  is  quite  encouraging.  Furthermore,  between  any  two  of  the  four  right-most 
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Figure  1.  The  devil's  staircase  showing  mode-lockings  along  an  experimentally 
determined  critical  line  (circles).  The  general  pattern  is  similar  to  that  of  the  circle 
map  (bars).  The  inset  expands  the  small  rectangle  to  the  lower  left. 


primary  spectral  peaks,  there  are  families  of  peaks  marked  2,  3,  4  and  5,  which — where 
they  exist — appear  at  the  right  place  according  to  the  circle  map  theory.  More  delicate 
fine  structure  in  the  circle  map  is  not  replicated  by  the  fluid  system.  It  is  difficult  to  say  if 
these  differences  are  real  or  if  the  fluid  system  is  not  exactly  at  the  critical  point  as  the 
theory  demands.  Locating  the  critical  point  is  quite  delicate,  and  the  results  are 
conspicuously  sensitive  to  the  precise  location  in  its  neighborhood  [11].  Independent  of 
this  uncertainty,  it  is  fair  to  say  that  the  wake  behaves,  at  least  grossly,  much  like  a  circle 
map.  This  then  suggests  that  the  underlying  dynamics  of  this  fluid  system  may  live  on  a 
low-dimensional  attractor  whose  Poincare  section  closely  resembles  that  generated  by  the 
circle  map.  This  expectation  is  explored  in  greater  detail  below. 


5.  Multifractal  analysis 

To  construct  the  experimental  Poincare  section  at  the  critical  point  in  phase  space,  the 
continuous  hotwire  velocity  time  trace  u(t]  is  sampled  discretely  at  a  time  period  r  equal 

to  the  period  of  excitation,  yielding  Pn  =  u(t  +  n  *  r),  n  =  0, 1, 2, 3, This  procedure 

can  lead  to  some  problems  if  there  is  a  drift  in  the  bare  or  dressed  winding  numbers.  A 
better  procedure  is  to  select  points  in  the  continuous  time  trace  which  have  zero  time 
derivative.  In  either  case,  the  Pn  are  embedded  in  the  three-dimensional  phase  space 
(Pn,  Pn+i ,  PM+2),  three  being  the  lowest  dimension  in  which  the  Pn  are  non-interacting.  A 
two-dimensional  projection  of  the  three-dimensional  Poincare  section  is  shown  in 
figure  3.  The  attractor  lies  essentially  on  a  one-dimensional  subspace  and  is  perceptibly 
inhomogeneous.  (Other  projections  described  in  Ref.  [11]  confirm  this  conclusion.) 

We  first  turn  to  a  suitable  statistical  characterization  of  the  attractor.  From  the  theory  of 
circle  maps,  one  knows  that  the  attractor  generated  from  the  sine  circle  map  has  a 


696 


Pratnana  -  /.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  n) 
Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


•  -7 

Power/f2 
(log  scale)  .a 


a  5  a  4    a  3   a  2  a  !    a>= 


2.2          2.4  2.4          2.8 


frequency  (f),  Hz 
(log  scale) 

Figure  2.  The  scaled  power  spectral  density  for  the  velocity  signal  measured  in  the 
wake  of  the  oscillating  cylinder.  The  dressed  winding  number  is  estimated  to  be  within 
0.1  %  of  ag.  The  structure  is  quantitatively  similar  to  that  of  the  circle  map. 
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Figure  3.  Two-dimensional  projection  of  the  experimental  Poincare  section  at  the 
critical  golden  mean  point,  obtained  by  embedding  the  velocity  signal  in  three 
dimensions  as  described  in  the  text. 


particular  self-similar  structure  that  can  be  quantified  via  the  multifractal  formalism. 
Exploitation  of  this  self-similar  structure  for  making  predictions  about  the  dynamics  of 
the  system  is  the  goal  of  the  multifractal  formalism.  To  this  end,  one  can  construct  a 
measure  on  this  attractor  by  subdividing  it  into  boxes  of  equal  size  and  computing  the 
density  of  points  in  each  of  these  boxes.  This  gives  one  a  measure  of  how  often  certain 
regions  of  the  attractor  are  visited  in  comparison  with  others.  One  defines  a  singularity 
strength  [13]  by 

7r,.~LQ<,  (2) 
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Figure  4..  The  /(a)  curve  obtained  from  analyzing  the  measure  on  the  Poincare 
section.  The  solid  line  is  the  theoretical  curve  for  the  sine  circle  map  at  the  golden 
mean  critical  point.  Typical  error  bars  are  shown. 

where  TT/  is  the  relative  density  of  points  in  the  i'h  box  of  size  L.  The  box  size  L  is 
restricted  to  the  scaling  bounded  by  some  inner  and  outer  cut-off  scales.  Within  the 
scaling  range,  one  can  define  f(a)  as  the  dimension  of  the  set  of  points  possessing 
singularity  strength  a  by  the  equation 

N(a)  ~  L~^a\  (3) 

where  N(a)  is  the  number  of  boxes  of  size  L  possessing  singularity  strength  a.  The  f(oi) 
formalism  entails  viewing  the  attractor  as  the  union  of  several  interwoven  fractal  sets 
each  having  a  measure  of  a  different  singularity  strength.  (In  equations  (2)  and  (3),  the 
symbol  ~  subsumes  all  the  prefactors  needed  to  render  them  dimensionally 
homogeneous.) 

A  summary  of  these  multifractal  results  is  given  in  figure  4  which  shows  a  comparison 
between  the  measured/(a)  curve  at  the  experimentally  determined  critical  point  and  that 
for  the  sine  circle  map.  Space  limitations  preclude  a  description  here  of  how  the  curve 
was  obtained  from  experimental  data.  This  information  can  be  found  in  Ref.  [11].  On  the 
left  part  of  the  figure  representing  the  scaling  of  high  probability  regions  on  the  attractor, 
the  data  agree  well  with  the  circle  map  predictions.  The  right  half  of  the  /(a)  curve 
represents  the  scaling  of  the  low-probability  events  on  the  attractor  and  is  usually 
unreliable  in  experiment  because  of  the  strong  influence  of  noise  (even  if  small).  It  is  thus 
unclear  at  present  whether  the  discrepancy  on  the  right  part  represents  a  true  departure 
from  the  universal  scaling  or  is  an  experimental  artifact. 

We  have  discussed  this  issue  in  more  detail  in  Refs.  [11]  and  [14]  where  we  have 
concluded  that  the  correspondence  between  the  circle  map  and  the  flow  is  indeed  real. 


6.  The  trajectory  scaling  function 

One  would  like  to  push  further  the  correspondence  between  the  flow  and  the  circle  map. 
One  powerful  way  of  describing  the  dynamics  of  a  system  is  the  so-called  Feigenbaum 
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Chaos  and  oscillating  cylinder 

scaling  function  [15],  [16].  This  object  contains  microscopic  information  about  a 
deterministic  dynamical  system  and  its  scaling  properties.  At  the  onset  of  chaos,  the 
attractor  can  be  regarded  as  constructed  by  an  underlying  multiplicative  process  that 
successively  refines  it,  in  such  a  way  that  it  possesses  scaling  properties  at  finer  and  finer 
scales.  The  scaling  function  gives  a  compact  description  of  the  refinement  basic  to  the 
fractal  structure  of  the  attractor.  The  underlying  multiplicative  process  can  be  mapped  on 
to  a  subdividing  tree  structure  where,  on  the  average,  each  branch  divides  into  2 
subbranches  at  each  successive  level  of  refinement.  The  iterates  of  the  dynamical  system 
become  branches  of  the  tree,  and  intervals  are  formed  by  joining  nearest  neighbor 
branches. 

At  the  critical  golden  mean  winding  number,  the  attractor  can  be  thought  to  have  been 
generated  by  successive  refinements  of  a  measure  created  from  superstable  periodic 
orbits  whose  winding  numbers  are  successive  Fibonacci  approximations  to  the  golden 
mean.  The  Fibonacci  sequence  is  given  by  the  relation  F/+2  =  F,-  +  F/+i  with  F0  =  0  and 
FI  =  1.  It  is  easily  verified  that  the  ratio  p/q  =  F//F;+i  approaches  ag  for  large  enough  /. 
Since  the  intervals  at  each  level  are  formed  by  joining  nearest  neighbor  points,  a 
measure  is  constructed  to  give  equal  weight  to  each  such  interval.  The  scaling  function, 
which  describes  the  contraction  factors  of  each  interval  along  each  branch  as  the  tree 
subdivides,  is  defined  as 


where  L(en,  .  .  .  ,  e0)  is  the  length  of  an  interval  belonging  to  the  tree.  The  Kronecker 
delta's  ensure  that  the  two  intervals  being  compared  have  the  same  history  in  the  tree 
construction.  The  sequence  of  e's  describes  the  location  of  the  intervals  in  the  tree 
structure,  and  assume  the  binary  values  of  0  or  1.  For  the  circle  map,  the  sequence 
(  e,-+i,e,-)  —  (1,  1)  is  forbidden,  consistent  with  the  Fibonacci  sequence  defining  the 
periodic  orbits.  The  argument  of  the  function  is  determined  by  expanding  time  (or 
iterations)  in  base  2,  i.e.,  t  =  Se,-2',  yielding  <j(f).  After  suitable  normalization,  this 
procedure  associates  a  real  number  between  0  and  1  to  each  interval.  Such  an  expansion 
also  correctly  organizes  the  intervals  to  be  compared.  It  should  be  stressed  that  if  one  can 
extract  the  scaling  function  from  the  experiment,  one  has  a  complete  microscopic 
description  of  how  the  attractor  was  refined,  and  one  can  easily  compute  all  statistical 
averages. 

We  shall  now  assume  that  the  two  entities,  namely  the  flow  and  the  map,  possess 
the  same  multiplicative  structure,  leading  to  a  particular  kind  of  scaling  at  finer  and 
finer  scales.  We  now  like  to  quantify  the  refinement  process  by  extracting  the  values 
of  scale  contractions  at  each  stage  of  refinement  and  comparing  them  with  those  from 
the  circle  map.  Here  one  runs  into  the  difficulty  that  the  scaling  function  is 
experimentally  not  a  robust  quantity  and  therefore  difficult  to  obtain.  We  therefore 
extract  a  suitably  modified  scaling  function  [17],  which  is  based  on  comparing  intervals 
within  a  single  periodic  orbit  rather  than  two  orbits.  Focusing  on  the  golden  mean 
winding  number,  we  note  that  the  ratios  p/q,  which  approximate  the  golden  mean 
successively  better  for  each  increasing  index  on  the  Fibonacci  sequence,  correspond  to 
the  different  frequency-locked  Arnol'd  tongues  mentioned  earlier.  We  thus  need  to  study 
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Figure  5.  A  typical  Poincare  section  for  the  mode-locked  state  for  the  ratio 
pjq  =  5/8.  The  numbers  by  the  point  clusters  represent  the  dynamical  sequence  of  the 
periodic  iterates  creating  the  attractor.  The  sequence  1-6-3-8-5-2-7-4  is  consistent  with 
the  circle  map  theory.  The  Poincare  section  is  embedded  in  three-dimensional  phase 
space  whose  two-dimensional  projection  is  presented.  Three  dimensional  vector 
distances  between  neighboring  points  are  used  to  determine  the  trajectory  scaling 
function. 


periodic  orbits  of  length  Ft  =  q,  where  9n  =  6n+q.  It  follows  that  Bn  and  0n+F,_,  are  nearest 
neighbors  on  the  Poincare  section,  9n  and  0n+F,_2  are  the  next  nearest  neighbors,  and  so 
forth.  Following  Ref.  [17],  we  define  the  trajectory  scaling  function  within  a  single 
periodic  orbit  for  the  circle  map  as 


if  0<;<Fn_2 


(5) 


and 


0V  = 


if  Fn-.2<j<Fn. 


(6) 


Since  we  wish  to  approach  the  golden  mean  critical  point  with  a  sequence  ofp/q  rational 
approximations,  we  experimentally  adjust  the  system  at  the  critical  amplitude  to  the  best 
possible  rational  approximation  to  the  golden  mean  winding  number.  For  any  given  p/q 
approximation,  the  Poincare  section  consists  of  a  set  of  q  discrete  points.  For  the 
experimental  situation,  a  two-dimensional  projection  for  q  =  8  is  shown  in  figure  5.  The 
experimental  trajectory  scaling  function  is  obtained  using  the  three-dimensional  vector 
distances  between  the  appropriate  points.  The  noise  within  each  point  cluster  is  reduced 
by  averaging.  The  experimental  trajectory  scaling  function  for  a  5/8  lock-in  state  is 
shown  in  figure  6.  The  reader  should  examine  figure  6  with  figure  4  of  Ref.  [17]  to  realize 
that  the  present  comparison  is  actually  of  better  quality  than  for  the  convection 
experiment  considered  there.  The  agreement  between  theory  and  experiment  is 
reasonably  good,  though  more  data  points  and  smaller  experimental  error  would  have 
been  desirable. 
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Figure  6.  The  trajectory  scaling  function  from  the  experiment  for  a  5/8  lock-in 
state.  The  solid  line  is  computed  from  the  sine  circle  map  at  the  golden  mean  critical 
point.  The  various  symbols  represent  calculations  from  different  experimental  runs. 


7.  Conclusions 

Even  though  some  departures  from  the  universality  associated  with  the  circle  map  are 
apparent  in  wake  dynamics,  we  think  that  the  extent  of  the  observed  similarity  is 
remarkable.  It  is  not  obvious  whether  the  departures  observed  are  real  or  occur  because 
the  control  parameters  are  not  as  fine  tuned  as  desired.  It  is  known,  for  example,  that 
departures  from  criticality  can  produce  comparable  departures  from  universality  to  those 
observed  here  [18].  Apropos  of  this  state  of  affairs,  we  reiterate  that  enormous  care  was 
exercised  in  the  experiments,  and  state  our  belief  that  the  residual  problems  of  fine 
control  cannot  be  eliminated  without  resorting  to  unconventional  ways  of  generating  such 
flows.  A  particular  focus  of  this  work  has  been  the  search  for  universal  behavior  through 
the  study  of  the  trajectory  scaling  function.  This  gives  a  detailed  description  of  the 
attractor.  We  believe  that  the  agreement  with  universality  predictions  for  this 
experimentally  fragile  but  rich  function  is  indeed  considerable. 

In  the  past,  observations  such  as  these  have  evoked  the  question:  why  does  a  fluid  flow 
governed  by  coupled  field  equations  follow  the  dynamics  of  a  simple  map?  The  key 
idea  here  is  universality:  a  system  may  exhibit  quantitative  universality  as  long  as  it 
possesses  certain  common  qualitative  properties.  The  relevant  statement  here  is  that  the 
dynamics  of  many  systems  with  two  nonlinearly  coupled  modes,  one  of  which  is 
periodically  driven,  yield  the  dynamics  of  a  sine  circle  map  near  the  critical  point.  It  must 
be  said  that  at  present  there  are  no  general  rules  that  help  us  forecast  with  certainty  that  a 
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flow  in  which  a  few  modes  are  excited  will  or  will  not  follow  a  certain  universal  scenario. 
However,  with  the  hindsight  of  the  experimental  (or  other)  knowledge  about  the  existence 
of  a  universal  scenario,  it  is  often  possible  to  derive  the  simple  map  from  the  governing 
equations. 

-  In  the  particular  case  of  the  oscillating  cylinder,  the  experimental  demonstration  of  the 
correspondence  between  the  flow  and  the  circle  map  implies  that  one  should  be  able  to 
extract  the  latter  appropriately  from  the  equations  of  motion  governing  the  flow.  Without 
this  step,  the  connections  implied  are  unsatisfying  and,  to  a  skeptical  mind,  perhaps  no 
more  than  a  curiosity.  Some  work  in  the  needed  direction  has  already  occurred  [11].  The 
starting  point  is  the  Landau-Stuart  equation  [19],  [20]  which,  on  the  one  hand,  can  be 
deduced  from  the  equations  of  motion  and,  on  the  other  hand,  is  also  known  to  describe 
the  wake  evolution  near  the  critical  Reynolds  number  [21],  [22].  Add  to  this  equation  an 
external  forcing  term  F  =  F0exp(iu)et},  in  the  same  spirit  as  that  of  the  oscillating 
cylinder  experiment.  From  this  one  can  generate  a  coupled  dynamical  system  for  the 
amplitude  and  phase  of  a  disturbance  in  the  forced  wake.  An  analytical  study  of  even  this 
simplified  dynamical  system  is  too  complex,  but  its  numerical  study  shows  features  that 
are  close  to  those  observed.  In  the  limit  of  low  forcing,  this  dynamical  system  indeed 
reduces  to  the  circle  map. 
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Abstract.  Sometime  ago  Ananthakrishna  and  coworkers  had  predicted  the  existence  of  chaos 
in  jerky  flow  based  on  a  nonlinear  dynamical  model  consisting  of  the  time  evolution  equations 
for  three  types  of  dislocations  and  an  equation  for  the  evolution  of  the  stress.  Our  main  focus  here 
is  to  report  the  verification  of  this  prediction  by  analysing  the  stress  signals  obtained  from 
samples  of  AlCu  alloys  subjected  to  a  constant  strain  rate  test.  The  analysis  of  the  stress  signals 
is  carried  out  by  using  several  methods.  The  analysis  shows  the  existence  of  a  finite  correlation 
dimension  and  a  positive  Lyapunov  exponent.  We  also  carry  out  a  surrogate  analysis  of  the 
time  series  to  ascertain  that  the  signals  are  not  from  a  power  law  stochastic  process.  From  the 
analysis  we  find  that  the  minimum  number  of  variables  required  for  a  dynamical  description  of  the 
jerky  flow  appears  to  be  four  or  five,  consistent  with  the  number  of  degrees  of  freedom  envisaged  in 
the  model. 

Keywords.  Jerky  flow;  Portevin-Le  Chatelier  effect;  dislocations;  chaos;  correlation  dimension; 
Lyapunov  exponent;  singular  value  decomposition;  surrogate  analysis. 

PACS  Nos    05.45;  62.20;  81.40 


1.  Introduction 

The  work  reported  here  is  an  exercise  in  the  application  of  concepts  and  techniques 
in  dynamical  systems  to  problems  in  plasticity.  The  subject  matter  falls  in  the  general 
area  of  'stick-slip'  phenomenon.  For  the  sake  of  a  general  audience,  we  very  briefly 
recall  some  basic  features  of  a  specific  form  of  plastic  instability  arising  under  a  constant 
strain  rate  test.  This  is  known  as  jerky  flow  or  the  Portevin-Le  Chatelier  (PLC)  effect 
and  has  been  an  object  of  attention  for  a  long  time  in  metallurgical  literature.  Dislocations 
are  the  basic  defects  contributing  to  the  plastic  deformation  of  specimens.  The  stress- 
strain  curves  show  repeated  stress  drops  and  each  stress  drop  in  the  deformation  curve 
is  associated  with  the  occurrence  of  a  dislocation  band.  The  band  usually  propagates 
along  the  sample.  The  phenomenon  is  seen  in  a  number  of  alloys  and  is  known  to 
manifest  only  in  a  window  of  applied  strain  rates  and  temperatures  [1].  Alloys 
always  contain  point  defects  such  as  substitutional  and  interstitial  atoms  which  are 
commonly  referred  to  as  solute  atoms.  At  intermediate  temperatures,  the  cores  of  these 
dislocations  are  saturated  with  solute  atoms,  since  these  atoms  have  a  tendency  to  migrate 


to  the  cores  of  the  dislocations.  Consider  a  constant  strain  rate  experiment  and  assume 

that  dislocations  are  initially  free  from  solute  atoms.  Once  the  stress  developed  in 

the  sample  is  enough  to  move  dislocations,  they  sweep  the  sample  and  in  the  process 

gather  solute  atoms  leading  to  immobilization  of  dislocations.  This  happens  once 

the  solute  concentration  at  the  core  of  the  dislocation  exceeds  a  certain  critical  value. 

However,  since  the  sample  is  subject  to  a  constant  strain  rate  deformation,  the  stress  in 

the  sample  increases  to  such  a  value  that  dislocations  break  away  from  the  clouds  of 

solute  atoms.  The  breakaway  of  dislocations  results  in  an  yield  drop.  This  whole  process 

repeats  thereby  multiple  yield  drops  are  seen.  It  is  clear  that  this  repetitive  nature  of 

the  process  arises  due  to  a  competition  of  two  time  scales,  namely,  the  time  scale 

associated  with  the  velocity  of  dislocations  and  the  diffusion  time  scale  of  the  solute 

atoms.  This  phenomenon  is  known  as  dynamic  strain  ageing  (DSA)  and  was  first 

envisaged  by  Cottrell  [2].  In  spite  of  the  enormous  attention  the  subject  has  received, 

several  aspects  of  the  phenomenon  were  not  well  understood  until  recently.  There  has 

been  renewed  attempts  [3-9]  to  understand  this  phenomenon  by  introducing  concepts 

borrowed  from  the  theory  of  dynamical  systems.  This  has  helped  to  gain  new  insights 

hitherto  not  possible.  In  most  of  the  models,  the  negative  strain  rate  sensitivity  (SRS)  is 

an  input  into  diffusion  like  equations  [8,9].  Some  of  these  models  have  been  quite 

successful  in  explaining  a  number  of  features  of  the  PLC  effect  including  the  band 

velocity.  Even  the  existence  of  chaotic  solutions  remains  a  possibility  [9].  The  first  truly 

dynamical  model  was  attempted  by  Ananthakrishna  and  coworkers  [3-5]  several  years 

ago,  wherein  the  spatial  aspect  were  ignored.  This  method  starts  from  the  time  evolution 

equations  for  the  densities  of  three  types  of  dislocations.  These  equations  are  then 

coupled  to  the  equation  for  the  stress  developed  in  the  sample.  The  negative  SRS  results 

as  a  consequence  of  a  Hopf  bifurcation  from  a  time  homogeneous  steady  state  to  the  time 

oscillatory   state   [3-5,12].   The  theory  has  been  improved  to  include  the   spatial 

dependence  as  well  [13],  Although,  the  original  theory  ignored  the  spatial  aspects,  it 

proved  to  be  surprisingly  successful  in  that  it  could  explain  several  features  of  the  PLC 

effect  [5,12]. 

In  addition  to  above  results,  one  specific  prediction  of  the  model  is  the  possibility 
of  chaotic  behaviour  in  a  limited  range  of  strain  rates  [10,11].  The  first  detailed 
report  of  the  analysis  of  the  experimental  stress-strain  series  which  demonstrated 
the  existence  of  chaos  was  published  recently  [14],  eventhough,  this  result  was  quoted 
in  an  earlier  paper  [13].  Since  then  there  has  been  a  few  other  investigations  which 
have  provided  additional  support  that  jerky  flow  is  chaotic  [15-17].  The  implication 
is  that  the  dynamical  basis  of  the  model  with  only  a  few  degrees  of  freedom  is 
correct  [14]. 

The  purpose  of  the  paper  is  to  present  additional  evidence  that  the  dynamics  of  the 
underlying  phenomenon  is  indeed  chaotic.  The  methods  include  the  singular  value 
decomposition,  calculation  of  a  finite  correlation  dimension  and  a  positive  Lyapunov 
exponent.  In  addition,  we  carry  out  surrogate  analysis  to  demonstrate  that  the  time  series 
is  not  due  to  a  power  law  stochastic  process.  The  plan  of  the  paper  is  as  follows.  In  §2,  we 
briefly  present  the  model  for  the  PLC  effect  which  predicts  the  existence  of  chaos  along 
with  the  results  related  to  chaos.  In  §3,  we  outline  the  various  methods  of  analysis  used 
here.  We  then  apply  them  to  the  experimental  time  series.  Section  4  contains  discussion 
and  conclusions. 
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Chaos  in  jerky  flow 
2.  Chaos  in  a  model  for  the  PLC  effect 

In  the  following,  we  very  briefly  outline  our  model  for  the  PLC  effect  [3-5]  in  terms  of 
scaled  variables.  Details  of  the  model  can  be  found  in  the  original  papers.  It  is  a  dynamical 
model  using  three  types  of  dislocation  densities,  namely  the  mobile  dislocation  density 
denoted  by  x,  the  immobile  dislocation  density  denoted  by  y,  and  another  type  which  may 
be  regarded  as  dislocations  with  clouds  of  solute  atoms  denoted  by  z.  The  basic 
mechanisms  included  in  the  model  are:  dislocation  multiplication  through  cross 
glide,  annihilation  of  dislocations,  formation  of  dislocation  locks,  gradual  immobilization 
of  mobile  dislocations  by  solute  atoms,  and  the  re-mobilization  of  the  immobile 
dislocations  by  stress  or  thermal  activation.  These  mechanisms  lead  to  a  coupled  set  of 
nonlinear  differential  equations  for  the  rate  of  change  of  the  densities.  These  rate 
equations  are  coupled  to  the  machine  equation.  Using  a  power  law  dependence  for 
the  velocity  of  the  mobile  dislocations  in  terms  of  scaled  stress  (f>  given  by  Vm  =  Vo(<j6)m, 
we  can  write  the  equations  for  the  time  evolution  of  dislocation  densities  and  the  scaled 
stress  0  as 


x  =  c/)mx  —  box2  —  ax  -f-  y  —  xy  ,  (  1  ) 

y  =  bo[KbQX2  -  xy  —  y  +  az],  (2) 

z  =  c[x-z],  (3) 

<j>  =  d[e-<irx},  (4) 

where  the  parameter  a  refers  to  the  concentration  of  the  solute  atoms,  bo  to  the  strength  of 
reactivation  of  the  immobile  dislocations,  c  to  the  time  scales  over  which  the  slowing 
down  of  dislocation  occurs,  d  to  the  effective  modulus  and  e  to  the  applied  strain  rate. 
There  is  a  range  of  values  of  the  parameters  a,  &,  c,  rf  ,  AC,  e  and  m  for  which  we  find 
oscillatory  solutions.  The  model  is  known  to  produce  a  large  number  of  qualitative 
features  of  the  PLC  effect  [3-5].  One  specific  prediction  which  is  of  interest  here  is  that 
the  model  exhibits  chaos  in  an  intermediate  range  of  applied  strain  rates  [10-12].  The 
model  shows  the  period  doubling  route  to  chaos  when  the  velocity  exponent  m  ^  2 
[11,12].  The  value  of  the  Feigenbaum's  exponent  is  the  same  as  that  for  the  quadratic 
map.  For  m  >  2,  it  shows  a  bubble  structure.  For  m  close  to  unity,  the  model  shows  a 
period  adding  phenomenon.  Other  studies  carried  out  on  the  model  include  obtaining 
approximate  closed  form  expressions  for  the  limit  cycles  [4]  and  deriving  the  time 
dependent  Ginzburg-Landau  equation  for  the  order  parameter  for  the  creep  case  [18] 
(described  by  equations  1-3).  The  model  has  since  been  extended  to  include  spatial 
coupling  [13].  An  additional  result  that  emerges  is  the  velocity  of  the  band. 

3.  Detection  of  chaotic  behaviour  in  experiments 

3.1  Experimental  procedure 

Experiments  performed  under  constant  strain  rate  conditions  often  exhibit  two  features 
that  are  not  desirable  from  the  point  of  dynamical  analysis.  First,  there  is  an  upward  drift 
of  the  stress-strain  curve.  The  physical  origin  of  this  lies  in  the  fact  that  more  and  more 


dislocations  get  immobilized  and  are  stored  in  the  sample  with  a  time  scale 
commensurate  with  the  time  scale  of  immobilization.  This  is  called  as  strain  hardening. 
In  addition,  there  is  a  gradual  increase  in  the  amplitude  of  the  serrations.  Clearly  both 
these  features  are  undesirable  since  they  imply  nonstationarity  of  the  time  series. 
However,  it  is  often  possible  to  find  stress-strain  curves  which  are  flat.  In  the  present 
case,  the  stress  signals  are  in  such  a  region.  The  experiments  were  performed  on  single 
crystals  of  CuAl  14%  oriented  along  <  111  >  at  290 K  at  Poitiers.  The  strain  rate  of 
deformation  was  3.34  x  10~4s~1.  The  stress  signals  were  recorded  in  a  digitized  form 
from  a  single  deformation  curve.  Each  set  contained  4720  points  recorded  at  a  rate  of  242 
points  per  second.  Even  though  the  data  sets  are  short,  we  have  twelve  data  sets.  As  an 
illustrative  example,  we  will  present  the  results  on  one  data  set. 

3.2  Time  series  analysis 

It  is  known  that  the  stress  drops  in  constant  strain  rate  experiments  can  appear  noisy  in  a 
certain  regime  of  the  parameter  space  and  devoid  of  any  pattern.  Whenever  one  meets 
such  a  noisy  sequence,  the  main  objective  of  any  analysis  is  to  determine  whether  it  is  of 
deterministic  origin  or  stochastic  origin.  In  general  stochastic  noise  is  always  present,  for 
instance,  the  machine  noise  or  inherent  randomness  due  to  other  causes. 

Our  first  analysis  of  the  experimental  data  on  Al-Cu  alloys  [14]  has  shown  that  there  is 
a  finite  correlation  dimension.  This  was  obtained  using  the  algorithm  due  to  Grassberger 
and  Procaccia  (GP)  [19].  However,  the  existence  of  a  finite  correlation  dimension  cannot 
always  be  taken  as  an  indicator  of  chaotic  dynamics.  This  is  because  a  stochastic  process 
with  a  power  law  spectrum  can  also  exhibit  a  finite  correlation  dimension  [20,21],  and 
therefore  such  a  process  mimics  a  low  dimensional  attractor.  There  are  other  methods 
such  as  singular  value  decomposition  [22]  which  can  be  used  effectively,  particularly 
when  used  in  conjunction  with  other  methods  [23].  Even  so,  these  methods  only  provide 
dimensional  estimates,  and  as  such  cannot  always  be  taken  as  an  evidence  of  chaotic 
behaviour.  The  most  reliable  quantifier  of  chaotic  dynamics  is  the  existence  of  a  positive 
Lyapunov  exponent.  However,  conventional  methods  of  determining  the  Lyapunov  ex- 
ponent require  long  time  series  [24, 25]  which  are  impractical  to  obtain  from  experiments. 
Recently,  the  well  known  Eckmann's  algorithm  has  been  modified  to  suit  short  time 
series  [26]  giving  the  entire  Lyapunov  spectrum.  Yet  another  particularly  simple  method 
is  due  to  Gao  and  Zheng  [27].  This  method  allows  one  to  calculate  both  the  optimum 
delay  time  and  the  optimum  embedding  dimension  in  addition  to  providing  an  estimate  of 
the  positive  Lyapunov  exponent.  Finally,  it  is  always  worthwhile  to  generate  surrogate 
data  sets  and  repeat  the  analysis  to  ensure  that  the  signals  are  not  due  to  a  power  law 
stochastic  process. 

All  the  methods  are  based  on  phase  space  reconstruction  by  embedding  the  signal  in  a 
higher  dimensional  space  and  carrying  out  the  required  analysis.  The  purpose  of 
embedding  a  scalar  time  series  in  higher  dimension  is  to  reconstruct  other  variables  which 
constitute  the  dynamics  of  system  from  the  measured  time  series.  This  is  done  by 
embedding  the  signal  in  higher  and  higher  dimension  until  an  optimum  dimension  is 
reached.  The  object  of  increasing  the  embedding  dimension  is  to  remove  or  minimize  the 
contribution  from  'false  neighbours'  since  at  small  embedding  dimensions,  there  are  large 
number  of  such  'false  neighbours'.  The  rule  of  thumb  for  selecting  the  time  lag  is  to  use 

Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  n) 
708  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


Chaos  in  jerky  flow 
5 


0  2000  4000 

Time 

Figure  1.    A  typical  stress-time  plot  for  AlCu  alloy. 


the  value  of  the  time  at  which  the  correlation  function  falls  to  l/e  of  the  initial  value. 
Some  use  a  value  of  the  first  zero  crossing  time.  Thus,  there  is  some  arbitrariness  in 
estimating  the  optimal  time  delay.  The  method  due  to  Gao  and  Zheng  [27]  allows  one  to 
determine  the  optimum  value  of  the  time  lag  as  well  as  the  embedding  dimension  (see 
below). 

Consider  a  stress  signals  measured  from  a  jerky  flow  normally  sampled  at  regular 
intervals  of  time  represented  by  {cr,-  =  1 , . . . ,  N}.  A  typical  stress-strain  curve  is  shown  in 
figure  1.  The  constructed  d  dimensional  vector  is  £,-  =  {o^,  <T(,-+T),  . . . ,  0[i+(d-i)r]}>  where 
r  is  some  fixed  time  interval  used  as  a  delay.  Noise  present  in  the  signal  can  be  'cured'  by 
using  singular  value  decomposition  technique.  In  this  method,  the  (dxN)  trajectory 
matrix  A 

is  rotated  onto  the  basis  of  its  principal  vectors.  The  d  principal  values  of  the  matrix  A 
(ie.,  the  eigenvalues  of  the  covariance  matrix  A' A)  are  positive.  Eigenvalues  are  conven- 
tionally ordered  in  a  decreasing  order.  If  some  of  the  principal  values  say,  (q  -f- 1, . . . ,  d) 
are  zero,  then,  it  is  clear  that  the  trajectory  remains  confined  to  the  subspace  spanned  by 
the  basis  (!,...,#).  In  practice  the  presence  of  noise  prevents  these  eigenvalues  from 
taking  zero  value  exactly.  However,  there  may  be  a  set  of  eigenvalues  which  are  small 
compared  to  the  largest  one.  Then,  an  estimate  of  the  embedding  dimension  is  obtained 
by  the  sharp  decrease  in  the  eigenvalue  to  a  certain  low  level.  It  is  possible  then  to  obtain 
the  cured  trajectory  matrix  by  retaining  only  the  first  q  components  and  back  rotating 
these  components.  Often,  this  method  is  used  as  a  noise  reduction  technique. 
The  correlation  integral  C(r)  is  calculated  by 

'(r-'lS-Sl)  '  (6) 

where  #(••)  is  the  Heaviside  step  function  and  Np  is  the  number  of  pairs  of  the  embedding 
vectors  (&,§•)  whose  distance  is  less  than  r.  These  vectors  may  refer  to  either  the  d 
dimensional  vectors  obtained  directly  from  the  time  series  or  to  the  d  dimensional  vectors 
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obtained  by  back  rotating  the  first  p  principal  components.  We  still  need  to  identify  the 
optimal  delay  time.  This  can  be  done  using  the  following  method. 

In  a  chaotic  time  series,  points  on  two  neighbouring  trajectories  diverge  exponentially  in 
time.  This  divergence  can  be  measured  by  calculating  the  distance  between  them  at  t  =  0 
and  at  a  later  time  after  k  time  steps.  In  a  short  time  the  difference  vector  corresponding  to 
the  two  points  aligns  itself  in  the  direction  of  maximum  stretching.  Consider  two  vectors  & 
and  £j  in  d  dimension.  Let  dtj(k}  =\\  &+*  -  ^j+k  \\,  where  ||  ••  ||  is  the  Euclidean  distance 
between  the  two  vectors  at  time  k  between  £,  and  £,-.  Let  ditj(Q)  be  the  initial  distance. 
Since  for  a  chaotic  system,  dtj(k)  should  be  larger  than  d,-j(0),  a  measure  of  the 
divergence  is  the  time  dependent  Lyapunov  exponent  A(fc,  d,  r)  =  (\n[dij(k,z)/dij(Q)]}, 
where  (••}  refers  to  the  average  over  all  pairs  of  points  for  which  d/j(0)  is  less  than  a 
chosen  do.  Another  quantity  is  A+(k,d,  r)  where  the  average  is  limited  to  those  pairs  of 
points  for  which  (dij(k)/dij(Q})  >  1.  An  appropriate  window  tw  =  (d  —  l)r  is  imposed 
such  that  points  on  the  same  orbit  satisfying  |i  -7]  >  tw  are  discarded.  The  optimal  time 
lag  is  found  by  looking  for  the  minima  in  the  plots  of  A(r,  d,  k}  vs  r  and  A+(r,  d,  k}  vs  r 
for  various  values  of  the  embedding  dimension.  The  optimum  embedding  dimension  can 
also  be  determined  by  looking  for  minimal  changes  in  A(r,  fc,  d}  versus  r  as  d  is  increased 
around  the  minimum  of  the  curve.  In  addition,  the  divergence  plots  of  In  [dij(k}/dij(0]]  vs 
In  [dij(0i]]  do  not  change  much  as  we  increase  the  embedding  dimension  beyond  a  certain 
dimension  which  is  identified  as  the  optimal  embedding  dimension. 

Since  the  time  series  is  short,  the  calculation  of  the  entire  Lyapunov  spectrum  is  limited 
to  a  few  embedding  dimensions  (d  <  5).  This  method  is  a  modification  of  the  algorithm 
due  to  Eckmann  et  al  [25].  It  relies  on  the  construction  of  a  sequence  of  (dM,  <J?M)  matrices 
TI,  (dM^d)  which  map  the  difference  vector  £/  —  £/  to  6+fc  ~~  £j+k  and  successively 
reorthogonalize  7}  using  the  standard  QiRi  decomposition.  (<2;  is  an  orthogonal  matrix 
and  Ri  is  an  upper  triangle  matrix  with  positive  diagonal  elements.)  Then  the  Lyapunov 
exponent  is  given  by 

/=1,2,...,<*  (7) 


where  p  is  the  number  of  available  matrices  and  k  is  the  propagation  time  in  units  of  Af 
which  is  the  time  step.  The  modification  effected  in  this  method  is  to  choose  all  orbits  in  a 
shell  rather  than  a  sphere  so  that  the  effect  of  randomness  is  eliminated.  The  time  lag  used 
in  this  algorithm  is  the  value  where  the  correlation  function  falls  to  l/e  of  the  initial  value 
or  an  optimum  choice  determined  by  another  method. 

Finally,  one  method  of  determining  if  the  signals  are  due  to  a  stochastic  process  with  a 
power  law  spectrum  is  to  construct  surrogate  data  sets  and  repeat  the  analysis.  Surrogate 
data  sets  can  be  generated.  in  a  number  of  ways  [21].  One  simple  way  is  to  take  the 
Fourier  transform  of  the  signals  randomize  their  phases  and  Fourier  invert  it.  If  the 
surrogate  data  sets  do  not  show  a  finite  correlation  dimension  or  a  positive  Lyapunov 
exponent,  while  the  original  data  sets  exhibit  both,  this  is  taken  as  a  confirmation  that  the 
data  is  due  to  a  low  dimensional  attractor. 

All  the  algorithms  have  been  checked  against  standard  benchmark  cases  such  as 
Rossler  and  Lorenz  attractor;  good  agreement  with  the  known  values  in  the  literature  has 
been  obtained. 
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Figure  2.     A  plot  of  A.(k,  r}  vs  r  for  various  values  of  k  for  the  Rossler  attractor. 
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Figure  3a,b.    A  plot  of  A(r)  and  A+(T)  vs  r  for  fc  =  20  for  various  values  of  d. 

3.3  Analysis  of  experimental  signals 

In  the  method  due  to  Gao  and  Zheng,  the  choice  of  an  appropriate  value  of  k  is  not 
specified.  Generally,  it  is  so  chosen  that  it  reflects  a  reasonable  extent  of  divergence  and 
is  taken  to  be  of  the  order  of  the  correlation  time.  However,  often  one  finds  that  the 
minima  in  A(r,  k,  d}  vs  r  coincides  with  the  value  of  k  chosen  and  therefore  gives  a 
spurious  optimal  lag  time.  We  found  that  it  is  possible  to  estimate  the  correct  value  k  by 
plotting  A(r)  vs  r  for  various  values  of  fe.  We  find  that  the.  minima  in  A(r)  vs  r  for 
various  values  of  k  initially  increases  as  k  is  increased  and  then  decreases  going  through  a 
maximum.  This  maximum  value  is  chosen  to  be  the  optimum  value  of  k.  A  plot  of 
A(r,  d,  k)  vs  r  for  the  Rossler  attractor  is  shown  in  figure  2  for  various  values  of  k.  The  opti- 
mum k  can  be  seen  to  be  9.  We  find  that  this  value  fcopt  is  independent  of  the  choice  of  dQ. 
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Figure  4.  The  local  divergence  plots  with  a  window:   (a)  d  =  4,r  =  21,   (b) 

d  =  5,r  =  27,   (c)   d  =  5,r  =  21,   (d)   cf  =  5,r  =  27   (without   a   window),    (e) 


Figures  3a-b  show  plots  of  A(r,  d)  and  A+(r,d)  respectively  for  d  =  4, 5  and  6  for 
£  =  20  for  one  data  set.  When  we  increase  the  dimension  form  4  to  6,  we  see  that  the 
curve  corresponding  to  d  =  5  lies  lower  than  d  =  4,  while  d  =  6  does  not  change 
significantly  from  d  =  5.  This  suggests  that  d  =  5  is  the  optimum  dimension.  Further,  we 
also  note  that  the  minimum  occurs  around  r  =  27,  corresponding  to  the  optimal  time  lag. 
Figures  4a-f  show  the  divergence  plots  for  various  embedding  dimensions  and  delay 
times.  It  is  clear  that  as  we  approach  the  optimal  d,  the  divergence  plots  become  most 
compact.  Thus,  the  optimal  embedding  dimension  appears  to  be  d  =  5.  Nonoptimal 
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Figure  6.    Plot  of  the  correlation  integral  for  different  d  for  ropt  —  27. 


values  of  the  time  lag  r  can  be  seen  to  give  rise  to  excess  positive  excursions  over  the 
optimal  value.  This  can  be  seen  from  figure  4f  shown  for  d  =  6  and  r  —  37.  Note  that 
most  of  the  values  are  positive  even  though  the  plot  appears  quite  compact.  Using  the 
value  of  the  optimal  time  delay,  we  calculate  the  time  dependent  Lyapunov  exponent 
A (T, &,</).  This  is  shown  in  figure  5  for  d—5  and  6.  We  note  that  the  curve 
corresponding  to  d  =  5  is  slightly  more  linear  and  has  a  small  positive  intercept,  The 
value  of  the  Lyapunov  exponent  obtained  from  this  plot  is  1.4.  Having  obtained  the 
optimal  time  delay,  we  can  use  this  to  compute  the  correlation  dimension  for  this  data 
using  the  G-P  algorithm.  This  is  shown  in  figure  6.  It  is  clear  that  even  for  very  small 
embedding  dimension  of  5  there  is  a  saturation  effect  of  the  slope.  Note  the  dashed 
straight  lines  drawn  f or  d  =  4  and  5  are  parallel  implying  the  convergence  in  the  slope. 
The  slope  thus  obtained  is  2.1.  It  may  be  worthwhile  to  point  out  that  the  shrinking  of  the 
scaling  regime  starts  at  fairly  small  values  of  r  for  d  =  6.  Beyond  this  embedding 
dimension  we  find  that  the  scaling  regime  becomes  very  small.  This  behaviour  is 
common  to  short  time  series.  The  saturation  of  the  slope  for  such  a  low  embedding 
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Figure  7.    A  plot  of  A(r,  k,  d)  vs  k  for  optimum  r  for  various  values  of  do  for  the 
Rossler  attractor. 


dimension  must  be  contrasted  with  the  high  embedding  dimension  required  for  a  similar 
saturation  in  our  earlier  calculation  where  a  much  shorter  time  lag  was  employed.  This 
can  be  clearly  attributed  to  the  small  delay  time  that  was  used  in  our  previous  calculation 
[14]  since  it  is  the  total  window  length  tw  that  remains  constant. 

It  is  worthwhile  to  comment  here  on  the  actual  values  of  the  Lyapunov  exponent 
obtained  by  this  method.  We  found  that  the  Lyapunov  exponent  depends  on  the  value  of 
do  chosen.  Using  large  values  of  do  gives  smaller  values  of  the  exponent,  since  the 
number  of  points  contributing  to  the  average  is  large,  and  often,  is  not  a  true  indicator  of 
the  divergence.  On  the  other  hand,  if  too  few  points  are  included  ( ie.,  do  is  small)  then  the 
statistics  gets  poor  as  is  the  case  for  do  =  0.05%  of  the  attractor.  Indeed,  the  extent  of 
linearity  appears  to  be  less  when  do  is  large  and  it  improves  as  do  is  reduced.  It  appears  to 
break  down  for  small  values.  Thus  a  choice  of  a  few  percent  appears  to  be  reasonable. 
This  is  shown  in  figure  7  for  the  case  of  the  Rossler  attractor.  The  value  of  the  Lyapunov 
exponent  obtained  by  this  method  is  a  lower  bound  to  that  obtained  via  calculations 
(which  give  the  largest  exponent  [24]  or  the  entire  spectrum),  since  in  the  present  method 
the  average  is  over  all  possible  values  including  the  contracting  directions.  Even  so,  we 
feel  that  fixing  an  appropriate  value  of  this  parameter  in  a  way  that  gives  a  correct  value 
of  the  Lyapunov  exponent  for  the  experimental  signals  is  not  easy  due  to  the  presence  of 
noise  and  therefore,  the  actual  value  cannot  be  taken  seriously.  However,  it  must  be 
emphasized  that  the  Lyapunov  exponent  is  positive.  This  is  an  adequate  criterion  for  the 
time  series  to  be  chaotic. 

The  singular  value  decomposition  technique  has  been  used  to  analyse  the  data  as  well. 
The  spectrum  for  various  values  of  window  length  tw  is  shown  in  figure  8  .  It  is  clear  that 
there  is  a  sharp  drop  in  the  relative  magnitude  of  the  eigenvalues.  Indeed,  the  maximum 
fall  is  at  the  second  component.  The  changes  there  after  are  not  so  large.  This  shows  that 
the  number  of  degrees  of  freedom  is  of  the  order  of  two  or  three,  consistent  with  the  value 
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Figure  8.     A  plot  of  the  singular  value  spectrum  for  various  values  of  window  length. 
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Figure  9.    A  plot  of  slope  of  the  correlation  integral  for  various  values  of  window 
length  retaining  only  the  first  two  principal  components. 

obtained  from  calculation  of  the  correlation  dimension.  As  an  illustration  of  the  method, 
we  can  estimate  the  correlation  dimension  on  the  cured  time  series  obtained  by  back 
rotating  by  keeping  only  the  first  few  components.  Figure  9  shows  the  result  obtained  by 
keeping  only  two  components.  The  convergence  of  the  slope  is  clearly  around  2.  General 
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Figure  10.     A  plot  of  the  Lyapunov  spectrum  as  a  function  of  time  for  d  =  3  for  the 
original  data. 


features  are  not  altered  if  more  components  are  included,  since  the  correlation  dimension 
when  all  the  components  are  considered  is  2. 1  as  seen  from  figure  6. 

The  spectrum  of  Lyapunov  exponents  have  been  calculated  using  a  delay  time  of 
r  =  27  for  the  first  few  embedding  dimensions.  From  d  =  3,  we  find  a  positive  exponent. 
A  plot  of  the  Lyapunov  spectrum  as  a  function  of  time  (in  units  of  A? )  is  shown  for  d  =  3 
in  figure  10.  The  value  of  the  largest,  the  smallest  and  the  zero  exponent  remains  roughly 
the  same  from  d  =  3  up  to  d  —  5.  However,  there  is  an  additional  positive  Lyapunov 
exponent  seen  for  d  =  5  which  is  spurious.  This  statement  is  based  on  calculations  carried 
out  on  standard  models  like  the  Lorenz  and  the  Rossler  attractor.  In  these  two  cases,  we 
find  that  at  d  =  5,  there  is  an  additional  spurious  positive  eigenvalue.  This  is  due  to  the 
fact  that  we  need  at  least  10000  points  for  d  =  5. 

Finally,  we  have  generated  six  surrogate  data  files.  We  have  calculated  both  the 
correlation  dimension  and  the  Lyapunov  spectrum.  We  find  no  saturation  in  the  slope  of 
the  scaling  regime  in  any  of  the  six  surrogate  data  sets.  Further,  we  do  not  see  any  positive 
Lyapunov  exponent  for  dimensions  up  to  d  =  5  for  any  of  the  six  surrogate  files.  This 
clearly  shows  that  the  underlying  dynamics  of  the  original  data  is  due  to  a  low 
dimensional  attractor. 


4.  Discussion  and  conclusions 

Using  a  combination  of  several  methods  we  have  analysed  time  series  obtained  from 
experiments  on  AlCu  alloys.  The  singular  value  decomposition  method  clearly  shows  that 
the  orbit  is  confined  mostly  to  a  few  dimensions.  We  have  used  a  method  due  to  Gao  and 
Zheng  to  obtain  the  optimum  time  delay  and  the  optimum  embedding  dimension.  Since 
the  method  gives  positive  values  for  the  Lyapunov  exponent,  one  can  confidently  say  that 
the  jerky  flow  is  chaotic.  In  addition,  using  the  optimum  value  of  the  time  lag,  we  have 
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Chaos  in  jerky  flow 

calculated  the  correlation  dimension  of  value  2.1.  We  also  have  calculated  the  spectrum 
of  Lyapunov  exponents  by  a  method  suited  for  short  time  series.  There  is  one  positive 
Lyapunov  exponent  of  magnitude  ~  4.  We  have  further  carried  out  a  surrogate  analysis  to 
show  that  the  time  series  is  due  to  a  low  dimensional  chaotic  attractor.  These  calculations 
confirm  the  theoretical  prediction  that  jerky  flow  could  be  chaotic.  Further,  the  analysis 
shows  the  optimum  embedding  dimension  for  the  time  series  to  be  Jopt  =  5.  This  is  also 
consistent  with  the  correlation  dimension  2.1  obtained  by  using  optimal  time  delay  of 
^opt  =  26.  In  conclusion,  the  important  prediction,  namely,  the  existence  of  chaotic  flow 
in  the  PLC  effect  is  confirmed.  This  verification  strongly  suggests  the  inherent  correctness 
of  the  basis  of  the  theory,  namely,  that  the  dynamical  origin  of  the  PLC  effect  is  the 
nonlinear  interaction  between  the  participating  defects.  Further,  the  optimum  embedding 
dimension  being  5  tells  us  that  we  need  only  five  degrees  of  freedom  to  describe  the 
dynamics  of  jerky  flow.  We  recall  that  we  had  only  four  degrees  of  freedom  since  the 
variable  corresponding  to  the  concentration  of  solute  atoms  was  deliberately  suppressed 
in  the  original  formulation  of  the  model  for  the  sake  of  simplicity.  Lastly,  it  may  be 
recalled  that  the  phenomenon  is  spatially  inhomogeneous  which  implies  infinite  degrees 
of  freedom.  Thus,  these  few  degrees  of  freedom  must  correspond  to  the  collective 
behaviour  of  dislocations  and  other  defects.  This  fact  gives  a  clue  as  to  why  this  theory 
works  so  well  in  spite  of  the  fact  that  it  ignores  the  spatially  inhomogeneous  aspect  which 
is  thought  to  be  so  crucial  for  a  proper  description  of  the  phenomenon. 

Acknowledgement 

This  work  is  partly  supported  by  IFCPAR  Grant  No.  1108-1.  One  of  the  authors  (GA) 
wishes  to  acknowledge  the  support  from  IUFM  de  Lorraine  and  the  Univeriste  of  Metz. 
The  authors  are  thankful  to  the  Poitier's  group  for  supplying  the  data. 

References 

[1]  B  J  Brindley  and  P  J  Worthington,  Metal.  Rev.  145,  101  (1970) 

[2]  A  H  Cottrell,  Phil.  Mag.  44,  829  (1953) 

[3]  G  Ananthakrishna  and  D  Sahoo,  J.  Phys.  D14,  2091  (1981) 

[4]  M  C  Valsakumar  and  G  Ananthakrishna,  /  Phys.  D16,1055  (1983) 

[5]  G  Ananthakrishna  and  M  C  Valsakumar,  /.  Phys.  D15,  L171  (1982) 

[6]  L  P  Kubin  and  Y  Estrin,  Acta  Metall.  33,  397  (1985) 

[7]  L  P  Kubin  and  Y  Estrin,  Acta  Metall.  38,  697  (1990) 

Y  Estrin  and  L  P  Kubin,  /.  Mech.  Behavior  of  Materials  2,  255  (1989) 
[8]  Viewpoint  Set  on  Propagative  Instabilities,  Scripta  Metall  Mater.  29,  1147  (1993) 
[9]  V  Jeanclaude,  C  Fressengeas  and  L  P  Kubin,  Non  Linear  Phenomena  in  Materials  Science  II: 

Solid  State  Phenomena,  Vol  23-24,  (  Switzerland:  Trans  Tech  Publications  1992)  pp.  403. 
[10]  G  Ananthakrishna  and  M  C  Valsakumar,  Phys.  Lett.  A95,  69  (1983) 
[11]  T  M  John  and  G  Ananthakrishjia  in  Directions  in  Chaos  Ed.  Hao  Bai-Lin,  (Singapore:  World 

Scientific  1990),  pp  133 
[12]  G  Ananthakrishna  in  Non  Linear  Phenomena  in  Materials  Science  II:  Solid  State  Phenomena, 

Vol  23-24,  (Switzerland:  Trans  Tech  Publishers  1992)  pp.  417 
[13]  G  Ananthakrishna,  Script.  Met.  29,  1183  (1993) 

[14]  G  Ananthakrishna,   C   Fressengeas,   M  Grosbras,   J   Vergnol,   C   Engelke,   J  Plessing, 
H  Neuhauser,  E  Bouchaud,  J  Planes  and  L  P  Kubin,  Scripta  Metall.  Mater.  32,  1731  (1995) 


[15]  G  Ananthakrishna  and  S  J  Noronha,  Solid  State  Phenomena,  Vol  42-43,  (Switzerland:  Scitech 

Publishers  1995),  pp.  277  . 

[16]  L  Quaouire  and  C  Fressengeas,  Solid  State  Phenomena,  Vol  42-43,  (Switzerland:  Scitech 

Publishers  1995),  pp.  293 
[17]  S  Venkadesan,  KPN  Murthy  and  M  C  Valsakumar,  Solid  State  Phenomena,  Vol  42-43, 

(Switzerland:  Scitech  Publishers  1995),  pp.  287 
[18]  M  Bekele  and  G  Ananthakrishna,  in,  So/iW  State  Phenomena,  Vol.  42-43,  (Switzerland: 

Scitech  Publications  1995)  pp.  303;  M  Bekele  and  G  Ananthakrishna,  submitted  to  Physica  D 
[19]  P  Grassberger  and  I  Procaccia,  Physica  D9,  189  (1983) 
[20]  A  R  Osborne  and  A  Provenzale,  Physica  D35,  357  (1989) 

[21]  J  Theiler,  S  Eubank,  A  Longtin,  B  Galdrikian  and  J  D  Former,  Physica  D58,  77  (1992.) 
[22]  B  S  Broomhead  and  G  P  King,  Physica  D20,  217  (1986) 

[23]  A  M  Albano,  J  Muench,  C  Schwartz,  A  I  Mees  and  P  E  Rapp,  Phys.  Rev.  A38,  3017  (1988) 
[24]  A  Wolf  J  B  Swift,  H  L  Swinney  and  J  A  Vastano,  Physica  D16,  285  (1985) 
[25]  J  P  Eckmann,  S  O  Kamphorst,  C  Ruelle,  and  S  Cileberto,  Phys.  Rev.  E34,  4971  (1986) 
[26]  X  Zeng,  R  Eykholt  and  R  A  Pielke,  Phys.  Rev.  Lett.  66,  3229  (1991) 
[27]  J  Gao  and  Z  Zheng,  Phys.  Rev.  E49,  3807  (1994) 


Pramana  -  J.  Phys.,  Vol.  48,  No.  2,  February  1997  (Part  H) 
718  Special  issue  on  "Nonlinearity  &  Chaos  in  the  Physical  Sciences" 


ui  reuruary 

physics  pp.  719-736 


Chaos  and  predictability  of  the  Indian  summer  monsoon 
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Abstract.  Predictability  of  the  Indian  summer  monsoon  is  investigated  by  conducting  three  multi- 
year  integrations  with  the  Geophysical  Fluid  Dynamics  Laboratory's  climate  model.  The  mean 
monsoon  simulated  by  the  model  is  realistic.  It  is  shown  that  a  significant  fraction  of  the  interannual 
variance  of  the  simulated  Indian  summer  monsoon  may  be  due  to  internal  dynamics.  It  is  discovered 
that  the  tropical  atmosphere  is  capable  of  sustaining  a  quasi-biennial  oscillation  (QBO)  accounting 
for  most  of  the  internal  low  frequency  variability.  It  is  also  shown  that  neither  air-sea  interaction  nor 
surface  hydrology  feedback  is  essential  for  the  QBO  of  the  model  atmosphere.  That  such  a  QBO 
can  arise  due  to  modulation  of  the  nonlinear  intraseasonal  oscillations  by  the  annual  cycle  is 
demonstrated  using  a  simple  nonlinear  dynamical  model.  The  phase  and  the  amplitude  of  the 
internal  mode  is  unpredictable  and  hence  may  be  responsible  for  limiting  the  long  range 
predictability  of  the  monsoon. 

Keywords.     Seasonal  monsoon;  predictability;  slowly  varying  forcing;  internal  dynamics. 
PACS  Nos    47.27;  92.10;  92.20;  92.60;  95.75 


1.  Introduction 

The  weather  representing  the  instantaneous  state  of  the  atmosphere  and  the  climate 
representing  the  space/time  averages  are  both  governed  by  a  set  of  physical  laws.  The 
atmosphere  and  the  coupled  ocean-atmosphere  system  may  be  represented  by  a  set  of 
equations  that  include  certain  approximations  in  describing  some  of  the  physical 
processes  such  as  radiative  heating,  convective  heating  associated  with  clouds,  boundary- 
layer  turbulent  fluxes  of  heat,  momentum  and  mass  etc.  With  initial  conditions  defined  by 
observations,  such  a  set  of  equations  or  a  'model'  may  be  integrated  forward  in  time  using 
a  powerful  computer.  With  improved  models  and  improved  specification  of  the  initial 
conditions,  what  limits  our  ability  to  predict  the  weather?  This  question  was  first 
addressed  by  Lorenz  [1].  Using  a  simple  model,  he  showed  that  small  errors  in  the  initial 
conditions  would  double  in  about  three  days.  Keeping  in  mind  some  of  the  intrinsic 
limitations  of  the  observing  system,  this  puts  a  limit  of  two  to  three  weeks  on  our  ability 
to  predict  the  weather.  This  limit,  often  known  as  the  'limit  on  deterministic 
predictability',  has  now  been  confirmed  using  more  sophisticated  models.  This  limit  is 
governed  by  the  instabilities  and  nonlinearities  of-  the  system.  The  instabilities  make  the 
initial  error  grow  in  time  while  the  nonlinearities  help  the  errors  to  reach  a  saturation  level 
keeping  the  system  within  certain  bounds. 

In  contrast  to  the  tremendous  strides  made  by  numerical  weather  prediction,  the 
climate  prediction  efforts   using  deterministic  models   are  still  in  early   stages  of 
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deterministic  limit  on  predictability  could  be  achieved.  However,  it  was  demonstrated  by 
Charney  and  Shukla  [2]  that  although  the  instantaneous  state  of  the  atmosphere  (or  the 
weather)  may  not  be  predictable  beyond  2-3  weeks,  the  climate  or  the  statistical 
properties  of  the  atmosphere  (such  as  time  and/or  space  means)  may  still  be  predictable 
beyond  this  limit.  This  is  possible  because  the  climate  is  governed  by  low  frequency 
planetary-scale  flow  patterns.  If  these  low  frequency  (LF)  planetary-scale  flow  patterns 
were  solely  governed  by  internal  dynamics  or  instabilities  of  their  own  scale  or  scale 
interactions  with  the  high-frequency  synoptic  disturbances,  then  there  will  not  be  much 
hope  for  predictions  beyond  the  limit  on  deterministic  predictability.  However,  these  LF 
planetary-scale  patterns  are  also  expected  to  be  governed  to  a  large  extent  by  the  forcing 
associated  with  slowly  varying  boundary  conditions  such  as  sea  surface  temperature 
(SST),  soil  moisture,  snow  cover,  etc.  These  slowly  varying  boundary  conditions  give  rise 
to  slowly  varying  forcings  in  the  atmosphere  that  provide  significant  coherent  changes  in 
the  planetary  scales  at  LF,  leading  to  potential  predictability  of  the  space-time  averages. 
This  has  been  supported  by  a  large  number  of  atmospheric  general  circulation  model 
(AGCM)  studies  forced  by  observed  sea  surface  temperature  (SST)  variations  (see  [3]  and 
references  therein)  and  provide  the  conceptual  basis  for  climate  prediction. 

Several  observational  and  modeling  studies  have  established  that  the  Indian  summer 
monsoon  is  linked  with  several  surface  boundary  forcings.  In  addition  to  links  with 
Eurasian  snow  cover  [4—6]  and  soil  moisture  over  Indian  continent  [7],  the  link  with 
Pacific  SST  and  Southern  oscillation  is  considered  strong  and  has  been  studied  by  many 
authors  [8-11].  This  background  led  to  the  optimism  that  long  range  dynamical 
prediction  of  the  Indian  summer  monsoon  one  or  two  seasons  in  advance  may  be  feasible. 
As  a  result,  a  large  number  of  studies  in  the  last  few  years  have  made  serious  attempts  to 
simulate  the  interannual  variability  (IAV)  of  Indian  summer  monsoon  [12-14]  and  to 
hind-cast  seasonal  mean  monsoon  rainfall  [15,  16]  using  different  atmospheric  general 
circulation  models  (AGCMs).  Three  important  observations  made  from  these  studies  are 
summarized  below. 

First,  most  models  still  have  a  serious  problem  in  simulating  the  mean  summer 
precipitation  over  the  Indian  region.  They  tend  to  simulate  too  little  precipitation  over  the 
continent.  In  some  cases,  even  during  the  summer  months  a  strong  inter  tropical 
convergence  zone  (ITCZ)  persists  over  the  equatorial  Indian  ocean.  Second,  most  models 
can  simulate  the  interannual  variability  of  some  planetary  scale  component  of  the 
monsoon  such  as  the  divergent  circulation  (as  given  by  the  velocity  potential).  However, 
the  simulation  of  the  LAV  of  the  monsoon  rainfall  differs  widely  from  one  model  to 
another  [15]  indicating  great  sensitivity  of  this  regional  part  of  the  circulation  on 
resolutions  and  physical  parameterizations  of  the  models.  In  addition,  while  the 
prediction  of  the  seasonal  mean  rainfall  in  other  parts  of  the  tropics  (e.g.,  Sahel  or 
equatorial  Pacific)  do  not  seem  to  be  sensitive  to  small  changes  in  the  initial  conditions, 
the  simulation  of  the  seasonal  mean  Indian  monsoon  rainfall  seems  to  be  rather  sensitive 
to  small  changes  in  the  initial  conditions  [17].  This  indicates  that  the  mean  monsoon 
circulation  in  the  tropics  may  be  unique  in  that  it  is  not  entirely  forced  by  slowly  varying 
boundary  forcings  but  is  also  governed  by  internal  dynamics  to  some  extent. 
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The  limit  on  predictability  of  Indian  summer  monsoon  would  be  determined  by  the 
degree  to  which  the  mean  monsoon  is  governed  by  internal  dynamics,.  Quantitative 
estimates  of  this  contribution  to  the  mean  monsoon  is  currently  lacking.  In  the  present 
study,  an  attempt  is  made  to  make  quantitative  estimates  of  contributions  from  boundary 
forcings  and  internal  dynamics  to  the  interannual  variability  of  the  monsoon.  As  our 
primary  objective  relates  to  the  predictability  of  the  Indian  summer  monsoon,  we  select 
an  AGCM  whose  simulation  of  mean  precipitation  during  the  northern  summer  over  the 
monsoon  region  is  reasonably  close  to  the  observed  mean  precipitation.  Then,  the  forced 
variability  is  estimated  from  a  multi-year  integration  of  the  model  with  observed  sea 
surface  temperature  as  the  boundary  condition.  The  internal  variability  is  estimated  from 
a  multi-year  run  of  the  same  model  with  fixed  seasonal  cycle  SST.  Another  sensitivity 
experiment  with  the  fixed  seasonal  cycle  of  SST  as  well  as  that  of  the  soil  moisture  has 
been  conducted  to  derive  some  insight  regarding  the  origin  of  the  low  frequency  internal 
oscillations  in  the  model.  The  model  used  and  the  experiments  conducted  are  described  in 
§  2.  The  mean  monsoon  simulation  of  the  model  is  discussed  in  §  3.  The  simulation  of  the 
SST  forced  interannual  variations  from  the  observed  SST  run  are  discussed  in  §  4.  The 
interannual  variability  due  to  internal  dynamics  is  also  examined  in  this  section.  The 
amplitude  of  interannual  variations  of  Indian  summer  monsoon  indices  due  to  internal 
dynamics  is  compared  to  the  amplitude  of  forced  variability.  The  nature  of  the  low 
frequency  variability  in  the  climatological  SST  run  is  also  investigated.  It  is  discovered 
that  the  model  atmosphere  exhibits  an  internal  quasi-biennial  oscillation.  Possible 
physical  mechanisms  for  the  origin  of  this  biennial  oscillation  are  discussed  in  §  5.  A 
summary  of  the  results  is  presented  in  §6.  The  implications  of  our  results  to  the 
predictability  of  Indian  summer  monsoon  are  also  discussed  here. 


2.  The  model  and  experiments 

The  atmospheric  model  used  in  this  study  is  a  version  of  the  Geophysical  Fluid  Dynamics 
Laboratory  (GFDL)  climate  model.  It  is  a  spectral  model  where  the  dynamic 
computations  are  performed  using  so  called  spectral  element  method  in  which  the 
distribution  of  a  predicted  variable  is  represented  by  a  series  of  spherical  harmonics  and 
grid  points.  This  version  has  a  rhomboidal  30  horizontal  resolution  (30  zonal  waves  and 
30  associated  Legendre  functions;  approximately  3.75°  longitude  by  2.25°  latitude 
resolution)  and  14  unevenly  spaced  sigma  levels  in  the  vertical  (R30L14  version).  The 
vertical  derivatives  in  the  prognostic  equations  are  computed  by  a  centered,  second  order 
finite  difference  scheme.  The  lower  resolution  version  (R15L09)  described  by  Gordon 
and  Stern  [18]  has  been  used  in  many  climate  studies  [19,  20].  The  physical  processes  and 
their  parameterizations  included  in  this  version  of  the  model  are  briefly  described  below. 
The  distribution  of  insolation  at  the  top  of  the  model  atmosphere  is  prescribed.  It  varies 
seasonally  but  does  not  have  a  diurnal  cycle.  The  solar  constant  is  1365  W/m2  and  the 
mixing  ratio  of  carbon  dioxide  is  taken  as  345  ppm.  Solar  radiation  is  computed  using 
method  similar  to  that  used  by  Lacis  and  Hansen  [21]  except  that  the  bulk  optical 
properties  of  clouds  such  as  reflectivity  and  absorptivity  are  prescribed.  It  includes 
multiple  reflections  by  clouds  but  does  not  account  for  aerosols.  Terrestrial  long  wave 
radiation  is  computed  following  the  method  described  by  Rodgers  and  Walshaw  [22]  and 
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as  modified  by  Stone  and  Manabe  [23].  It  includes  the  effect  of  the  water  vapour 
continuum.  Ozone  is  prescribed  as  a  function  of  latitude,  height  and  season  based  on  data 
from  Hering  and  Borden  [24]. 

Precipitation  is  computed  following  the  'moist  convective  adjustment  scheme' 
described  by  Manabe  et  al  [25].  Soil  moisture  is  computed  by  the  so  called  bucket 
method  [26].  Over  the  continents  surface  temperature  is  computed  from  the  condition  of 
heat  balance  at  the  surface  assuming  a  zero  heat  capacity.  The  model  also  include  a 
gravity  wave  drag  formulation  developed  by  Hayashi  and  described  by  Broccoli  and 
Manabe  [27].  A  simple  mixing  length  theory  for  vertical  diffusion  of  momentum,  heat 
and  moisture  is  also  included.  Prediction  of  cloud  cover  is  done  following  the  scheme 
used  by  Wetherald  and  Manabe  [28]. 

Three  multi-year  integrations  are  carried  out  with  the  model.  The  three  experiments  are 
briefly  described  below. 

OBS-SST  run:  In  this  run  the  observed  monthly  mean  global  SST  [29]  is  prescribed. 
Monthly  mean  values  are  interpolated  from  one  month  to  another  to  produce  the  required 
daily  values.  The  soil  moisture  and  snow  cover  are  predicted.  The  model  was  integrated 
for  15  years  with  observed  SST  from  January  1979  through  December  1993  as  boundary 
conditions.  The  model  was  initially  spun  up  with  mean  seasonal  cycle  SST  for  five  years 
from  a  resting  isothermal  state.  The  low  frequency  interannual  variations  of  the  observed 
SST  associated  with  the  El  Nino  and  southern  oscillation  (ENSO)  force  low  frequency 
interannual  variations  in  the  atmosphere  in  this  run. 

CLI-SST  run:  In  this  run  mean  seasonal  cycle  global  SST  was  prescribed  which 
was  repeated  every  year.  The  soil  moisture  and  snow  cover  are  predicted.  The  mean 
seasonal  cycle  of  SST  was  taken  from  Levitus  [30].  The  climatological  sea-ice  used  in 
this  run  was  taken  from  Walsh  and  Johnson  [31]  and  Zwally  et  al  [32].  The  initial 
conditions  for  this  run  consisted  of  a  resting  isothermal  atmosphere.  The  model  was 
integrated  for  40  years  and  the  last  20  years  are  considered  here.  The  model  atmosphere 
was  found  to  reach  a  quasi-equilibrium  after  about  15  years  initial  integration.  In  this  run, 
except  for  the  annual  cycle  forcing,  there  is  no  external  interannual  forcing.  Internal 
dynamics  and  feedback  with  surface  hydrological  processes  could  still  give  rise  to  some 
interannual  variability. 

CSST-SM  run:  Both  the  seasonal  cycle  of  SST  and  the  seasonal  cycle  of  soil  moisture 
are  prescribed  in  this  run.  The  mean  seasonal  cycle  of  SST  used  was  same  as  in  the  CLI- 
SST  run.  The  seasonal  cycle  of  soil  moisture  at  each  grid  point  were  constructed  from  the 
15  year  simulations  of  the  OBS-SST  run.  The  snow  cover  feedback  is  also  eliminated 
from  the  model.  This  integration  was  carried  out  for  18  years.  In  this  run,  even  the  surface 
hydrological  feedback  is  eliminated.  Therefore,  any  interannual  variability  in  this  run 
must  arise  purely  due  to  internal  dynamics  in  the  presence  of  the  annual  cycle  forcing. 
Monthly  mean  output  is  analyzed  from  all  the  runs. 


3.  Mean  monsoon  simulated  by  the  model 

The  Asian  summer  monsoon  has  some  unique  characteristic  features  such  as  the  low  level 
westerly  jet  over  the  north  Arabian  sea  and  the  easterly  jet  in  the  upper  troposphere.  The 
maximum  strength  of  the  easterly  jet  is  at  around  150  mb  with  its  center  shifted  to  the  east 
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Figure  1.  Mean  January  (JAN),  August  (AUG)  and  June-September  (JJAS) 
precipitation.  The  mean  derived  from  the  OBS-SST  run  (right  panel)  is  compared 
with  observations  ([33],  left  panelsX.  Contours  are  1 , 4, 8, 16, 32  mm/day.  Precipitation 
greater  than  8  mm/ day  is  shaded. 


of  the  low  level  westerly  jet  center  by  about  10°  longitude.  In  addition,  the  movement 
of  the  mean  position  of  the  ITCZ  to  the  Indian  continent  and  relatively  sudden  'onset' 
and  'retreat'  of  the  monsoon  constitute  other  characteristic  features  of  the  Indian 
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multi-year  simulations  with  the  same  model.  Since  the  annual  cycle  of  the  SST  is 

nearly  identical  and  the  annual  cycle  of  the  solar  forcing  is  identical  in  all  three 

experiments,  the  mean  annual  cycle  derived  from  the  three  runs  should  also  be  nearly 

identical.  It  is  indeed  found  to  be  so  (not  shown).  In  figure  1,  we  show  the  precipitation 

climatology  for  January,  August  and  June-September  mean  from  the  OBS-SST  run 

and  compare  them  with  observations  [33].  During  winter  (January),  the  position  of 

the  ITCZ  is  correctly  simulated.  However,  the  model  simulates  a  smaller  area  of 

heavy  precipitation   (>  8  mm/ day)   over  Indonesia  than  observed.   During   summer 

(August),  the  ITCZ  does  move  to  a  position  over  the  Indian  continent.  The  maximum 

over  the  northern  Bay  of  Bengal  is  well  simulated.  The  precipitation  over  most  of 

Indian  continent  is  reasonably  well  simulated.  The  maximum  off  the  western  coast  is 

not  well  simulated.  This  is  related  to  the  model's  resolution  being  insufficient  to 

resolve  the  Western  Ghat  mountains  adequately.  The  seasonal  mean  (June-September) 

precipitation  is  also  shown  here  as  in  our  discussion  of  interannual  variability  we  shall 

be  referring  to  seasonal  mean  anomalies.  It  is  noted  that  the  seasonal  mean  precipitation 

is  also  well  simulated  by  the  model.  The  simulated  precipitation  over  central  India 

is  slightly  weaker  than  observed.  Considering  the  low  resolution  of  the  model  and 

the  rather  simple  parameterization  of  cumulus  convection  employed  by  the  model,  the 

simulation  of  the  mean  precipitation  is  realistic.  The  model,  however,  underestimates  the 

weaker  second  precipitation  zone  over  the  equatorial  Indian  ocean  during  northern 

summer. 

We  have  also  examined  the  simulation  of  the  Somali  jet,  the  easterly  jet  in  the  upper 
troposphere  and  the  onset  and  retreat  of  monsoon  by  the  model.  It  is  found  the  model 
does  a  reasonably  good  job  of  simulating  all  these  features. 


4.  Interannual  variability:  SST  forced  versus  internal 

In  this  section  we  discuss  the  interannual  variability  of  some  monsoon  indices  simulated 
by  the  model  in  the  OBS-SST  run.  Figure  2  shows  the  interannual  variability  of  the  June- 
September  (JJAS)  normalized  precipitation  anomalies.  The  top  panel  contains  the 
observed  all  India  rainfall  anomalies.  1982  and  1987  were  deficient  rainfall  years  while 
1983  and  1988  were  excess  rainfall  years.  The  lower  panel  shows  simulated  precipitation 
averaged  only  over  'Indian  region'  (70°-95°E,  10°-30°N,  including  the  oceanic  region 
included  in  this  box).  The  model  captures  the  variability  during  1987  and  1988  but  fails  to 
do  so  during  1982.  In  general  the  model's  skill  in  simulating  the  observed  interannual 
variability  of  the  Indian  summer  monsoon  is  rather  modest.  Out  of  the  15  years  the  sign  of 
the  interannual  anomaly  is  correctly  simulated  in  9  years. 

In  order  to  estimate  how  much  of  the  interannual  variability  of  the  monsoon  discussed 
above  is  due  to  SST  forcing  and  how  much  may  be  due  to  internal  dynamics,  we  compare 
the  interannual  variability  among  the  OBS-SST  and  CLI-SST  runs.  Figure  3  shows  the 
precipitation  averaged  over  the  'monsoon  region'  and  the  'broad  scale  monsoon  shear 
index'.  The  'broad  scale  monsoon  shear  index'  is  defined  as  the  zonal  wind  shear  (U  at 
170mb  -  U  at  860mb)  averaged  over  EQ-200  N  and  40°-100°E  [11].  Both  indices  are 
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Figure  2.  Comparison  of  simulated  interannual  variability  of  the  seasonal  mean 
monsoon  (June-September)  precipitation  with  observations,  (a)  Observed  'All-India' 
precipitation  anomalies,  (b)  the  simulated  precipitation  anomalies  averaged  over  India 
and  the  Bay  of  Bengal  (70°-95°E  10°-30°N).  In  both  cases  the  precipitation 
anomalies  are  normalized  with  their  own  standard  deviation. 

averaged  over  the  summer  monsoon  season,  June-September.  An  examination  of  the 
interannual  variability  of  the  precipitation  index  shows  that,  although  the  observed  SST 
tends  to  enhance  the  variability,  the  internal  dynamics  alone  produces  considerable 
interannual  variability.  The  largest  precipitation  anomalies  in  the  OBS-SST  run  occur 
either  during  strong  warm  or  cold  events  in  the  Pacific  (1987, 1988).  The  amplitude  of  the 
interannual  variability  in  the  remaining  'normal'  years  is  quite  close  to  the  interannual 
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Figure  3.  Comparison  of  interannual  variability  of"two  summer  monsoon  indices 
between  OBS-SST  and  CLI-SST  runs.  Left  panels  correspond  to  JJAS  averaged 
precipitation  (mm/day)  averaged  over  Bay  of  Bengal  (70°-95°  E,  10°-30°  N),  right 
panels  correspond  to  JJAS  averaged  'broad  scale  monsoon  shear  index'  in  m/s.  The 
interannual  standard  deviations  are  noted  in  each  panel.  Results  of  20  years  of 
simulation  are  shown  for  the  CLI-SST  runs.  The  year  labeling  for  the  CLI-SST  run 
are  arbitrary  and  has  been  made  to  match  with  that  of  the  OBS-SST  run  for  the  first  15 
years.  The  last  five  years  of  the  plot  for  OBS-SST  run  is  left  blank. 

variability  in  the  CLI-SST  run.  This  indicates  that  strong  cold  (warm)  events  in  the 
Pacific  are  associated  with  increased  (decreased)  precipitation  over  the  monsoon  region. 
However,  if  the  SST  anomalies  are  not  very  large  during  the  summer  season,  the  internal 
variability  could  dominate  the  variability  of  the  monsoon  precipitation.  This  is  consistent 
with  the  predictability  studies  of  Brankovic  and  Palmer  [34].  The  simulation  of  the  'broad 
scale  monsoon  shear  index'  also  shows  that  the  amplitude  of  the  interannual  variability 
generated  purely  by  internal  dynamics  (CLI-SST)  is,  comparable  to  that  forced  by  the 
slowly  varying  SST  boundary  forcing  (OBS-SST).  In  the  OBS-SST  run  the  entire 
variability  is  dominated  by  the  1987  event.  The  interannual  variability  produced  by  the 
internal  dynamics  is  often  larger  than  that  in  the  SST  boundary  forced  run. 

4.1  A  Quasi-biennial  internal  oscillation 

From  the  above  discussions,  it  is  clear  that  the  model  atmosphere  can  generate  significant 
interannual  variability  even  in  the  absence  of  any  external  forcing  with  interannual 
periods.  What  then  is  responsible  for  these  interannual  variations?  Are  there  significant 
natural  oscillations  of  the  system  or  are  they  simply  part  of  a  red-noise  process?  To  get 
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Figure  4.  The  interannual  variability  in  the  CLI-SST  simulation.  Time  series  of  5 
month  running  mean  equatorial  (5°S-5°N)  precipitation  P  (mm/day,  140°-160°E) 
and  zonal  wind  at  170mb  (140°~180°E)  and  860mb  (50°-70°E)  (m/s). 


some  idea  regarding  the  temporal  variability  of  the  low  frequency  internal  oscillations, 
we  show  5  month  running  means  of  three  different  variables  averaged  over  three  different 
regions  from  the  CLI-SST  run  in  figure  4.  The  precipitation  is  averaged  over  5°  S-5°  N 
and  140°-160°E.  The  zonal  winds  at  170mb  and  860mb  are  averaged  over  5°S-5°N 
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Figure  5.  Power  spectra  of  un-filtered  monthly  mean  precipitation  and  zonal  winds 
averaged  over  same  regions  shown  in  figure  4,  for  the  CLI-SST  run.  (a)  Precipitation 
(5°  S-5°  N,  140°-160°  E),  (b)  zonal  wind  at  860mb,  (5°  S-5°  N,  50°-70°  E),  (c)  zonal 
wind  at  1 70  mb  (5°  S-5°  N,  140M  80°  E).  The  theoretical  red  noise  spectrum  based  on 
lag  1  auto-correlation  (thick  solid)  and  the  95%  confidence  limits  are  also  shown. 

and  140°-180°  and  50°~70°E  respectively.  It  is  clear  that  all  the  three  variables  show 
significant  low  frequency  variability.  In  particular  the  precipitation  time  series  tends  to 
indicate  a  quasi-biennial  variability.  Quasi-biennial  variations  can  be  also  be  detected  in 
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the  wind  time  series.  This  preliminary  examination  indicates  that  the  model  atmosphere 
has  a  quasi-biennial  internal  oscillation.  A  spectrum  analysis  of  the  original  monthly 
mean  time  series  confirms  this  (figure  5).  The  95%  confidence  limits  in  figure  5  are 
calculated  from  the  lag  1  auto  correlations  of  the  original  time  series.  It  is  noteworthy  that 
all  three  time  series  have  a  statistically  significant  peak  around  a  biennial  periodicity.  We 
must  add  here  that  the  three  regions  shown  here  have  been  selected  after  some  trial  and 
error.  This  is  because  the  biennial  variability  has  regional  scale  spatial  structure. 
Therefore,  if  we  average  the  anomalies  over  some  arbitrary  regions,  there  is  a  danger  of 
canceling  positive  and  negative  anomalies  leading  to  an  insignificant  signal. 


5.  Possible  mechanisms 

Air-sea  coupling,  air-land  surface  coupling  [35-37]  and  interaction  between  nonlinear 
intra-seasonal  oscillations  and  the  annual  cycle  [38]  have  been  invoked  to  explain  the 
origin  of  the  observed  biennial  component  of  the  interannual  variability  in  the  Indian  and 
Pacific  region.  As  climatological  SST  is  prescribed  in  our  run,  ocean-atmosphere 
coupling  could  not  be  responsible  for  our  biennial  oscillation.  The  other  possibility  is  the 
atmosphere  and  land-surface  feedback  as  envisaged  by  Meehl  [37].  Qualitatively,  this 
feedback  may  be  described  as  follows.  The  strength  of  the  monsoon  circulation  is  related 
to  the  north-south  temperature  gradient  over  the  region.  A  strong  monsoon  is  associated 
with  a  strong  north-south  temperature  gradient  and  results  in  excess  precipitation  and  soil 
moisture  over  land.  If  the  enhanced  soil  moisture  persists  through  the  next  three  seasons, 
land  would  be  wetter  than  normal  prior  to  the  next  monsoon.  Evaporation  and  enhanced 
latent  heat  flux  would  result  in  cooler  land  temperature  and  weaker  north-south 
temperature  gradient  leading  to  a  weaker  monsoon.  A  weak  monsoon  this  year  would  be 
associated  with  below  normal  precipitation  which  through  the  same  arguments  would 
lead  to  a  strong  north-south  temperature  gradient  and  a  strong  monsoon  next  year.  In  this 
manner  the  cycle  could  be  repeated  every  two  years.  While  this  mechanism  could  work  in 
principle,  there  is  no  convincing  evidence  that  soil  moisture  anomalies  persist  for  three  or 
more  seasons.  In  order  to  determine  whether  this  feedback  is  at  work  in  producing  the 
biennial  oscillation  in  our  model  atmosphere,  we  carried  out  an  additional  multi-year  run 
in  which  in  addition  to  the  annual  cycle  of  SST,  the  annual  cycle  of  the  soil  moisture  and 
snow  cover  were  also  prescribed  (CSST-SM).  This  experiment  was  conducted  for  18 
years.  We  then  carried  out  time  series  and  spectrum  analysis  of  various  monthly  mean 
fields  as  before.  It  is  found  that  this  run  also  contains  a  significant  biennial  signal  as  a 
dominant  part  of  its  interannual  variability.  The  spectra  of  un-filtered  monthly  mean 
precipitation  averaged  over  two  locations  are  shown  in  figure  6.  It  is  clear  that  both  the 
time  series  show  a  rather  strong  and  significant  peak  with  a  period  around  two  years. 
Therefore,  it  appears  that  the  biennial  oscillation  of  our  model  atmosphere  does  not  owe 
its  origin  to  a  ground  hydrology  feedback  as  envisaged  by  Meehl  [37]. 

This  leaves  the  possibility  of  producing  a  biennial  signal  through  interaction  between 
atmospheric  high  frequency  transients  (synoptic  and  intraseasonal  oscillations)  and  the 
annual  cycle  as  a  candidate  for  the  biennial  oscillation  in  our  model  atmosphere.  Lorenz 
[39]  first  addressed  whether  the  nonlinear  weather  events  (synoptic  systems)  in  the 
middle  latitude  modulated  by  the  annual  cycle  forcing  could  give  rise  to  any  significant 
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Figure  6.  Power  spectra  of  precipitation  averaged  between  5°  S-5°  N  and  over  two 
longitudinal  belts  (a)  (70° -95°  E),  and  (b)  (120°-140°  E)  from  CSST-SM  simulation. 
Other  conventions  are  same  as  in  figure  5. 


interannual  variability.  He  showed  that  due  to  the  intransitive  nature  of  the  nonlinear  high 
frequency  component,  a  periodic  forcing  can  give  rise  to  some  interannual  variability. 
However,  there  was  no  suggestion  of  a  preferential  selection  of  the  biennial  oscillation.  In 
contrast  to  the  mid  latitude,  in  the  tropics  the  day  to  day  weather  fluctuations  are  less 
energetic  but  the  intraseasonal  oscillations  such  as  the  active  break  phases  of  the  monsoon 
are  quite  energetic.  Here  we  show  that  the  modulation  of  the  nonlinear  atmospheric 
'intraseasonal  oscillations'  by  the  annual  cycle  could  result  in  an  atmospheric  biennial 
mode.  To  demonstrate  this,  we  choose  the  following  prototype  nonlinear  system  [39] 

(1) 
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Figure  7.    Spectrum  of  daily  Y  with  c  =  0.5.  (a)  for  a  constant  F  =  8.3  and  (b) 
annual  cycle  of  F  =  7.0  +  2.0cos7rf/l  year. 


XY  -  bXZ  -  cY  +  G, 
bXY  +  XZ-cZ, 


(2) 
(3) 


where  X  may  be  considered  as  the  zonally  symmetric  component  while  Y  and  Z  may  be 
interpreted  as  non-zonal  components.  F  is  the  zonally  symmetric  forcing  (e.g.,  solar 
forcing)  while  G  is  the  non-zonal  forcing  arising  out  of  land-ocean  contrasts.  Values  of 
the  parameters  a  and  b  representing  dissipation  and  strength  of  eddy-mean  flow 
interaction  used  by  Lorenz  are  retained  (a  =  0.25  and  b  =  4.0).  Also,  G  =  1.0.  Original 
equations  of  Lorenz  and  the  parameters  are  all  scaled  by  a  factor  c.  For  c  =  1.0,  the 
spectrum  of  the  nonlinear  system  within  the  chaotic  regime  shows  maximum  power 
around  synoptic  frequencies  (around  one  week).  With  c  =  0.5,  however,  the  preferential 
period  in  the  chaotic  regime  shifts  to  intraseasonal  range  (3  to  5  months).  In  figure  7(a), 
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Figure  8.  Some  gross  features  of  the  attractors  for  different  values  of  constant 
forcing,  F.  (a)  Long  term  mean,  Xm  and  (b)  standard  deviation,  Xs.  The  two  symbols 
represent  values  obtained  with  two  different  initial  conditions  for  each  forcing. 

the  spectrum  of  daily  Y  for  a  fixed  F  =  8.3  is  shown  based  on  the  last  100  years  of  a  150 
year  integration.  Next  we  introduced  an  annual  cycle  forcing  with  F  =  7. 0+2.0  cos  Trt/r, 
where  r  is  one  year.  Again,  the  spectrum  of  daily  Y  is  shown  based  on  the  last  100  yeas  of 
a  150  year  integration.  The  modulation  of  the  intraseasonal  oscillations  by  the  annual 
cycle  clearly  gives  rise  to  a  significant  biennial  oscillation  (figure  7b).  If,  however, 
c  =  1.0  and  the  nonlinear  oscillations  are  primarily  confined  to  the  synoptic  range,  the 
preferential  excitation  of  the  biennial  mode  is  very  weak  (not  shown).  . 

How  does  periodic  variation  of  forcing  for  the  atmosphere  with  a  period  of  1  year 
generate  responses  at  all  low  frequencies?  This  was  discussed  by  Lorenz  [39]  in  detail.  To 
understand  how  the  slowly  varying  forcing  introduces  the  low-frequency  response,  we 
need  to  examine  the  nature  of  variations  of  the  nonlinear  system  (l)-(3)  for  different 
values  of  the  steady  forcing.  In  figure  8,  we  show  some  gross  characteristics  of  the 
attractors  (long-term  mean  and  standard  deviation)  of  the  system  (l)-(3)  for  a  range  of 
values  of  F.  The  two  symbols  represent  attractors  attained  from  two  different  initial 
conditions.  The  system  has  periodic  attractors  for  F  between  5.0  and  7.5.  For  F  between 
7.5  and  8.8,  the  system  has  a  chaotic  attractor.  Beyond  F  =  8.8,  again  the  system  goes  to 
a  periodic  regime.  We  note  that  for  F  between  5.0  and  7.0,  the  system  has  at  least  two 
periodic  orbits,  one  with  high  amplitude  and  low  mean,  while  the  other  with  low 
amplitude  and  higher  mean.  Within  the  chaotic  regime  too,  the  system  tends  to  have  two 
preferred  regimes,  one  with  large  amplitude  oscillations  with  low  mean  (active  regime) 
and  another  with  small  amplitude  oscillations  with  high  mean  (weak  regime).  This  is  seen 
in  figure  9  where  the  frequency  distribution  of  mean  and  standard  deviation  calculated 
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Figure  9.  Frequency  distribution  of  (a)  mean  and  (b)  standard  deviation  of  three 
month  segments  for  F  =  8.0  over  a  period  of  184  years. 

over  three  month  segments  over  a  period  of  184  year  with  a  constant  F  =  8.0  (chaotic 
regime)  is  shown.  Both  the  standard  deviation  and  the  short  term  mean  show  the  tendency 
for  two  preferred  regimes. 

With  this  background,  we  may  try  to  understand  the  response  of  the  nonlinear  system 
when  the  forcing  changes  periodically.  When  the  forcing ,  varies  annually  between  two 
extremes,  say  between  F  —  9  and  F  =  5,  the  system  reaches  a  chaotic  attractor  that 
consists  of  a  strong  and  a  weak  regime.  The  two  regimes  resemble  the  orbits  of  the 
multiple  periodic  attractors  of  constant  forcings  discussed  earlier.  Whether  the  system 
would  tend  to  visit  the  weak  or  the  strong  regime  in  a  given  period  of  the  forcing  depends 
on  the  previous  history  of  state  (or  initial  conditions,  in  a  loose  manner).  At  many  of  these 
values  of  F,  when  they  are  held  constant,  the  system  may  possess  multiple  periodic 
attractors  that  depend  on  the  initial  conditions.  But  when  the  forcing  is  varying  because  of 
chaos,  the  system  reaches  a  different  state  on  the  same  day  (or  same  value  of  i)  of  every 
year.  This  difference  determines  the  difference  in  the  subsequent  behaviour  of  the  system- 
whether  to  visit  the  strong  or  the  weak  regime.  This  essentially  is  why  the  annual  cycle 
itself  is  capable  of  introducing  some  interannual  variability.  It  appears  that  in  one  year  the 
system  tends  to  reside  more  in  the  low  amplitude-high  mean  state  while  in  another  year  it 
tends  to  reside  in  the  high  amplitude  low  mean  state  giving  rise  to  a  biennial  variability. 


6.  Summary  of  results  and  implications  for  monsoon  predictability 

The  interannual  variability  of  the  Indian  summer  monsoon  simulated  by  three  multi-year 
simulations  with  the  GFDL  R30L14  model  is  studied.  The  monsoon  climatology  of 
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the  model  is  realistic  including  the  position  and  strength  of  the  low  level  westerly 
jet,  upper  level  easterly  jet  and  precipitation  over  continental  India.  The  skill  of 
simulation  of  interannual  variability  of  the  Indian  summer  precipitation  by  the  GFDL 
climate  model  is  only  modest.  Most  general  circulation  models  are  found  to  be  deficient 
in  simulating  the  observed  interannual  variability  of  the  Indian  summer  monsoon 
precipitation  [15].  The  rather  poor  predictability  of  Indian  summer  monsoon  precipitation 
is  shown  to  be  due  to  the  fact  that  the  interannual  variability  forced  by  slowly  varying 
SST  over  the  Indian  region  is  comparable  or  even  weaker  than  that  due  to  internal 
dynamics. 

Further  examination  of  the  nature  of  the  interannual  variability  in  the  CLI-SST  run 
reveals  that  much  of  the  interannual  variability  of  the  model  is  due  to  a  significant 
biennial  oscillation  of  the  model  atmosphere.  The  CSST-SM  experiment  was  carried 
out  to  address  whether  the  biennial  oscillation  could  have  resulted  from  a  soil  moisture- 
atmosphere  feedback  or  whether  it  was  internal  to  the  atmosphere.  The  fact  that  a 
significant  biennial  oscillation  is  seen  even  in  the  fixed  soil  moisture  run  rules  out  the 
possibility  that  ground  hydrology  feedback  is  responsible  for  the  biennial  oscillation.  This 
leaves  the  possibility  of  producing  a  biennial  signal  through  interaction  between 
atmospheric  high  frequency  transients  (synoptic  and  intra-seasonal  oscillations)  and  the 
annual  cycle  as  proposed  by  Goswami  [38],  as  a  candidate  for  the  biennial  oscillation  in 
our  model  atmosphere.  This  was  demonstrated  with  a  simple  paradigm  model  of  the 
tropical  atmosphere.  The  annual  cycle  may  be  considered  as  a  slowly  varying  forcing  for 
the  nonlinear  atmospheric  high  frequency  oscillations.  As  the  nonlinear  atmospheric 
oscillations  go  through  a  chaotic  regime  during  parts  of  the  slowly  varying  periodic 
forcing,  it  can  result  in  a  low  frequency  tail  in  the  atmospheric  oscillations. 

The  biennial  internal  oscillation  is  likely  to  be  a  major  limiting  factor  for  prediction  of 
seasonal  monsoon  precipitation.  The  biennial  mode  has  largest  amplitude  around  the 
precipitation  zone  over  the  Indonesian  region.  As  the  SST  forced  response  is  relatively 
weak  over  the  Indian  region,  and  the  biennial  internal  mode  has  large  amplitude,  it  may 
dominate  the  interannual  variability.  Although  the  amplitude  of  the  biennial  mode  is  large 
over  northern  Australia,  the  SST  forced  response  is  also  large.  As  a  result  the  biennial 
mode  cannot  dominate  the  interannual  variability  over  northern  Australia.  Over  the  Sahel 
region  in  Africa  the  amplitude  of  the  biennial  oscillation  is  very  weak.  Therefore,  the 
interannual  variations  over  Sahel  are  again  primarily  governed  by  SST  variations.  These 
conclusions  from  our  analysis  are  consistent  with  conclusions  that  the  seasonal 
predictions  of  precipitation  over  the  Sahel  [17]  and  northern  Australia  based  on  SST 
can  be  made  with  much  greater  confidence  than  can  be  done  over  the  Indian  monsoon 
region  [34].  The  discovery  of  a  quasi  biennial  internal  oscillation  of  the  tropical 
atmosphere  limiting  the  predictability  of  the  Indian  monsoon  needs  to  be  reexamined 
with  other  general  circulation  models. 
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Some  recent  applications  of  cell  dynamical  modelling  to 
phase  ordering  dynamics 
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Abstract.  We  study  experimentally  relevant  effects  in  phase  ordering  dynamics  using  Cell 
Dynamical  System  (CDS)  models.  In  particular,  we  present  representative  numerical  results  for 
phase  ordering  in  random  magnets  and  phase  separation  in  binary  fluids. 

Keywords.    Cell  dynamical  modelling;  phase  ordering  dynamics;  disordered  systems. 
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1.  Introduction 

There  has  been  much  recent  interest  in  the  problem  of  phase  ordering  dynamics,  viz., 
the  temporal  evolution  of  a  homogeneous  two-phase  mixture  which  has  been 
rendered  thermodynamically  unstable  by  a  sudden  quench  below  its  critical 
temperature  [1].  Typically,  the  evolving  system-  separates  into  domains  which  are 
rich  in  one  or  the  other  constituent  of  the  mixture.  These  domains  coarsen  with 
time  and  are  characterised  by  a  single  length  scale  R(t)  at  time  t  (for  isotropic  systems). 
A  direct  consequence  of  the  existence  of  a  unique  length  scale  is  the  so-called 
dynamical  scaling  of  the  time-dependent  correlation  function  g(f),  i.e.,  g(r)  =  G(r/R(t)}, 
where  r  is  the  spatial  variable  and  G(x)  is  a  master  function,  which  is  independent  of 
time  [2].  The  nature  of  the  domain  growth  law;  and  the  scaling  form  of  the  correlation 
function  are  of  great  experimental  relevance  in  disciplines  as  diverse  as  physics, 
chemistry,  materials  science  and  metallurgy.  They  depend  critically  on  a  variety  of  factors 
as  follows: 

(a)  whether  the  evolution  is  characterised  by  a  nonconserved  order  parameter  (e.g., 
ordering  of  a  ferromagnet)  or  conserved  order  parameter  (e.g.,  segregation  of  a  binary 
alloy); 

(b)  whether  or  not  hydrodynamic  effects  are  relevant; 

(c)  the  nature  of  disorders  (quenched  or  annealed)  present  in  the  phase  ordering  system, 
etc. 

Theoretical  descriptions  of  phase  ordering  dynamics  are  usually  in  terms  of 
nonlinear  partial  differential  equations,  which  are  analytically  intractable  -  especially 
in  the  case  with  conserved  order  parameter.  Thus,  it  is  necessary  to  have  computationally 
efficient  numerical  models  which  enable  us  to  access  the  asymptotic  behaviour  of  phase 
ordering  dynamics  with  limited  numerical  effort.  Some  time  ago,  we  formulated  discrete 
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from  their  relevance  in  phase  ordering  dynamics,  our  L-Lto  moaeis  nave  aiso  served  as  an 
important  example  of  the  relevance  of  discrete  space-time  models  in  the  modelling  of 
nonequilibrium  phenomena. 

In  this  paper,  we  would  like  to  briefly  discuss  two  experimentally  relevant  problems  of 
phase  ordering  dynamics,  which  we  have  recently  tackled  using  CDS  models.  This  paper 
is  organised  in  the  following  fashion.  In  §  2,  we  discuss  conventional  models  of  phase 
ordering  dynamics  and  also  describe  our  CDS  models.  In  §  3,  we  study  the  first  problem 
of  interest  to  us  here,  viz.,  phase  ordering  dynamics  in  systems  with  quenched  disorder. 
Section  4  of  this  paper  examines  the  second  problem  of  interest  to  us  here,  viz.,  the  role  of 
hydrodynamic  effects  in  phase  separation  dynamics.  Section  5  ends  this  paper  with  a 
summary  and  discussion. 


2.  Conventional  and  cell  dynamical  modelling  of  phase  ordering  dynamics 

At  the  coarsegrained  level,  phase  ordering  dynamics  is  conventionally  modelled  via 
stochastic  partial  differential  equations.  Typically,  systems  with  a  nonconserved  order 
parameter  (e.g.,  ordering  of  ferromagnets)  are  described  by  the  time-dependent  Ginzburg- 
Landau  (TDGL)  equation: 


dt    ~      ty(?,o         ') 

where  ^(r,  i)  is  the  order  parameter  of  the  system  at  point  r  at  time  t.  For  pure  and 
isotropic  systems,  Jff^r,  t}]  is  usually  taken  to  be  the  coarse-grained  ^4-free  energy 
functional: 

*))2],  (2) 

where  we  assume  that  all  quantities  have  been  rescaled  into  appropriate  dimensionless 
units.  The  Gaussian  white  noise  <r(r,  t}  in  (1)  has  zero  expectation  value  and  satisfies  the 
usual  fluctuation-dissipation  relation.  In  the  classification  scheme  of  Hohenberg  and 
Halperin  [4],  Eq.  (1)  is  referred  to  as  Model  A. 

Of  course,  real  experimental  systems  are  neither  pure  nor  isotropic  and  the  free- 
energy  functional  in  (2)  constitutes  an  oversimplification.  For  example,  in 
ferromagnets,  quenched  disorder  typically  arises  in  the  form  of  nonmagnetic 
impurities.  At  the  phenornenological  level,  this  is  modelled  by  assigning  a  random 
spatial  dependence  to  the  coefficients  of  various  terms  in  the  free-energy  functional  of  (2) 
[5,  6].  The  distribution  of  this  random  fluctuation  is  usually  taken  to  be  either  Gaussian  or 
uniform.  For  studies  of  phase  ordering  dynamics,  the  precise  nature  of  this  distribution  is 
not  relevant  [6]. 

Phase  ordering  dynamics  in  systems  with  a  conserved  order  parameter  but  no 
hydrodynamic  effects  (e.g.,  segregation  of  a  binary,  alloy  AB)  is  described  by  the 
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Cahn-Hilliard-Cook  (CHC)  equation  [7]: 

dt      ~  6ip(r, t}    J      °^r>r"  (  ' 

where  symbols  have  the  same  meaning  as  previously.  In  the  case  with  conserved  order 
parameter,  Jdrijj(r,  t)  is  a  constant  in  time,  as  is  clear  on  integrating  Eq.  (3)  over  space. 
Eq.  (3)  is  also  referred  to  as  Model  B  [4]. 

The  simplest  model  which  incorporates  hydrodynamic  effects  in  segregation 
dynamics  is  known  as  Model  H  [4]  and  consists  of  a  conserved  scalar  density  (the 
order  parameter)  coupled  to  the  hydrodynamic  velocity  field.  We  will  consider  here  a 
variant  of  Model  H,  with  the  following  form  in  three  dimensions  (again  in  appropriate 
dimensionless  units): 


,  0  +  & 


In  (4),  ^(r,t]  and  Jt(r,t)  (i  =  1,2,3)  are,  respectively,  the  order  parameter  and  the 
dimensionless  velocity  fields  as  a  function  of  dimensionless  space  and  time.  The 
Gaussian  white  noises  u(r,t)  and  v,-  (r,  it)  (i  =  1,2,3)  are  the  noises  for  the  order 
parameter  field  and  the  velocity  field,  respectively.  They  satisfy  the  appropriate 
fluctuation-dissipation  relations.  The  rescaled  parameters  in  (4)  are  the  coupling 
constant  a  (a  =0  corresponds  to  the  usual  case  of  binary  alloys,  viz.,  Model  B); 
and  the  transport  coefficients  fj  and  cr,  which  are  the  viscosities.  We  should  remark 
that  (4)  reduces  to  the  standard  form  of  Model  H  [4]  if  we  impose  the  additional 
constraint  V  •  J(r,  t)  =  0.  However,  as  was  pointed  out  by  Farrell  and  Vails  [8],  this 
extra  constraint  causes  considerable  numerical  complications  and  we  do  not  impose  it 
here. 

Given  the  numerical  effort  involved  in  successfully  simulating  the  stochastic  partial 
differential  equation  models  in  Eqs.  (l)-(4)  or  performing  Monte  Carlo  (MC)  simulations 
of  appropriate  microscopic  models,  it  is  desirable  to  have  computationally  efficient 
models  to  study  the  scaling  regime.  We  have  proposed  CDS  models  of  phase  ordering 
dynamics,  which  are  highly  efficient  in  terms  of  computer  time  usage  [3].  These 
models  impose  the  space-time  coarsening,  implicit  in  conventional  phenomenological 
models,  by  using  a  discretised  space-time  lattice.  An  order  parameter  variable  is 
associated  with  each  cell  of  the  spatial  lattice,  which  can  be  considered  to  be  the  result  of 
coarse-graining  the  microscopic  model.  The  underlying  principles  of  our  modelling  have 
already  been  discussed  extensively  in  the  literature  and  we  refer  the  interested  reader  to 
ref.  [3].  Here,  we  merely  confine  ourselves  to  presenting  the  resultant  models,  which  are 
formulated  on  a  lattice  whose  cells  are  labelled  by,  say,  n.  We  associate  an  order 
parameter  value  at  (discrete)  time  t  with  each  cell  and  refer  to  it  as  i>(t,n).  Our  CDS 
model  for  the  case  with  nonconserved  order  parameter  in  d-dimensional  space  takes  the 
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following  form  [3]: 

))  +  —  A,>^(f,n) 

«)],  (5) 

where  /(i/>)  is  a  monotonic  function  with  an  unstable  fixed  point  at  -0  =  0  (which  mimics 
the  unstable  homogeneous  state);  and  two  symmetrically  located  stable  fixed  points  at 
ifj  =  ±-0o  (which  mimic  the  stable  spontaneously  ordered  states).  In  (5),  D  is  a  positive 
constant  proportional  to  the  phenomenological  diffusion  constant;  and  AD  is  the 
isotropically  discretised  Laplacian  operator  at  the  site  n.  As  the  local  order  parameter 
takes  continuous  values,  this  CDS  model  falls  into  the  subclass  of  Coupled  Map  Lattice 
(CML)  models  [9,  10]. 

We  can  also  incorporate  thermal  noise  terms  in  the  deterministic  model  of  Eq.  (5). 
However,  we  have  demonstrated  numerically  that  thermal  noise  is  asymptotically 
irrelevant  for  phase  ordering  dynamics  -  at  least  in  the  context  of  pure  systems  [3], 
Therefore,  the  deterministic  model  in  (5)  suffices  if  one  is  interested  in  universal 
asymptotic  behaviour.  Furthermore,  we  should  stress  that  any  function  f(ijj)  with  the 
specified  properties  provides  a  reasonable  CDS  model  for  phase  ordering  with  a 
nonconserved  order  parameter.  This  is  a  consequence  of  the  strong  universality  of  the 
ordering  process  and  is  discussed  at  length  in  Ref.  [3]. 

The  corresponding  CDS  model  for  the  case  with  quenched  disorder,  e.g.,  ordering  in 
random  magnets,  is  obtained  as  a  simple  extension  of  (5)  by  associating  a  random  spatial 
dependence  with  the  parameters  of  the  function  /(•0),  so  that  the  local  fixed  points 
are  randomly  distributed  around  ±^o  [6].  In  our  subsequent  discussion,  the  term 
"amplitude  of  disorder"  refers  to  the  amplitude  of  fluctuations  in  the  local  fixed  points  of 
the  function/^). 

The  corresponding  CDS  model  for  the  conserved  case  is  easy  to  obtain.  Naively,  it  may 
be  obtained  analogously  to  the  continuum  case  by  appending  an  extra  Laplacian  operator 
to  the  chemical  potential  for  the  nonconserved  case  as  follows  [3]: 

tf(f  +  l,n)  =  V(r,n)  -^ADTOM]  -tfM].  (6) 

Our  procedure  for  obtaining  CDS  models  can  also  be  interpreted  as  a  novel  method  of 
discretising  the  corresponding  partial  differential  equations,  which  enables  the  use  of 
rather  coarse  mesh  sizes,  without  any  loss  of  numerical  stability  [3].  As  a  matter  of  fact, 
we  can  use  this  interpretation  to  derive  the  CDS  model  equivalent  to  (4),  i.e.,  for  the  case 
with  conserved  order  parameter  when  hydrodynamic  effects  are  relevant.  The  CDS  model 
we  obtain  from  (4)  (again  neglecting  noise  terms)  has  the  following  form  in  three 
dimensions  [11]: 


Ji(t  +  1,  n)  =  /,-(*,  n]  +  ri&oJfa  n)  +  <jT  V^-V^O,  n] 


k=\ 


+  wftt,  n)  VzvtFM*  ,  n)]  -  ftt,  n)],  (7) 
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Phase  ordering  dynamics 

where  a,  f]  and  a  are  phenomenological  parameters.  They  are  analogous  to  the 
corresponding  parameters  in  the  continuum  model  (4).  In  (7),  V&  is  the  symmetrically 
discretised  gradient  operator.  The  values  of  the  parameters  are  dictated  by  the 
requirements  that  the  scheme  be  stable  and  that  the  results  be  physically  reasonable 
[3,  11]. 

We  will  next  describe  some  recent  results  obtained  using  the  CDS  models  we  have  just 
discussed.  As  mentioned  earlier,  we  focus  on  two  experimentally  relevant  problems,  viz., 
phase  ordering  in  random  magnets;  and  phase  separation  in  binary  fluids. 

3.  Phase  ordering  dynamics  in  disordered  magnets 

We  have  studied  phase  ordering  in  random  magnets  by  implementing  the  disordered 
version  of  the  CDS  model  defined  by  (5)  on  a  2-dimensional  lattice  of  size  TV2.  Periodic 
boundary  conditions  were  applied  to  both  edges  of  the  lattice.  The  parameter  values  for 
our  simulations  and  the  manner  in  which  we  introduce  disorder  inf(ijj)  are  discussed  at 
length  in  Ref.  [6]  and  we  do  not  replicate  the  discussion  here. 

The  initial  condition  for  a  single  run  corresponds  to  small-amplitude  uniformly- 
distributed  random  fluctuations  about  a  zero  background,  mimicking  the  disordered  state 
before  the  quench.  The  statistical  quantities  of  interest  in  phase  ordering  dynamics  are  the 
time-dependent  structure  factor,  which  is  the  Fourier  transform  of  the  correlation 
function,  and  the  characteristic  domain  length  scale  R(t}.  The  time-dependent  structure 
factor  is  defined  as 


where  if)(k,  t]  is  the  Fourier  transform  of  the  order  parameter  (spontaneous  magnetisation, 
in  this  case)  with  wave-vector  k;  and  •?/>(£,  t)*  is  the  complex  conjugate  of  Tp(k,  t}.  In  (8), 
the  angular  brackets  refer  to  an  averaging  over  independent  initial  conditions  and  the 
square  brackets  refer  to  an  averaging  over  disorder  configurations.  In  the  dynamical 
scaling  regime,  the  structure  factor  has  the  scaling  form  S(k,t)  =  R(t}dF(kR(t}},  where 
F(x)  is  a  time-independent  master  function. 

All  numerical  results  presented  below  are  obtained  on  1282  systems  as  averages  over 
20  disorder  configurations,  with  20  independent  initial  conditions  for  each  disorder 
configuration.  Because  of  the  discrete  nature  of  our  lattice,  the  wavevectors  k  take 
discrete  values  (2n/N)(kx,ky},  where  kx  and  ky  range  from  —N/2  to  (N/2)  —  1.  As  the 
system  of  interest  is  isotropic,  we  can  improve  statistics  by  spherically  averaging  the 
vector  function  S(k,  f)  to  obtain  a  scalar  function  S(k,  t}.  This  is  the  quantity  we  show  in 
subsequent  figures. 

As  mentioned  earlier,  the  second  quantity  of  interest  is  the  time-dependent  length  scale 
R(t).  We  define  R(t}  as  the  reciprocal  of  the  first  moment  of  the  scalarised  structure 
factor,  i.e.,  R(t)  =  (k)~l,  where 


In  (9),  the  upper  cut-off  km  is  taken  to  be  half  the  magnitude  of  the  largest  wavevector 
lying  in  the  Brillouin  zone  of  the  lattice.  We  have  confirmed  that  the  structure  factor  has 
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Figure  1.  (a)  Superposition  of  data  for  the  scaled  structure  factor  S(Jt,  r){fc)2  vs. 
k/(k)  obtained  from  CDS  studies  of  phase  ordering  dynamics  in  random  magnets.  All 
structure  factor  data  is  obtained  on  1282  systems  as  an  average  over  20  disorder 
configurations,  with  20  independent  initial  conditions  for  each  disorder  configuration. 
We  present  data  for  the  pure  case  (denoted  by  circles);  and  a  range  of  disorder 
amplitudes  (denoted  by  the  symbols  indicated).  The  manner  in  which  disorder  is 
introduced  into  the  CDS  models  is  detailed  in  ref.  [6].  The  term  "disorder  amplitude" 
refers  to  the  amplitude  of  random  spatial  fluctuations  introduced  into  the  local  fixed 
points  of  the  function  f(ifi)  in  Eq.  (5).  Data  for  the  pure  case  (disorder  amplitude 
C  =  0.0)  is  from  dimensionless  time  t  =  1000.  Data  for  the  disordered  cases 
(C  =  0.30, 0.35  and  0.40)  is  from  dimensionless  time  t  =  3000.  (From  Ref.  [6].) 
(b)  Semi-log  plot  of  data  in  figure  l(a),  i.e.,  we  plot  ln(S(*,f){*)2)  vs.  k/(k).  (From 
ref.  [6].) 
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decayed  sufficiently  upto  this  value  of  km  and  that  numerical  results  for  R(t]  are 
unchanged  by  considering  larger  values  of  km.  The  length  scale  thus  defined  is  measured 
in  units  of  the  lattice  spacing.  We  should  stress  that  this  definition  of  R(t)  is  not  unique 
and  one  can  also  define  R(t)  by  using  higher  moments  of  the  structure  factor  or  various 
zero-crossings  of  the  correlation  function.  However,  in  the  dynamical  scaling  regime,  all 
these  definitions  are  equivalent  with  the  only  difference  being  in  numerical  pref actors  [2]. 

Our  extensive  numerical  investigations  of  phase  ordering  dynamics  with  quenched 
disorder  revealed  the  following  facts.  Firstly,  we  found  that  the  coarsening  systems 
exhibit  dynamical  scaling  of  the  structure  factor  as  in  the  pure  case,  indicating  thereby 
that  the  presence  of  quenched  disorder  does  not  affect  the  existence  of  a  unique  domain 
length  scale.  Furthermore,  we  found  that  the  functional  form  of  the  scaled  structure  factor 
appears  to  be  independent  of  the  amplitude  of  disorder  and  is  the  same  as  that  for  the  pure 
case  -  at  least  within  the  rather  wide  range  of  disorder  amplitudes  considered.  We  have 
referred  to  this  important  property  as  "superuniversal"  scaling  of  the  structure  factor  [6]. 
Figure  1  exhibits  the  superuniversal  scaling  of  the  structure  factor  for  phase  ordering  in 
random  magnets.  In  figure  l(a),  we  plot  the  scaled  structure  factor  S(k,t)(k}2  vs.  k/(k) 
for  the  pure  case  and  for  a  range  of  disorder  amplitudes.  Clearly,  the  functional  form  of 
the  scaled  structure  factor  appears  to  be  unaffected  by  the  presence  of  quenched  disorder. 
In  figure  l(b),  we  show  the  data  of  figure  l(a)  on  a  semi-log  scale,  i.e.,  we  plot 
ln(S(k,t}(k}2}  vs.  k/(k}.  It  is  clear  that  the  coincidence  of  the  scaled  structure  factors 
extends  into  the  intermediate  tail  region  also.  The  data  in  the  extreme  tail  is  not  very 
reliable,  in  any  case,  as  large  wave-vector  values  are  affected  by  the  nonzero  interfacial 
thickness.  Superuniversal  scaling  of  the  structure  factor  has  an  important  physical 
implication  in  that  it  suggests  that  the  morphology  of  growing  domains  is  unchanged 
inspite  of  trapping  by  disorder  sites.  Of  course,  we  have  only  considered  the  lowest-order 
structure  factor  and  it  is  not  necessary  that  higher-order  structure  factors  also  exhibit 
superuniversal  scaling.  Nevertheless,  the  fact  remains  that  the  lowest-order  structure 
factor  is  the  experimentally  relevant  one  and  the  higher-order  ones  are  primarily  of 
theoretical  relevance.  Therefore,  our  numerical  observation  has  important  experimental 
implications. 

Our  numerical  results  are  not  as  conclusive  in  regards  to  the  time-dependence  of  the 
domain  length  scale  R(t).  Figure  2  plots  the  square  of  the  length  scale  R(t}2  vs.  t  for  the 
pure  case  and  for  the  disordered  cases  discussed  previously.  (It  should  be  kept  in  mind 
that  the  length  scale  is  measured  in  units  of  the  lattice  spacing.)  In  this  figure,  we  plot 
R(t}2  vs.  t  rather  than  R(t)  vs.  t  because  the  data  for  the  pure  case  is  well-known  to  exhibit 
the  Lifshitz-Cahn-Allen  (LCA)  growth  law  R(t)  ~  f1/2  and  this  serves  as  a  benchmark  for 
discussing  data  for  the  disordered  cases.  It  is  evident  that  domain  growth  with  quenched 
disorder  does  not  exhibit  the  LCA  growth  law,  except  in  the  very  early  stages  where 
domains  are  too  small  to  be  trapped  by  disorder  sites.  The  crossover  time  from  the  LCA 
growth  law  is  inversely  proportional  to  the  amplitude  of  disorder.  Furthermore,  if  we  plot 
the  data  of  figure  2  on  a  log-log  scale  (not  shown  here),  it  is  also  evident  that  data  for  the 
disordered  cases  does  not  show  a  power-law  behaviour  at  all.  Unfortunately,  these  are  the 
only  conclusive  statements  we  can  make  in  this  context.  Inspite  of  extensive  simulations 
by  us  [6,  12]  and  other  authors  [13],  there  is  still  no  understanding  of  the  functional  form 
of  the  asymptotic  growth  law,  or  even  whether  there  is  a  universal  functional  form  valid 
for  a  wide  range  of  disorder  amplitudes.  There  have  also  been  theoretical  investigations  of 
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Figure  2.  Square  of  the  characteristic  domain  side  R(t}2  as  a  function  of  update  time 
t  for  phase  ordering  dynamics  in  random  magnets.  We  show  data  for  the  pure  case  and 
the  disordered  cases  of  figure  1.  Symbols  have  the  same  meaning  as  in  figure  1.  (From 
ref.  [6].) 


this  problem  [14],  which  have  proposed  a  domain  growth  law  of  the  form  R(t)  ~  (In  t}x, 
where  x  is  a  universal  exponent.  These  "logarithmic"  domain  growth  laws  are  argued  on 
the  basis  of  thermally-activated  hopping  of  domains  over  potential  barriers  set  up  by  the 
trapping  sites.  However,  extant  numerical  results  do  not  support  such  domain  growth 
laws,  except  in  the  case  with  quenched  random  fields  [15]. 

We  have  also  investigated  the  dynamics  of  phase  separation  in  binary  alloys  with 
quenched  disorder  [16].  Our  results  for  that  case  show  similar  trends  to  those  discussed 
above  for  the  nonconserved  case,  i.e.,  the  structure  factor  exhibits  superuniversal  scaling; 
and  the  asymptotic  domain  growth  law  is  nonalgebraic,  but  we  are  again  unable  to 
ascertain  the  precise  form  of  the  universal  growth  law,  if  any. 


4.  Phase  separation  dynamics  in  binary  fluids 

We  studied  phase  separation  in  binary  fluids  by  implementing  the  CDS  model  defined  by 
(7)  on  a  3-dimensional  lattice  of  size  N3  with  periodic  boundary  conditions.  The  results 
presented  below  are  for  the  local  relaxation  function/^)  =  A  tanh  -0  (with  A  =  1 .5)  and 
D  =  0.3  [3].  We  can  associate  mesh  sizes  with  these  parameter  values  by  comparing  the 
CDS  scheme  with  the  usual  Euler  discretisation  scheme  for  partial  differential  equations 
[3].  The  corresponding  values  are  At  =  0.5  and  AJC  =  2.45.  In  (4),  we  set  77  =  1  and 
a  —  2,  following  Farrell  and  Vails  [8].  The  phenomenological  parameters  ??  and  a  in  (7) 
are  then  fixed  as  r\  =  f}At/(Axf  =  0.08  and  a  =  aAt/(Axf  =  0.17.  This  should  not  be 
interpreted  as  a  rigorous  prescription  for  fixing  parameters  but  rather  as  a  rule  of  thumb  to 
associate  parameter  values  in  our  CDS  model  with  those  in  the  corresponding  partial 
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Figure  3.  Superposition  of  data  for  the  scaled  structure  factor  S(k,  t}(k}3  vs.  k/(k) 
for  phase  separation  in  binary  fluids.  Structure  factor  data  is  obtained  on  803  systems 
as  an  average  over  40  independent  runs.  We  superpose  data  from  dimensionless  times 
2000,  4000  and  6000,  denoted  by  the  symbols  indicated.  (From  Ref.  [11].) 


differential  equation  model.  We  should  stress  that  the  choice  of  values  for  viscosity 
parameters  does  not  affect  the  asymptotic  behaviour  of  phase  separation  in  binary  fluids 
and  only  changes  the  time  of  crossover  to  the  asymptotic  regime.  Typically,  higher  values 
of  viscosity  suppress  hydrodynamic  effects  and  delay  the  onset  of  the  asymptotic 
hydrodynamic  regime. 

Finally,  we  discuss  the  most  important  parameter  in  (7),  viz.,  a,  which  fixes  the 
strength  of  the  coupling  between  the  order  parameter  and  the  velocity  field.  We  have 
studied  domain  growth  as  modelled  by  (7)  for  a  range  of  values  of  a.  In  the  limit  a  — >  0, 
we  recover  the  Lifshitz-Slyozov  growth  law  R(t)  ~  f1/3,  which  characterises  domain 
growth  in  Model  B,  as  expected.  In  this  paper,  we  present  results  for  a  —  0.41,  which 
corresponds  (in  the  prescription  defined  above)  to  61  =  2  in  (4). 

All  results  presented  here  are  for  the  case  of  a  critical  quench,  where  there  are  equal 
concentrations  of  both  components  in  the  mixture.  The  structure  factor  is  calculated  for 
systems  with  N  =  80  and  as  an  average  over  40  runs  from  independent  initial  conditions, 
each  of  which  consists  of  the  order  parameter  and  velocity  field  uniformly  and  randomly 
distributed  about  a  zero  background  with  small  amplitudes.  The  time-dependent 
characteristic  length  scale  R(t)  is  defined  as  in  Eq.  (9). 

For  purposes  of  comparison,  we  have  also  performed  a  CDS  simulation  for  the  case 
without  hydrodynamics  (i.e.,  Model  B)  on  an  803  lattice.  We  use  the  same  functional 
form  for /(?/>)  and  the  same  values  for  A  and  D  as  in  the  case  with  hydrodynamics. 

Figure  3  superposes  data  for  the  scaled  structure  factor  ,$(/:,  t)  (k)  vs.  k/  (k)  from 
dimensionless  times  2000,  4000  and  6000.  The  different  data  sets  collapse  neatly  onto  a 
master  curve,  verifying  the  validity  of  dynamical  scaling  in  this  case.  We  next  turn  our 
attention  to  the  temporal  behaviour  of  the  characteristic  length  scale.  Figure  4  plots  R(t) 
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Figure  4.    Characteristic  domain  size  R(t)  vs.  update  time  t  for  phase  separation  with 
and  without  hydrodynamics,  denoted  by  circles  and  squares,  respectively. 


vs.  t  for  our  CDS  model  in  (7)  (marked  by  circles);  and  for  the  case  without 
hydrodynamics  (marked  by  squares).  After  an  initial  transient  regime  (which  can  be 
extended  by  using  a  smaller  value  of  a),  the  domain  size  for  the  hydrodynamic  case 
grows  linearly  in  time,  viz.,  R(t)  ~  t.  At  times  somewhat  beyond  those  shown  in  figure  4, 
freezing  sets  in  because  of  the  finite  size  of  the  system.  To  ensure  that  our  data  is  not 
affected  by  finite-size  effects,  we  have  also  performed  (less  thorough)  simulations  on 
lattices  of  size  643  and  1003.  The  results  are  the  same  as  those  presented  here  with  the 
only  difference  being  that  the  onset  of  freezing  is  delayed  in  larger  systems.  Domain 
growth  for  the  case  without  hydrodynamics  is  much  slower  and  can  be  easily  confirmed 
to  obey  the  usual  Lifshitz-Slyozov  growth  law  R(t)  ~  J1/3,  though  we  do  not  show  this 
explicitly  here.  Our  numerical  observation  of  temporally  linear  domain  growth  in  the 
phase  separation  of  binary  fluids  confirms  a  long-standing  theoretical  prediction  by 
Siggia  [17].  Similar  results  have  also  been  obtained  by  Koga  and  Kawasaki  [181  using  a 
CDS  model  in  which  they  solve  for  the  instantaneous  configuration  of  the  fluid  velocity 
field  using  the  Oseen  tensor  approximation.  There  are  also  a  number  of  related  works, 
some  of  which  we  cite  in  Ref.  [19],  suggesting  that  there  are  a  variety  of  growth  regimes 
depending  on  the  dimensionality  and  system  parameters. 


5.  Summary  and  discussion 

In  this  paper,  we  have  attempted  to  acquaint  the  reader  with  some  recent  investigations  of 
experimentally  relevant  problems  in  phase  ordering  dynamics.  Initial  numerical  studies  of 
idealised  (i.e.,  pure  and  isotropic)  phase  ordering  systems  were  greatly  facilitated  by  Cell 
Dynamical  System  (CDS)  models  [3].  It  is  our  belief  that  similar  modelling  will  also  help 
elaborate  late-stage  behaviour  in  experimentally  realistic  problems.  To  this  end,  we  have 
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described  numerical  results  from  CDS  models  of  phase  ordering  dynamics  with  quenched 
disorder;  and  segregation  dynamics  with  hydrodynamic  effects. 

In  the  context  of  domain  growth  with  quenched  disorder,  we  find  that  the  scaled 
structure  factor,  which  characterises  the  morphology  of  the  coarsening  system,  appears  to 
be  very  similar  to  that  for  the  pure  case.  This  should  be  experimentally  verifiable  and  we 
urge  experimentalists  to  carefully  examine  this  aspect  of  the  problem.  Unfortunately, 
however,  we  are  unable  to  make  a  conclusive  statement  about  the  asymptotic  nature  of  the 
domain  growth  law  in  this  problem.  This  is  an  outstanding  problem  which  needs  to  be 
resolved  but  it  is  not  at  all  clear  that  one  merely  needs  to  perform  "larger"  and  "longer" 
simulations  to  obtain  a  deeper  insight.  As  a  matter  of  fact,  we  believe  that  some  novel 
theoretical  input  is  necessary  to  enable  a  better  understanding  of  this  problem. 

In  the  context  of  phase  separation  in  binary  fluids,  our  numerical  results  confirm  a 
theoretical  prediction  due  to  Siggia  [17],  viz.,  hydrodynamic  modes  on  the  surfaces  of 
growing  domains  greatly  enhance  domain  growth  in  the  late  stages,  resulting  in  a 
crossover  in  the  domain  growth  law  from  R(t]  ~  r1/3  to  R(t]  ~  /.  This  crossover  has  also 
been  observed  in  a  large  number  of  experiments  [1]  and  it  is  gratifying  to  verify  it 
numerically. 

Models  similar  to  those  described  in  this  paper  have  also  been  used  to  investigate  a 
variety  of  other  problems  in  nonequilibrium  statistical  mechanics.  Typical  examples 
include  the  segregation  dynamics  of  block  copolymers  [20];  population  dynamics  [21]; 
dynamics  of  vortices  and  other  topological  defects  [22];  and  the  dynamics  of  pattern 
recognition  [23].  In  all  these  cases,  the  enhanced  computational  efficiency  of  CDS 
models  has  enabled  the  study  of  truly  asymptotic  behaviours.  It  is  our  belief  that  CDS 
modelling  will  provide  us  with  a  quantitatively  accurate,  and  yet  efficient,  means  of 
studying  a  host  of  physical  phenomena. 
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Abstract  We  discuss  the  use  of  coupled  nonlinear  stochastic  differential  equations  to  model  the 
dynamics  of  complex  systems,  and  present  some  analytical  insights  into  their  critical  behaviour. 
These  concern  in  particular  the  role  of  infrared  divergences  which  show  up  in  a  self-consistent 
resummation  of  perturbation  theory  (mode-coupling  approximation),  and  their  effects  on  critical 
exponents  obtained  in  earlier  work. 
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1.  Introduction 

The  theory  of  self-organized  criticality  (SOC)  [1]  provided  the  original  impetus  for  the 
study  of  systems  with  inherent  complexity  -  systems,  therefore,  which  could  be  observed 
in  many  metastable  states,  with,  frequently,  strong  hysteretic  behaviour.  The  chosen 
paradigm  for  SOC  was  the  sandpile;  it  turned  out,  however  [2, 3],  that  real  sandpiles  did 
not  manifest  the  scale  invariance  postulated  in  that  theory,  and  this  proved  to  be  the  motor 
for  a  significant  body  of  work  on  granular  media  [4]. 

The  absence  of  scale  invariance  was  postulated  [5]  to  be  due  to  the  presence  of 
competing  dynamical  mechanisms,  one  independent-particle  and  one  collective,  for  the 
reorganisation  of  grains  in  a  sandpile.  In  addition  to  numerous  simulations  [6, 7]  which 
have  supported  this  picture,  there  have  been  experiments  recently  [8]  which  have  shown 
its  validity. 

This  picture,  however,  necessitates  the  invention  of  new  classes  of  models  which  are 
able  to  capture  its  intricacies.  We  have  been  involved  with  the  formulation  of  discrete 
models  which  have  quantified  this  picture,  firstly,  in  the  language  of  cellular  automata  [9] 
and  then  in  that  of  coupled-map  lattice  models  [10];  each  of  these  has  a  representation  for 
the  two  relaxation  modes  referred  to  above  which  turns  out  to  be  intimately  related  to  the 
presence  of  times  and  lengths  characteristic  of  the  system,  and  hence  the  absence  of  scale 
invariance.  At  the  continuum  end  of  things,  we  came  up  with,  first,  a  system  of  noisy 
nonlinear  coupled  equations  at  the  macroscopic  level  [1 1]  and  next  the  equations  [4, 12] 
that  form  the  subject  of  the  present  study,  which  were  designed  to  be  a  local 
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representation  of  sandpile  dynamics.  The  description  of  these  equations  forms  the  subject 
matter  of  §  2,  which  also  contains  a  summary  of  the  main  results  obtained  thus  far  on  their 
critical  behaviour  [12]. 

Section  3  concerns  the  analysis  of  these  equations  using  a  self-consistent  resummation 
of  perturbation  theory  (mode-coupling  approximation),  which  provides  a  satisfactory 
explanation  for  the  numerical  results  recalled  in  §2.  Finally,  in  §4  we  make  some 
concluding  remarks. 

2.  Coupled  local  equations  for  a  driven  sandpile  -  the  MLN  equations 

There  have  been  a  number  of  recent  approaches  [13-17]  which  have  attempted  to  model 
the  surfaces  of  driven  sandpiles;  starting  from  the  most  basic  dynamical  mechanism,  that 
of  diffusion  [13],  these  have  incorporated  various  subtleties,  ranging  from  lateral  growth 
[14]  to  disorder  [17].  However,  they  have  nearly  always  involved  one  variable,  the  local 
height  /i(x,  t]  of  the  surface,  rather  than  any  form  of  grain-cluster  coupling.  In  our 
approach  [12],  however,  the  underlying  picture  is  of  currents  of  grains  moving  down 
the  slope,  knocking  out  bumps  and  filling  in  holes  in  clusters  on  the  landscape.  The 
effective  coordinate  representing  clusters  is  the  local  height  h(x.,t),  while  the  moving 
grains  are  represented  by  their  local  density  p(x,f)  -  in  a  driven  sandpile  these 
coordinates  will  clearly  be  coupled,  and  the  specific  form  we  have  chosen  for  this 
coupling  follows  later. 

We  now  review  some  general  facts  about  rough  interfaces  [18,  19].  Three  critical 
exponents,  a,  /3,  and  z,  characterize  the  spatial  and  temporal  scaling  behaviour  of  a  rough 
interface.  They  are  conveniently  defined  by  considering  the  (connected)  two-point 
correlation  function  of  the  heights 


,0).  (2-1) 

We  have 

S(x,0)~|x|2Q     (M-KX)),         5(0,0  ~M2/?     (W~>oo),  (2.2) 

and  more  generally 

S(x,0«|x|2a*W|x|*)  (2.3) 

in  the  whole  long-distance  scaling  regime  (x  and  t  large).  The  scaling  function  F  is 
universal  in  the  usual  sense;  CM  and  z  —  Oi/{3  are  respectively  referred  to  as  the  roughness 
exponent  and  the  dynamical  exponent  of  the  problem. 

Our  stochastic  dynamical  equations  have  the  following  general  form  in  one  space 
dimension  [12] 


dh/dt  =  DhV2h-T  +  r]h  (x,  0  ,  (2.4) 

dp/dt  =  -V;  +  DpV2p  +  T  +  77P(x,  t).  (2.5) 
For  the  time  being  we  set 

T  =  -KpV2h  -  Ap(V/0+  -  p,pVh  -  i/(VA)_  (2.6) 


noisy  nonlinear  coupled  equations 

and 

J  =  -7p(V/i)_.  (2.7) 

We  have  introduced  the  notations  V  =  d/dx,  and 

for    *>0,  a_e,v>      fO    for    z>  0, 

for    z<05     fc--zti      UWJ-|Z    for    z<0) 

(2.8) 
with  6(z)  being  Heaviside's  step  function. 

The  content  of  our  dynamical  equations  is  as  follows: 

«  The  first  term  on  the  right  hand  side  of  (2.4)  represents  the  rearrangement  of  clusters  in 
the  presence  of  an  applied  noise  and  is  the  conventional  EW  [13]  term,  representing 
diffusion. 

•  The  second  block  of  terms,  T,  represents  grain-cluster  exchange,  where: 

—  The  term  KpV2h  represents  diffusive  motion  of  the  heights  mediated  by  the  flowing 
grains. 

—  The  term  jupV/i  also  represents  intercluster  motion  of  grains,  but  in  a  slope- 
dependent  fashion. 

—  The  term  v(Vh)_   represents  the  spontaneous  generation  of  flowing  grains 
whenever  the  local  slope  is  larger  than  critical;  this  exists  even  in  the  absence 
of  flowing  grains  and  is  meant  to  represent  the  effect  of  tilting  a  stationary 
sandpile. 

—  The  term  Ap(V/z)+  is  a  representation  of  the  effect  of  the  boundary  layer,  in  the 
sense  that  it  enables   the  reconversion  of  moving  to   stationary  grains;   it 
compensates  for  the  effect  of  the  tilt  term  by  limiting  the  release  of  flowing 
grains.  Although  originally  conceived  chiefly  as  a  regulator  [12],  it  has  a  definite 
physical  meaning  and,  as  we  shall  see,  is  responsible  for  the  anomalous  roughening 
of  the  interface. 

«  The  first  term  in  (2.5),  —  V/,  represents  the  variation  in  p  due  to  the  nonuniformity  of 
the  current  of  flowing  grains,  in  such  a  way  that  the  total  number  of  particles  is 
conserved.  The  current  j(x,  t]  has  been  written  in  (2.7)  as  the  product  of  the  number  of 
mobile  grains  by  their  velocity,  the  latter  being  assumed  to  read  v  =  —  7(VA)_. 

•  The  second  term  in  (2.5)  represents  the  diffusive  relaxation  of  the  flowing  grains,  and 
is  a  crude  way  of  representing  intergrain  collisions. 

•  The  source  terms  r)h(x,  t}  and  7]p(x,  t}  are  taken  to  be  two  independent  Gaussian  white 
noises,  such  that 


-  x')S(t  - 


Pouring  grains  onto  a  sandpile  should  be  represented  by  noise  only  in  p,  while  shaking 
it  would  be  represented  by  noise  in  h. 

We  describe  below  the  numerical  results  obtained  in  earlier  work  [12]  on  some  special 
cases  of  (2.4),  (2.5). 
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2.1  The  asymmetric  situation  (cases  1  to  3) 

This  situation  is  the  most  commonly  encountered  one,  of  a  (sloping)  sandpile  with  a 
preferred  direction  of  flow.  The  dynamical  equations  read,  on  setting  Dh  =  Dp  =  X  =  1 

and  K  —  /u  =  0, 


(dh/dt  =  V2h-T  +  r,h(x,  r), 

Cases  1  to  3  :     I  dp/dt  =  V2p  +  T  +  r,p(x,  f),  (2.10) 

(       T=.-p(Vh}+-v(Vh)_. 

We  have  considered  [12]  the  following  three  cases: 

•  Case  1:  noise  in  h  (A/,  >  0,  Ap  =  0). 

•  Case  2:  noise  in  p  (A/,  =  0,  Ap  >  0). 

•  Case  3:  noise  in  h  and  p  (Ah  >  0,  Ap  >  0). 

Non-trivial  long-range  spatial  and  temporal  critical  fluctuations  are  observed  for  both 
species  h  and  p  in  these  three  cases.  Case  1  is  of  particular  interest,  as  it  is  the  most 
directly  comparable  with  the  well-known  EW  [13]  and  KPZ  [14]  models.  We  observe  a 
rougher  behaviour  of  the  /z-profile  than  the  two  aforenamed.  This  effect  is  very 
pronounced  in  the  spatial  direction  (ah  =  0.94  ±  0.07  is  to  be  compared  with  1/2  in  both 
cases),  and  still  appreciable  in  the  temporal  one  ($,  =  0.43  ±  0.04  is  to  be  compared  with 
1/3  and  1/4).  This  anomalous  roughening  is  due,  as  we  will  see  by  the  analysis  in  the 
next  section,  to  the  effect  of  the  Ap(V/i)+  term. 

We  have  also  investigated  numerically  [12]  a  simpler  variant  of  the  above  (referred  to 
therein  as  case  la)  -  these  were  essentially  identical,  apart  from  notational  differences,  to 
the  equations  analysed  by  another  group  [20].  They  read 


( 

Case  la  :     J  dp/dt  =  V2p  +  T  +  r]p(x,  ?),  (2-11) 

[        T=  - 


The  critical  exponents  for  this  case  are  also  presented  in  table  1. 


2.2  The  symmetric  situation  (cases  4  to  6) 

We  now  turn  to  the  description  of  the  symmetric  version  of  our  dynamical  equations  - 
these  have  the  x  <->  —  x  symmetry,  and  describe  the  surface  of  a  sandpile  which  is  flat  on 
average.  The  dynamical  equations  now  read  in  full  generality 


Cases  4  to  6  : 


dh/dt  = 

dp/dt  =  -Vj  +  DpV2p  +  T  +  r)p(x,  t), 


(2.12) 


T  =  -KpV2h  -  \p\Vh\  H-  i/(|V/t|  -  C)+, 
j  =  7psign(V/i)(|V/i|-C)+, 

where  C  is  a  slope  threshold.  In  order  to  investigate  the  critical  regime,  we  simplify  the 


Noisy  nonlinear  coupled  equations 

above  equations  by  setting  7  =  C  =  0.  We  again  consider  three  cases: 

«  Case  4:  noise  in  h  (A,,  >  0,  Ap  =  0). 
•  Case  5:  noise  in  p  (A/,  =  0,  Ap  >  0). 
»  Case  6:  noise  in  h  and  in  p  (A/,  >  0,  Ap  >  0). 

In  cases  4  and  6  [12],  we  observe  o^  w  /^  w  0.40  and  z/,  =  1  within  error  bars,  just  as 
in  case  la,  whereas  p(x,t}  does  not  exhibit  divergent  fluctuations:  the  structure  factors 
Sp(q,Q]  and  Sp(0,a;)  rather  saturate  to  constant  values  for  small  enough  values  of 
wavevector  q  or  frequency  u>,  implying  the  fast  decay  of  p  —  p  correlations  at  long 
separations  in  space  and  in  time.  In  case  5,  where  noise  is  present  only  in  the  equation  for 
p,  we  find  that  h  gets  frozen,  soon  after  the  initial  period  has  elapsed,  into  a  nontrivial 
rough  landscape,  entirely  inherited  from  the  transient  period,  and  therefore  characterised 
by  a  roughness  exponent  a  w  0.40.  The  evolution  of  p  which  then  takes  place,  with  this 
frozen  height  configuration  as  a  background,  is  effectively  linear,  and  the  EW  exponents 
are  observed,  in  spite  of  the  background. 

3.  Mode-coupling  analysis 

In  this  section  we  provide  some  analytical  insight  into  the  scaling  behaviour  of  the 
various  cases  of  coupled  Langevin  equations  discussed  in  the  previous  section.  Our 
analysis  is  based  on  a  self-consistent  resummation  of  perturbation  theory  at  the  one-loop 
level,  introduced  long  ago  in  the  context  of  fluid  mechanics  [21],  and  usually  referred  to 
as  the  mode-coupling  approximation.  The  main  outcome  will  be  that  those  models  exhibit 
infrared  singularities  that  control  their  scaling  behaviour. 

We  illustrate  this  point  by  analysing  case  la  in  detail.  The  two  coupled  equations  for 
the  variables  h  and  p  are  given  in  (2.11)  and  (2.9).  We  denote  by  Sh(k,uS)  the  double 
Fourier  transform  of  the  correlation  function  (2.1),  such  that 

(h(kju}}h(kf,u}'}}  =  6(k  +  kf}6(uj  +  u}')Sk(k,u}  (3.1) 

and  by  Gh(k,v)  the  Green's  function,  or  propagator,  of  (2.11)  for  h,  averaged  over  the 
noise,  and  similar  notations  for  the  p-field. 

In  the  absence  of  coupling  (fj,  —  0),  (2.11)  degenerate  into  two  independent  free 
(EW)  models  in  the  variables  h  and  p.  The  Green's  functions  and  correlation  functions 
then  read 


lAi, 
f=TZ,  (3-2) 


and  similar  expressions  for  GJ,    and  S®\ 

The  effect  of  the  coupling  n  is  to  dress  the  propagator,  namely  to  change  it  from  the 
free  one,  given  in  (3.2),  to  the  full  one,  Gh(k,u}.  Dyson's  equation  yields 

where  £;,(&,  u;)  is  the  self-energy,  or  mass  operator  [22]. 
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Within  the  self-consistent  one-loop  (mode-coupling)  approximation,  where  the  internal 
lines  of  the  self-energy  graph  are  fully  dressed,  we  have 


)  =  Ai2  !~  l^-Sp(k>,J}k(k-k'}Gh(k-k'^-J}.  (3.4) 

J     27T  J     27T 

Now  we  make  the  following  scaling  ansatz  about  Zh(k,uj}  and 


(k  >  0)  (3.5) 

Sh(k,u)  w  Jk-1-20 


and  similarly  for  Sp(fc,  w)  and  Sp(fc,  u),  with  exponents  zp  and  QJP.  The  determination  of  zh 
involves  the  computation  of  the  static  (zero-frequency)  self-energy  S/,(fc)  = 
SA(Jt,o;  =  0).  We  assume  zh  <  2,  so  that  SA(fc)  dominates  A:2  in  the  scaling  region  [this 
hypothesis  will  be  checked  a  posteriori].  Hence  in  (3.3)  we  have 

Gh(k,u)~l  «  -KJ  +  £A(fc,a;).  (3.6) 


Accordingly,  we  explore  the  integral  on  the  right-hand-side  of  (3.4)  at  zero  external 
frequency.  The  integral  has  an  infrared  divergence  when  the  momentum  k?  of  the 
correlation  function  goes  to  zero.  Indeed  in  this  limit  k(k  —  kf]  w  k2,  and 

G^1  (k-k1,  -J}  w  iJ  +  Zh(k-  k1) 


«  SA(*).  (3.7) 

Because  of  the  scaling  laws  (3.5)  we  effectively  explore  the  low-frequency  behaviour 
when  we  explore  low  values  of  k1.  The  self-energy  of  (3.4)  now  becomes 


since 


and  a  similar  formula  for  the  p-field,  as  a  consequence  of  (3.5).  The  integral  on  the  right- 
hand-side  of  (3.8)  clearly  exhibits  an  infrared  divergence.  For  ap  >  0,  the  integral 
diverges  at  small  A7.  If  we  regularise  the  integral  by  introducing  as  an  infrared  cutoff  a 
smallest  wavevector  ICQ,  (3.8)  yields 


.  (3.10) 

The  above  result  at  zero  frequency  implies  ZH  =  1;  an  identical  analysis  yields  zp  =  1. 
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Now,  in  order  to  determine  o^,  we  investigate  the  correlation  function  Sh(k,u>}.  Again 
within  the  self-consistent  one-loop  approximation,  we  have 


(3.11) 
Using  (3.9),  we  have 

ft-*7)2         1 


27T    k  - 


(312) 

^••L^/ 


The  low-momentum  divergence  of  the  integral  on  the  right-hand-side  of  (3.12)  implies 
that  the  integral  scales  as  |fc|1~2a/t£0"2ap,  which  is  much  smaller  than  k'1'20*  in  the  long- 
wavelength  limit.  Thus  ah  remains  at  its  EW  value,  namely  ah  =  1/2.  An  identical 
argument  on  the  p-correlation  yields  aA  =  ap  =  1/2  and  z  =  1  in  case  la. 

A  variant  of  case  la  is  the  model  studied  by  Bouchaud  et  al  [20].  In  our  notation,  this 
model  reads 


(dhjdt  =  V2h  +  cVh  +  f^pVh  +  r)h(x,  t), 
[dp/dt  =  V2p  -  cVh  -  ppVh. 

The  zeroth-order  (free)  Green's  function  and  correlation  function  for  h  are  respectively 


(3-14) 


The  coupling  between  both  fields  can  be  viewed  as  a  coloured  noise  acting  on  p.  The 
zeroth-order  correlation  function  for  p  is 


— T-^T,  (3-15) 

hence 

/5(0)(fc,o;)  —  = 2c2^h -  (3-16) 

The  full  Green's  function  is  then  again  given  by  (3.3),  with  the  self-energy  as  in  (3.4). 
The  low-momentum  region  produces  a  divergent  integral,  because  of  (3.9),  so  that  the 
conclusion  ZH  =  1  is  still  valid.  The  analysis  of  the  correlation  function  follows  in  a 
manner  similar  to  that  done  for  case  la,  and  leads  to  a/,  =  1/2. 
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Bouchaud  et  al  [20]  applied  standard  perturbation  theory  to  their  equations  (3.13). 
They  did  not  meet  the  infrared  singularity  described  above,  since  this  phenomenon  can 
only  appear  through  treatments  which  go  beyond  strict  perturbation  theory,  such  as  our 
self-consistent  resummation  procedure.  Hence  the  conclusion  that  the  model  (3.13)  is  free 
(EW-like)  on  a  perturbative  basis  is  incorrect. 

To  close  up  this  analysis,  we  come  back  to  our  case  1.  We  observe  that  the  transfer  term 


T  =  -i/(Vfc)_  -  Ap(V/i)+  (3.17) 

ught  of  as 
G-functi 
for  the  transfer  term 


can  be  thought  of  as  a  formal  infinite  series  by  invoking  a  suitable  representation  for 
Heaviside's  G-function.  We  are  thus  led  to  consider  the  following  more  general  structure 


T  =  -i/VA  -  XpVh  -  V"  vn(Vh}n+l  -  pV  An(V/z)"+1.  (3.18) 

-^  f 


n=l  n=l 

Dimensional  analysis  along  the  lines  of  ref.  [12]  shows  that,  among  the  terms  in  the  sums, 
only  i>i(Vh}2  is  relevant  in  one  space  dimension.  Now  consider  the  calculation  of  S/,(fc), 
within  the  above  self-consistent  one-loop  approach.  Two  contributions  are  to  be  taken 
into  account.  We  have  already  dealt  with  one  of  them  in  (3.4).  That  forces  z/,  =  1, 
independently  of  a/,.  The  other  contribution  is  of  the  KPZ  variety  [14],  for  which  a 
symmetry  property,  referred  to  as  Galilean  invariance,  requires  ah  +  Zh  =  2  [18,19].  This 
in  turn  forces  a/,  =  1  and  fa  =  1. 

The  numerical  work  [12]  for  case  1  suggests  a/,  «  1  and  fa  «  1/2,  consistent  with 
Zh  ~  2.  One  possible,  although  rather  exotic,  explanation  for  the  difference  between  the 
analytical  prediction  ZH  =  1  and  the  numerical  estimate  Zh  «  2  is  the  phenomenon 
observed  recently  [23]  in  a  suitably  regularized  mean-field  version  of  the  KPZ  problem. 
These  authors  have  shown  that  the  dynamics  of  the  approach  to  the  stationary  state  is 
much  slower  than  the  linear-response  dynamics  in  the  stationary  state. 

Finally,  for  our  cases  4  and  6,  which  bear  a  close  resemblance  to  the  models  studied  by 
Krug  and  Spohn  [19],  the  present  work  yields  ap  =  1/2  and  ih  —  1,  as  was  indeed  found 
numerically  in  [12].  The  situation  of  case  5  was  already  discussed  in  §  2. 

4.  Discussion 

In  the  above,  we  have  presented  analytical  insights  on  the  behaviour  of  a  system  of 
coupled  noisy  nonlinear  equations  put  forward  earlier  [4,  12]  as  a  model  of  sandpile 
dynamics.  The  anomalous  roughening  seen  in  case  1,  in  particular,  is  seen  to  be  due 
predominantly  to  the  effect  of  the  term  Ap(V7t)+;  this  has  a  simple  physical  explanation 
in  the  context  of  the  observed  agreement  of  our  results  with  the  experimental  work  of 
Kurnaz  et  al  [24].  The  authors  of  that  work  found,  for  the  case  of  a  rotated  and  shaken 
sandpile,  exponents  ah  =  0.92  ±0.05  and  fa  =  0.48  ±0.16  (in  close  agreement  with 
case  1  in  our  table),  and  attributed  this  to  the  effects  of  interstitial  fluids  in  their  system. 
The  effect  of  the  Xp(Vh)+  term  is  precisely  to  put  back  "roughness"  on  the  profile  of  the 
pile  at  a  rate  proportional  to  the  density  p  of  mobile  grains.  We  see  therefore  that  our 
analysis  above,  among  other  things,  provides  a  rather  appealing  picture  of  an  experi- 
mental scenario. 
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Table  1.  Critical  exponents  a,  (3,  and  z  for  both  fields  h(x}  t}  and  p(x,  t),  measured 
from  numerical  simulations  of  the  coupled  stochastic  equations  in  the  various  cases 
described  in  text  (after  ref.  [12]).  The  exact  EW  and  KPZ  values  are  recalled  for 
comparison.  A  hyphen  means  that  the  corresponding  quantity  is  in  our  opinion  not 
critical.  An  asterisk  means  that  the  exponent  z  cannot  be  accurately  evaluated  from  the 
available  data  on  a  and  /?. 


Model 

Species 

Oi 

& 

z 

EW 

h 

1/2 

1/4 

2 

KPZ 

h 

1/2 

1/3 

3/2 

,.,.  asymmetric 

h 

0.94  ±  0.07 

0.43  ±0.04 

2.2  ±0.4 

noise  in  h 

t* 

0.22  ±  0.08 

0.07  ±0.02 

* 

f.  N  no-tilt 
(la)       .     .             . 
noise  in  hand  in  p 

h 
P 

0.36  ±  0.02 
0.39  ±  0.05 

0.41  ±  0.08 
0.27  ±  0.04 

0.9  ±  0.2 
1.4  ±0.4 
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Abstract.  Nonlinear  fluctuating  hydrodynamic  (NFH)  models  for  relaxation  in  the  supercooled 
liquid  are  considered.  Recent  results  on  self  consistent  mode  coupling  theory  for  the  slow  relaxation 
of  density  fluctuations  are  analyzed  to  explain  the  glassy  dynamics.  The  relaxation  mechanisms  for 
different  types  of  models  with  and  without  wave  vector  dependences  are  discussed.  For  the 
schematic  models  where  all  wave  vector  dependences  are  dropped  a  sequence  of  time  scales  enters 
the  relaxation  process.  For  the  non-ergodicity  parameter  very  close  to  the  ideal  transition  point  is 
scaled  by  an  exponent  equal  to  1/2.  This  is  demonstrated  here  through  an  analysis  of  the  mode- 
coupling  equations  for  the  wave  vector  dependent  models  that  follow  from  equations  of  NFH. 

Keywords.     Glass  transition;  ergodicity;  correlation  function. 
PACS  Nos    64.70;  05.60;  64.60 

1.  Introduction 

In  the  last  ten  years  a  good  amount  of  progress  has  been  achieved  on  the  understanding  of 
the  dynamics  of  supercooled  liquids  using  models  derived  from  the  nonlinear  fluctuating 
hydrodynamic  equations  for  the  compressible  fluid  [1,2].  In  these  models  the  transport 
coefficients  for  the  supercooled  liquid  are  obtained  in  terms  of  self-consistent  expressions 
of  the  hydrodynamic  correlation  functions.  This  constitute  a  feedback  mechanism  [3, 4] 
giving  rise  to  slow  relaxation  in  the  supercooled  liquid.  In  the  self  consistent  mode 
coupling  models  for  glass  transition,  the  feed  back  effects  from  the  terms  involving  the 
slowly  decaying  density  fluctuations  are  analyzed  assuming  that  they  produce  the 
dominant  contribution  at  supercooled  states.  In  this  paper  we  will  focus  our  discussion  on 
the  nonlinear  hydrodynamic  models,  specially  with  structural  effects  taken  into  account. 
The  present  form  of  the  mode-coupling  theory  (MCT)  for  supercooled  liquid  dynamics 
was  initiated  from  the  work  of  Leutheusser  by  showing  that  a  model  obtained  from  the 
kinetic  theory  of  dense  fluids  exhibits  an  ergodic-nonergodic  transition  which  has 
features  similar  to  a  liquid-glass  transition.  The  model  leads  to  a  viscosity  which  diverges 
as  (T  -  7b)~a  as  the  temperature  T  approaches  the  ideal  glass  transition  TQ.  The  exponent 
a  was  found  to  take  values  a  ~  2.  Later  it  was  demonstrated  [5]  that  the  sharp  transition 
is  cutoff  due  to  a  mechanism  arising  from  coupling  of  density  and  current  fluctuations  in 
a  compressible  fluid  and  this  keeps  the  system  ergodic  at  all  temperatures.  However  the 
analysis  done  in  ref.  [5]  also  showed  that  the  feed  back  mechanism  from  the  coupling  of 
density  fluctuations  does  cause  a  substantial  enhancement  of  the  viscosity.  Thus  although 
the  sharp  transition  is  cutoff  there  are  some  strong  remnants  of  it  with  a  qualitative 
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change  in  the  supercooled  liquid  dynamics  around  a  temperature  TO  higher  than  the  usual 
glass  transition  temperature.  Indeed  such  behavior  was  observed  in  a  number  of  liquids 
[6,7]  termed  as  the  fragile  liquids.  In  a  number  of  subsequent  works  [7]  the  cutoff 
mechanism  responsible  for  ergodic  behavior  at  the  longest  time  scale  in  the  supercooled 
liquid  was  simply  ignored  and  the  relaxation  behavior  around  the  point  which  involves  a 
sharp  transition  from  ergodic  to  non  ergodic  behavior  was  analyzed.  The  simplest 
example  of  this  type  is  the  Leutheusser  model  which  consider  a  schematic  form  of  the 
mode-coupling  theory  with  all  q  dependence  being  dropped,  but  only  the  main 
characteristics  of  the  dynamic  instability  being  preserved.  A  sequence  of  time  scales  enter 
the  relaxation  process  in  the  schematic  model.  However,  when  the  full  wave  vector 
dependence  in  the  models  are  taken  into  account  ,  through  a  realistic  static  structure 
factor  this  aspect  changes  somewhat.  We  discuss  these  in  the  present  work  by  presenting 
a  simple  analysis  of  the  wave  vector  dependent  model  here. 

We  will  divide  the  discussion  mainly  into  two  parts  namely  the  idealized  models  with  a 
sharp  transition  and  the  fully  self-consistent  model  in  which  there  is  no  sharp  transition  in 
the  dynamic  behavior  of  the  liquid.  In  §2  we  first  discuss  the  nonlinear  fluctuating 
hydrodynamic  equations  for  a  compressible  fluid  with  a  simple  choice  of  conserved 
densities  as  slow  variables  and  discuss  how  the  mode  coupling  model  is  obtained  from 
these  equations.  In  §  3,  we  discuss  the  idealized  model  which  has  a  dynamic  instability 
giving  rise  to  the  so  called  glass  transition  and  discuss  the  various  time  scales  that  enter 
the  relaxation  process.  In  the  next  section  we  discuss  the  fully  self  consistent  model  with 
the  cutoff  mechanism  for  restoring  ergodicity  and  discuss  some  recent  results  regarding 
the  nature  of  the  kernel  for  the  cutoff  mechanism.  We  end  the  paper  with  a  small 
discussion. 

2.  Fluctuating  hydrodynamics  of  compressible  liquid 

In  the  present  work  we  will  consider  the  set  of  hydrodynamic  variables  [5]  as  the 
mass  density  p,  the  momentum  density  g,  and  the  flow  velocity  v  defined  through  the 
nonlinear  constraint 

g  =  pv.  (1) 

The  equations  of  motion  for  the  hydrodynamic  variables,  are  obtained  using  the  well 
known  Zwanzig-Mori  [8]  formalism  and  are  valid  for  small  and  finite  wavelengths.  The 
equation  for  p  is  given  by 

dp 

^  =  -V-g  (2) 

and  that  for  g  is  the  generalized  nonlinear  Navier  Stokes  equation  with  thermal  noise 

-vj^+e,,       (3) 

where  Ftt[p(x}]  is  the  potential  energy  part  of  the  effective  Hamiltonian  F  defined  [9]  as 

(4) 
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Following  the  usual  forms  common  in  the  density  functional  theories,  Fu  is  taken  as  an 
expansion  of  an  inhomogeneous  equilibrium  liquid 

(j3m)Fu[p(x)}  =  I  dXp(x}{ln[p(x)/p0}  -  1},  +FM,  (5) 

«/ 

where  the  first  term  is  the  ideal  gas  entropy  term  and  the  interaction  term  Fjnt  to  lowest 
order  can  be  obtained  (up  to  a  constant)  as 

FM  =  ~~  t  d3jtdVc(2>  (x  -  xl}8p(x}8p(x'}  (6) 

Lm  J 

with  8p(x,  t)  =  p(x,  t)  —  pQ  and  /3  =  l/kBT-c(x)  is  the  equilibrium  two  particle  correlation 
function  for  the  liquid.  In  an  isotropic  fluid  the  bare  transport  matrix  Ly(x)  is  related  to 
the  Gaussian  noise  ©,-  through  the  fluctuation  dissipation  relation 


,(x',  t'))  =  2kBTLy(x)8(x  -  x?)8(t  -  1'}.  (7) 

For  an  isotropic  fluid  the  bare  transport  matrix  L//(x)  or  its  Fourier  transform 

Lg(q)=  f  tee^Lgtx)  (8) 

»/ 

can  be  expressed  in  terms  of  two  independent  transport  coefficients  given  by 

%(<?)  =  q*tf*(q)  +  [4%  -  OTW(*).  (9) 

In  the  small  wavenumber  limit,  r°(#)  and  if  (q)  are  the  bare  longitudinal  and  shear 
viscosities  respectively.  Since  we  will  be  applying  these  equations  for  finite  wavelengths 
here,  more  generalized  expressions  for  these  quantities  are  obtained  with  the  use  of 
Enskog  type  models  [10].  These  reflect  the  short  time  properties  referring  to  uncorrelated 
random  collisions  in  the  system. 

In  order  to  investigate  the  effects  the  nonlinearities  in  the  hydrodynamic  equations  will 
have  on  the  transport  properties  of  the  fluid,  a  field  theory  of  the  Martin-Siggia-Rose 
(MSR)  [11]  type  was  used  in  ref.  [5].  The  advantage  of  using  the  MSR  field  theory  here  is 
that  the  renormalized  expressions  for  the  various  quantities  are  obtained  in  a  self 
consistent  manner  in  terms  of  the  full  correlation  functions  and  is  very  useful  in 
demonstrating  the  feedback  mechanism  that  results  in  slow  relaxation  at  supercooled 
densities.  The  field  theoretic  treatment  of  the  problem  is  done  following  the  techniques 
developed  in  the  theories  of  dynamical  critical  point  phenomena  and  for  this  the 
interested  reader  is  referred  to  the  papers  cited  under  ref.  [11].  For  the  analysis  as  applied 
to  the  theory  of  the  compressible  fluids  see  ref.  [5,  12].  In  the  present  discussion  we  give  a 
very  brief  account  of  the  steps  used  in  obtaining  the  mode  coupling  models  as  applied  for 
supercooled  liquids.  The  fully  renormalized  theory  of  the  hydrodynamic  correlation 
functions  are  obtained  in  terms  of  the  self  energy  matrix  S  defined  through  the  Dyson 
equation 

GT^q.w)  =  Gb-'Cq,")  -  S(q,o;),  (10) 

where  GO  refers  to  the  matrix  of  correlation  functions  obtained  from  the  equations  of 
linearized  hydrodynamics.  The  main  quantity  of  interest  here  is  the  density  auto 
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^},  (11) 

where  the  angular  bracket  refers  to  the  average  over  the  stationary  states.  In  ref.  [5]  the 
following  form  for  the  Fourier-Laplace  transform  of  Gpp(x,  t)  normalized  with  respect  to 
its  equal  time  value  is  obtained  in  the  small  q  and  u)  limit 


Here  £l2  =  q2[(3mS(q}}~1  and  DLR(q,z)  is  the  renormalized  longitudinal  viscosity.  The 
Laplace  transform  of  <p(t)  is  defined  as 

/oo 
dfeto#/),     Im(z)>0.  (13) 

Similarly  the  Laplace  transform  for  the  transverse  current  fluctuation  (normalized  with 
respect  to  its  equal  time  value)  is  given  by 


where  rf(q,z)  is  the  renormalized  shear  viscosity.  In  the  formulation  of  the  MSR  type 
field  theory  the  renormalized  memory  kernels  on  the  R.H.S  of  (12)  and  (14)  have  the 
mode  coupling  contributions  at  the  one  loop  level  respectively  given  by 


(27T) 

xGpp(q-MGpp(k,0  (15) 

and 


(16) 

where  X0  =  (2/3m4/90)  and  u  —  q-k  while  q  is  the  unit  vector  along  the  direction 
of  q.  The  quantity  7(9,2)  in  the  R.H.S  of  (12)  arises  from  the  coupling  between 
the  density  fluctuations  and  current  fluctuations  in  a  compressible  fluid.  In  the  asymp- 
totic limit  when  the  viscosity  become  large  due  to  the  feedback  coming  from  mode 
coupling  contributions,  the  density  autocorrelation  function  given  by  (12)  develops  a 
pole  at 

z  +  h(q,z)=Q.  (17) 

Thus  ergodicity  is  restored  over  a  time  scale  of  1/7(0,  0),  representing  the  longest  time 
scale  in  the  relaxation  of  the  density  correlation  function.  However,  in  the  simple  model 
where  all  the  nonhydrodynamic  corrections  coming  from  time  scales  of  the  order  of  q2 
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NFH  in  supercooled  liquids 

are  ignored  the  simplified  model  has  a  dynamic  instability  similar  to  the  liquid-glass 
transition.  In  the  next  section  we  will  consider  this  model  first  to  demonstrate  the  several 
relaxation  processes  that  can  be  included  in  the  context  of  mode  coupling  theory  and  the 
consequences  of  a  proper  wave  vector  dependence  in  the  theory. 

3.  Effects  of  the  feedback  mechanism  and  sequence  of  time  scales  of  relaxation 

3.1  Schematic  models  without  wavevector  dependence 

In  this  section  we  consider  the  simple  model  with  the  sharp  instability  showing  a 
transition  to  an  ideal  glassy  phase.  In  this  regard  several  different  types  of  models 
have  been  investigated.  First  we  consider  the  schematic  models  where  all  the  wave 
vector  dependences  in  the  mode  coupling  kernels  for  transport  coefficients  are  dropped 
and  the  density  autocorrelation  function  0(z)  following  (12)  is  given  by  the  functional 
equation 


where  Q.Q  is  a  microscopic  frequency.  In  the  schematic  models  the  renormalized 
expression  for  the  viscosity  DL(z)  is  obtained  as 


Jo 


where  DQL  is  the  bare  viscosity  governing  the  microscopic  dynamics.  In  studying  the 
feedback  mechanism  for  slow  relaxation  in  the  supercooled  liquid  the  memory  kernel  H 
is  expressed  as  a  functional  of  the  density  correlation  function  <f>(t)  [7,  13]  given  by 

^(0.  (20) 


n=l 

In  the  model  considered  by  Leutheusser  H[<f>]  =  cify2.  The  more  general  model  with 
c\  ~  0  was  first  introduced  in  ref.  [1].  The  importance  of  the  linear  term  was  first  pointed 
out  by  Goetze  [14].  In  the  above  expressions  for  the  memory  kernel  all  wave  vector 
dependences  are  assumed  to  be  weak  and  are  suppressed.  The  basic  assumption  in  the 
analysis  of  the  equation  (18)  is  that  depending  on  the  the  kernel  H[4>]  i.e.  the  coefficients 
cns  there  is  a  time  range  over  which  (f>(t)  is  approximately  time  independent  following  the 
form 


where/  is  the  value  of  4>(t)  in  some  metastable  state  which  is  yet  to  be  specified.  For  time 
scales  where  the  inequality  \z4>v(z)\  <C  1  is  valid,  we  obtain  to  leading  order  from 
equation  (18) 

(t}}  =  0     (22) 


I  —  f 

/].  (23) 


We  have  used  the  notation  H"(f]  =  (d2H(f})/df2  etc.  and  Lz  stands  for  the  Laplace 
transform  of  the  function  in  the  argument  as  defined  in  equation  (13)  with  variable  z.  An 
ideal  metastable  state  is  obtained  when  both  AQ  and  AI  are  zero  giving  a  solution  for  the 
decaying  function  $v(t}.  We  can  determine/  by  setting  AI  =  0.  This  is  obtained  for 


with  C  is  a  constant  not  depending  on  /.  At  this  condition  A0  =  (C  -  !)/(!  —  /).  An 
ideal  state  is  obtained  when  for  a  given  set  of  cn  we  have  Ao(cn)  =  0.  Thus  the  ideal 
transition  takes  place  with  for  C  =  1  giving  AQ  —  0.  For  the  Leutheusser  model  we  obtain 
that  /  =  1/2  and  c^  has  a  critical  value  of  c\  =  4.  For  the  model  by  Goetze  [14]  we  have 
linear  term  in  the  kernel,  i.e. 


(25) 
with  a  line  of  critical  values  satisfying 

(26) 


For  higher  order  models  there  are  critical  surfaces.  Note  that  with  AI  =  0  the  equation 
(22)  can  be  written  as 

("  =  o.  (27) 


For  the  high  frequency  region  where  the  term  proportional  to  A0  can  be  neglected  i.e. 

the  above  equation  reduces  to  the  simple  form, 

zfilfc)  ~  A£[$;(f)]  =  0  (29) 

with  A  =  ((1  -/)3/2C)  #"(/).  This  equation  is  satisfied  by  the  function 

Q)    \  t 1   — *  A  \  t &tif\  i  f^f\\ 

The  exponent  a  is  given  in  terms  of  the  parameter  A  by  the  equation 

F(l  -  2a)  =  A)  <31) 

where  T  is  the  gamma  function.  The  inequality  (28)  and  the  solution  (30)  obviously  holds 
in  the  time  region 

"/2B-  (32) 
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The  time  scale  ra  diverges  at  the  ideal  transition  point  i.e.  A0  =  °-  Therefore  <j>(i)  decays 
algebraically  towards  the  metastable  value  /.  For  A0  ^  0  i.e.  points  away  from  the 
transition  the  dynamics  is  quite  different  for  the  cases  A0  >  0  and  ^o  <  °-  For  ^o  >  °' 
equation  (27)  has  the  solution  $v(z)  ~  A^2/*  as  z  -»  0  and 

/=/o  +  c(A0)1/2  +  0(Ao),  (33) 

where  /0  is  the  value  of/  at  the  ideal  transition  point  and  c  is  a  constant.  For  A0  <  0  we 
have  from  equation  (27)  a  solution  for  tf>v(z)  which  is  more  singular  that  1  /z  as  z  :  -*  0  and 
has  the  following  form  in  the  time  regime 

&(0  -  -B(t/ra}\  (34) 

where  5  is  a  positive  constant.  Similar  to  a  in  (30),  b  satisfies  the  equation 

A  (35) 

v   ' 


Equation  (34)  is  referred  to  as  the  von-Schweidler  relaxation  law.  However  this  relaxation 
eventually  violates  the  inequality  \Z(t>v(z)\  <  1  and  this  decay  mechanism  is  valid  in  the 
region 

Ta  <  t  <  Ta,  (36) 

where  ra  is  given  by 

ra  =  |A0r(1^+1/^.  (37) 

For  /  >  ra,  the  system  enters  into  the  a  relaxation  regime.  In  these  regime  a  fully  analytic 
solution  of  the  mode  coupling  equations  is  still  not  available.  Numerical  solution  for  a  set 
of  parameters  [13]  has  shown  that  the  solution  is  well  fit  by  a  stretched  exponential 

0(,)=/e-«/<  (38) 

The  final  decay  of  the  correlation  function  takes  place  over  the  time  scale,  given  by  1/7. 
We  show  in  figure  1  the  sequence  of  times  scales  that  enter  the  relaxation  process  in  the 
mode  coupling  models  that  uses  the  schematic  forms  for  the  memory  kernel  in  terms  of 
the  density  correlation  function. 

3.2  Effects  of  including  proper  structure  factor 

In  this  case  we  consider  the  set  of  equations  (for  different  q  values), 


where  the  memory  kernel  representing  the  transport  behavior  is  given  by 

DL(q,  Z}  =  Dl(q]  +  Qj  f°°  d,e^[0(^  ^  (40) 

JO 
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/n  t 

Figure  1.  Schematic  plot  of  the  sequence  of  time  scales  in  the  relaxation  predicted 
by  MCT;  (a)  power-law  decay  rQ,  (b)  von  Schweidler  relaxation  -5/*,  (c)  primary 
relaxation  e~WT)  ,  (d)  exponential  relaxation  e~7'. 

where  DQL(q)  is  the  bare  viscosity  [15]  governing  the  microscopic  dynamics  over  length 
scale  corresponding  to  wave  vector  q.  The  function  Hq[<f>]  now  depends  on  all  the  values 
of  (/)  at  different  q  values.  In  general  we  can  have  the  kernel  in  the  form 


^ 


E 


(41) 


In  the  model  obtained  using  the  nonlinear  fluctuating  hydrodynamics  described  in 
equation  (15)  V^  happens  to  be  zero.  Subsequently,  Kim  [16]  has  considered  models  of 
fluctuating  hydrodynamics  that  obtains  a  term  in  the  kernel  linearly  proportional  to  the 
density  correlation  function.  We  look  for  a  solution  of  the  above  equations  in  the  form 


,  0  = 


(42) 


If  fq  happens  to  be  a  nonzero  set  of  values  beyond  a  certain  density  then  this  corresponds 
to  an  ideal  non  ergodic  state  where  the  density  correlation  function  decays  to  a  finite 
value  instead  of  decaying  to  zero.  Following  an  analysis  similar  to  what  is  described  in 
the  previous  section  for  the  schematic  model,  we  obtain  an  equation 


dHq 

dfk 


+ 


'ndfkdfK      l~fq 


with 


(43) 
=  0 

(44) 
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Figure  2.    The  long  time  limit  of  the  density  correlation  function  fq,  computed  from 
the  simple  model  at  the  density  n*  =  0.99. 


As  in  the  other  case  here  also  we  determine  the/^'s  by  choosing  the  term  linear  in  <£„  to  be 
vanishing.  This  gives  rise  to  the  condition  that 

(45) 


«  -1   _   f    , 

1        J4 

where  Cq  are  constants.  Consequently,  with  this  choice  we  can  write 


^  =  1^7-  (46) 

For  the  ideal  transition  point  one  has  the  condition  Aj  =  0  i.e.  Cq  =  1.  Thus  we  obtain 
the  following  condition  fork's, 


fa  = 


Hn 


Hq+l 


(41) 


In  figure  2  we  show  the  solution  for/?  that  is  obtained  using  the  model  described  by  the 
kernel  in  equation  (15).  For  the  density  n*  =  no3  below  0.99  fqs'  all  are  zero  showing  that 
the  dynamic  instability  signifying  transition  to  the  nonergodic  phase  takes  place  at  this 
density.  In  obtaining  this  equation  we  get  the  static  structure  factor  or  the  direct 
correlation  function  that  appear  in  the  mode  coupling  integrals  using  the  Percus  Yevick 
(PY)  solution  [10]  for  the  hard  sphere  system.  For  the  points  away  from  the  transition  we 
see  that  with  Ag  >  0  the  function  <f>v(q,z)  ~  (Ag)1/2/z-  Thus  the  nonergodicity 
parameter  giving  the  strength  of  the  1/z  pole  in  the  density  correlation  function  has 
the  behavior 

/.=/!  +  cfl€1/2  +  0(C)1  W 
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Figure  3.  The  nonergodicity  parameter  [fq  -  f%]  vs.  e  for  (a)  qa  =  4.00;  (b)  qcr  = 
7.16.  The  dashed  line  is  for  the  exponent  |  and  the  solid  line  for  a  q  dependent 
exponent. 

where  e  denotes  the  distance  from  the  ideal  transition  point.  Such  a  behavior  was  also 
found  by  numerical  solution  of  the  integral  equations  (47)  giving/9's.  In  figures  3(a)  and 
3(b)  we  show  the  fit  for  a  power  law  with  exponent  1/2  (dashed  line)  and  a  wave  vector 
dependent  exponent  (solid  line)  for  two  different  values  of  the  wave  vector  qa  =  4.00  and 
7.16  respectively.  For  distances  (in  temperature  or  density)  from  the  transition  which  are 
very  small  the  square  root  law  seems  to  work  but  for  large  e  this  seems  to  break  down. 
This  square  root  law  with  the  non  ergodicity  parameter  is  indeed  seen  in  a  polymeric 
liquid  [17]  where  the  transition  point  is  found  to  be  TO  =216  K,  about  35  K  above  the 
phenomenological  glass  transition  point.  This  observation  has  been  viewed  as  important 
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4.  The  self-consistent  model  with  the  ergodicity  restoring  mechanism 

If  the  quantity  7(2,  z)  is  maintained  then  the  system  shows  ergodicity  over  long  time 
scales.  In  ref.  [5]  it  was  demonstrated  that  the  quantity  j(q,  z)  can.be  obtained  in  terms  of 
the  hydrodynamic  correlation  functions  by  analyzing  the  self  energy  matrix  element  EV/VJ 
introduced  in  (10)  in  the  form 

/dk 
—-3  ([uf  C^(q  -  k,  r)  +  (1  -  «?)Gj,(q  -  k,  f)]Gw(k,  0 
27T 


(49) 

with  definition  u  =  k-q  and  MI  =  #(q  —  k)/|q  —  k|  and  the  superscripts  L  and  T 
respectively  refer  to  longitudinal  and  transverse  parts  of  the  corresponding  quantities  in 
the  isotropic  fluid.  This  self  consistent  expression  for  7  was  also  used  [12]  for  obtaining  a 
close  set  of  functional  equations  for  the  density  and  current  correlation  functions.  The 
time  scales  of  relaxation  that  followed  from  numerical  solution  of  these  equations 
demonstrated  good  agreement  with  computer  simulation  results  [19].  Indeed  if  7  would 
self-consistently  reduce  to  very  small  values,  the  supercooled  dynamics  is  pushed  to  very 
long  times.  Hence  the  exact  form  of  the  kernel  7(2,  z)  representing  the  cutoff  mechanism 
for  eliminating  the  glass  transition  singularity  is  important.  In  ref.  [12]  this  issue  was 
considered  through  a  self-consistent  solution  of  (39)  and  (49).  In  figure  4  we  show  the 
behavior  of  the  static  longitudinal  viscosity  Z)L(0,  0)  =  f  with  the  reduced  density  in  a 
hard  sphere  system.  Initially  the  viscosity  shows  a  power  law  increase  (with  exponent 
close  to  2)  but  for  higher  densities  die  sharp  transition  is  cutoff.  This  work  demonstrated 
that  the  relaxation  time  increases  by  two  to  three  orders  of  magnitude  showing  a  change 
in  the  dynamics  but  it  did  not  give  rise  to  any  diverging  time  scales  around  this  mode 
coupling  singularity. 

The  question  regarding  the  behavior  of  the  quantity  7  i.e.  how  small  it  can  be  pushed  to 
produce  extremely  slow  relaxation  has  also  been  investigated.  This  is  done  by  using  a 
lower  cutoff  time  in  the  computation  of  the  mode  coupling  effects.  The  argument  for  this 
cutoff  time  is  as  follows:  the  mode  coupling  contribution  to  the  memory  function 
computes  effects  due  to  correlated  collisions  or  cooperative  dynamics  in  the  supercooled 
liquid.  The  long  time  and  short  time  contributions  to  the  viscosity  can  be  separated 
according  to 

t  roo 

,t)t  (50) 


rto  r 

=  /     dfe'*DT(<?,0+  / 
JO  Jt0 


where  the  time  to  sets  the  scale  beyond  which  cooperative  dynamics  is  effective.  The  first 
integral  on  the  right  side  of  (50)  represents  short  time  dynamics  and  is  taken  into  account 
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Figure  4.     Square  root  of  the  inverse  of  the  normalized  longitudinal  viscosity  f  vs. 
density.  The  dashed  line  shows  the  power  law  fit  for  intermediate  densities. 

through  the  bare  transport  coefficients.  The  mode  coupling  contribution  to  the  viscosity  is 
to  a  good  approximation  given  by  approximating  it  by  the  second  integral  in  (50)  and 
ignoring  the  first  one  since  the  bare  transport  coefficient  already  takes  short  time 
dynamics  into  account.  For  supercooled  densities  the  same  cutoff  time  is  included  in  the 
expression  for  the  kernel  7  [20, 13].  The  cutoff  time  t0  in  the  theory  can  be  estimated  by 
fitting  the  value  of  the  integrand  with  the  computer  simulation  results  on  transport 
coefficients  [15,21].  For  a  hard  sphere  system  good  agreement  with  simulation  data  near 
freezing  densities  were  obtained  with  ?o  =  lOfe  where  tE  is  the  Enskog  time  [22]  referring 
to  the  time  scale  for  short  time  dynamics  for  the  hard  sphere  liquid.  For  supercooled 
densities  however,  the  mode  coupling  contribution  to  the  viscosity  is  not  very  sensitive  to 
to,  but  7  does  get  influenced  by  the  cutoff  time.  In  a  number  of  recent  works  [23]  the 
experimental  data  for  relaxation  in  supercooled  systems  has  been  used  to  obtain  an 
estimation  of  the  cutoff  function.  In  this  respect  the  computation  of  the  cutoff  function 
from  a  fully  self-consistent  approach  is  relevant.  It  has  been  observed  that  even  if  the 
short  time  part  of  the  dynamics  is  excluded  from  the  mode  coupling  contribution  through 
fo,  the  self-consistent  calculation  does  not  produce  any  drastic  change  in  the  relaxation 
time  as  is  seen  in  the  strong  liquids.  In  a  number  of  recent  works  [23]  the  cutoff 
mechanism,  smoothing  off  the  ideal  glass  transition  in  the  framework  of  mode  coupling 
theory,  has  been  incorporated  with  the  use  of  experimental  data.  The  strength  of  the  cutoff 
function  is  estimated  there  through  an  analysis  of  the  data.  In  ref.  [16]  Kim  investigated 
the  behavior  of  the  cutoff  function  in  a  schematic  model  where  all  q  dependences  are 
dropped  from  the  model  and  obtained  an  expression  of  the  form 


7 


r 
= 

JtQ 


(51) 
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NFH  in  supercooled  liquids 

It  was  found  [16]  that  if  ft  was  chosen  sufficiently  large  (f0  =  SQo"1),  7  is  greatly 
suppressed.  Therefore  the  suppression  of  7  is  obtained  once  the  cutoff  time  exceeds  the 
microscopic  value.  In  the  fully  q  dependent  theory  we  see  a  qualitatively  similar  but  less 
enhanced  effect.  With  reasonable  approximations  for  ?o  relevant  for  the  short  time 
dynamics,  does  not  give  rise  to  the  very  strong  enhancement  of  the  viscosity.  It  does 
indicate  a  slowing  down  of  the  relaxation  over  two  to  three  orders  of  magnitudes  over  a 
temperature  or  density  range  higher  than  the  glass  transition  temperature.  This  is  similar 
to  what  has-been  termed  in  literature  [24]  as  fragile  liquids.  It  does  not  however  give  rise 
to,  within  sensible  approximations,  in  the  framework  of  a  self  consistent  calculation  the 
very  sharp  increase  in  viscosity  seen  in  the  strong  liquids.  Thus  the  basic  results  for  the 
relaxation  time  does  not  change  drastically  with  the  introduction  of  the  cutoff  time  t$  in 
the  framework  of  the  fully  self-consistent  calculation. 


5.  Discussion 

In  this  paper  we  discuss  the  nonlinear  fluctuating  hydrodynamic  equations  for 
compressible  liquid,  and  how  they  were  used  for  obtaining  the  mode  coupling  models 
for  the  supercooled  liquid  dynamics.  The  theory  demonstrated  existence  of  a 
characteristics  temperature  TO  higher  than  the  calorimetric  glass  transition  temperature 
TQ  such  that  within  a  narrow  temperature  range  around  TQ  there  is  a  freezing  out  of  the 
large  scale  structural  rearrangements  involving  collective  motion  of  many  molecules.  At 
this  temperature  there  is  a  qualitative  change  in  the  collective  dynamics  in  the  liquid 
although  there  is  no  sharp  glass  transition  characterized  by  a  diverging  viscosity.  The 
temperature  TQ  [27, 7, 23]  is  a  signature  of  the  dynamic  singularity  due  to  mode  coupling 
effects.  In  a  careful  analysis  [5]  of  the  NFH  equations  it  was  demonstrated  that  ergodicity 
is  maintained  at  all  densities.  In  the  Das  and  Mazenko  analysis  the  role  of  the  \/p 
nonlinearities  appearing  in  the  equations  were  implemented  in  the  field  theoretic  analysis 
through  the  constraint  g  =  pv.  The  cutoff  mechanism  responsible  for  the  absence  of  the 
sharp  transition  is  a  direct  consequence  of  this  [29].  Ergodicity  was  also  demonstrated 
[25]  in  the  asymptotic  dynamics  obtained  in  similar  mode  coupling  models  obtained  from 
microscopic  approaches.  In  a  subsequent  work  Schimitz  De  and  Dufty  [26]  (hereafter 
referred  to  as  SDD)  has  also  considered  the  equations  of  nonlinear  fluctuating 
hydrodynamics  to  obtain  a  self  consistent  mode  coupling  theory  for  supercooled  liquids 
extending  it  to  short  wavelengths.  The  analysis  presented  by  these  authors  demonstrate 
the  absence  of  a  sharp  transition  to  an  ideal  glassy  phase  similar  to  the  earlier  work  by 
Das  and  Mazenko.  In  the  work  by  SDD  the  role  of  the  nonlinearities  in  the  fluctuating 
hydrodynamic  equations  are  investigated  with  the  underlying  microscopic  dynamics 
being  constrained  by  the  detailed  balance  condition.  The  authors  show  that  special 
nonlinearities  of  density  p  and  momentum  field  g  which  appear  in  the  continuity  equation 
to  maintain  detailed  balance,  do  eliminate  a  complete  structural  arrest  that  would  have 
occurred  if  only  coupling  of  density  fluctuations  were  considered.  In  a  recent  work  [28]  it 
was  demonstrated  that  both  the  works  gives  the  identical  result  for  the  final  relaxation 
process.  The  absence  of  the  sharp  glass  transition  in  the  mode  coupling  models  is  linked 
to  the  fact  that  with  the  increase  of  density  the  kernel  7  [5]  does  not  self  consistently 
reduce  to  zero  and  the  densirv  correlation  function  decavs  tn  zero  in  the  lone  time  limit. 


Indeed  such  an  effect  is  also  observed  in  a  wide  range  of  systems  showing  a  qualitative 
change  in  dynamics  of  the  supercooled  liquid  around  a  characteristics  temperature  higher 
than  the  calorimetric  glass  transition  temperature  although  there  is  no  sharp  transition  to 
an  ideal  glassy  phase.  7  however  gets  small,  indicating  a  two/three  orders  of  magnitude 
rise  in  the  value  of  transport  coefficients.  In  ref.  [30]  the  authors  have  used  a  sophisticated 
form  of  scaling  in  a-relaxation  of  supercooled  liquids.  From  the  measurements  of  the 
dielectric  susceptibility  of  a  variety  of  glass  forming  liquids  they  observed  that  the  data 
for  all  the  sample  liquids  studied  can  be  scaled  so  that  they  fall  on  top  of  one  another  over 
13  decades  of  frequency.  The  curve  is  described  by  two  parameters,  the  peak  frequency  z/p 
and  the  normalized  width  W  (with  respect  to  the  Debye  width  =  1.14)  of  the  imaginary 
part  of  the  dielectric  susceptibility.  While  the  observed  scaling  does  fit  very  well  to  the 
stretched  exponential  form  for  low  frequencies  it  deviates  significantly  from  the  stretched 
exponential  form  on  the  high  frequency  side.  In  a  subsequent  analysis  by  Kirn  and 
Mazenko  [131  demonstrated  the  existence  of  a  cross  over  temperature  in  the  data  of  Nagel 
et  al  by  plotting  inverse  of  the  peak  frequency  z/p  with  temperature.  The  high  frequency 
data  again  fits  well  to  the  form  (T  -  T0)2  and  T0  =  270  K  for  salol.  These  authors  have 
also  argued  that  the  high  frequency  tail  observed  in  Nagel's  scaling  curve  is  a  realization 
of  the  von-Schweidler  relaxation  in  the  mode  coupling  theory  if  one  assumes  a  relation 
between  the  exponent  0  of  stretched  exponential  relaxation  with  the  exponent  b  of  von- 
Schweidler  relaxation.  Such  a  relation  does  not  however  follow  from  the  mode  coupling 
models  obtained  from  simple  nonlinear  hydrodynamic  models.  Thus  while  the  mode 
coupling  models  has  led  to  considerable  progress  in  understanding  the  sequence  of  time 
scales  associated  with  relaxation  near -glass  transition,  there  remain  still  significant 
questions  for  further  research. 
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